State vs State Covid-19 Analysis
Summary
In this project I compare pandemic statistics for Florida and South Carolina over the course of the pandemic. Written in R, this write-up shows how effective each state was at minimizing the risk from Covid-19. Variables include vaccinations, tests administered, confirmed cases, recoveries, and deaths.
Tags: R, Covid-19, Florida, South Carolina, Correlation, visualizations, t-squared test, time-series.
Association Rule Mining
Summary
In this assignment, I used association rules to mine transaction data. The technique is useful because it shows which items are often bought together. For this example I analyzed simple grocery store data. The results could help a business owner adjust product suggestions, move related items physically closer to one another in the store, and tailor ads to pick up extra sales.
Tags: R, text mining, association rules, frequent item purchase analysis, transaction data, POS.
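To make "items often bought together" concrete, here is a minimal base-R sketch of the three usual rule metrics (support, confidence, lift). The baskets and the helper names `support` and `rule_metrics` are hypothetical, made up for illustration; they are not the project's data or code, which may well have used a dedicated package instead.

```r
# Hypothetical toy baskets; NOT the grocery data used in the project.
baskets <- list(
  c("bread", "milk"),
  c("bread", "butter", "milk"),
  c("bread", "butter"),
  c("milk", "eggs"),
  c("bread", "butter", "eggs")
)

# Fraction of baskets that contain every item in `items`.
support <- function(items, baskets) {
  mean(vapply(baskets, function(b) all(items %in% b), logical(1)))
}

# Support, confidence, and lift for the rule {lhs} => {rhs}.
rule_metrics <- function(lhs, rhs, baskets) {
  s_both <- support(c(lhs, rhs), baskets)
  s_lhs  <- support(lhs, baskets)
  s_rhs  <- support(rhs, baskets)
  c(support    = s_both,
    confidence = s_both / s_lhs,
    lift       = s_both / (s_lhs * s_rhs))
}

m <- rule_metrics("bread", "butter", baskets)
print(round(m, 3))
```

A lift above 1 suggests the two items appear together more often than they would if purchases were independent, which is the kind of signal that would motivate placing them near each other.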
Google Store Application Analysis
Summary
The goal of the project was to explore the Google Store data and discover which types of apps bring in the most revenue. This would be useful for a tech company that wants to explore developing new applications as a new revenue stream. In this project, I demonstrated several tasks that often come up in data science projects, including working with unstructured/missing data, setting up a server to host a database, and running queries for analysis. My results showed that "Get Rich" apps returned the most money, while the lifestyle, finance, and family categories were typically the most successful.
Tags: R, product development, business analysis, data mining, SQLite, database storing, query language.
Temperature Forecasting
Summary
In this analysis, I break down regional temperature observations for the states of New York and California to forecast future temperatures for those areas. The processes involved include residual analysis, detrending, ARIMA model forecasting, and spatial trend analysis of time-series data.
Tags: R, forecasting, temperature, detrending, ARIMA, Spatial Analysis, time-series data.
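The detrend, fit, and forecast steps named above can be sketched in base R as follows. Everything here is an assumption made for illustration: the series is simulated (not New York or California observations), the trend-plus-seasonal regression and the AR(1) order are arbitrary choices, and the variable names are hypothetical. The spatial part of the analysis is not covered.

```r
set.seed(42)

# Simulated monthly temperatures: linear trend + yearly cycle + AR(1) noise.
# Stand-in data only; NOT the regional observations used in the project.
n     <- 120
t_idx <- seq_len(n)
noise <- as.numeric(arima.sim(model = list(ar = 0.6), n = n))
temp  <- 10 + 0.02 * t_idx + 8 * sin(2 * pi * t_idx / 12) + noise
obs   <- data.frame(t_idx = t_idx, temp = temp)

# Detrending: regress out the linear trend and the yearly cycle.
trend_fit <- lm(temp ~ t_idx + sin(2 * pi * t_idx / 12) + cos(2 * pi * t_idx / 12),
                data = obs)
detrended <- residuals(trend_fit)

# ARIMA(1,0,0) on the detrended series (the order is an assumption).
fit <- arima(detrended, order = c(1, 0, 0), include.mean = FALSE)

# Residual analysis: Ljung-Box test for leftover autocorrelation.
lb <- Box.test(residuals(fit), lag = 12, type = "Ljung-Box", fitdf = 1)

# Forecast 12 months ahead, then add the trend and cycle back in.
future         <- data.frame(t_idx = (n + 1):(n + 12))
arima_part     <- as.numeric(predict(fit, n.ahead = 12)$pred)
forecast_temps <- arima_part + as.numeric(predict(trend_fit, newdata = future))

print(round(forecast_temps, 1))
```

Modeling the trend separately keeps the ARIMA part focused on short-term dependence; differencing inside the ARIMA model would be a reasonable alternative.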
Movie Rating Sites Analysis
Summary
This was one of the first projects I did, and it stemmed from an interest in movies and the sites that rate them. As a hobby, I had built a large dataset listing movies and my own ratings for them. This made it easy to see which site's ratings were closest to my own. I found that my ratings were more similar to IMDB's than to Rotten Tomatoes'. One possible reason is the way each site computes its overall score, which can be misleading.
Tags: R, IMDB, Rotten Tomatoes, cross platform rating analysis, visualizations.
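One simple way to measure "closest to my own" is the mean absolute difference between personal ratings and each site's score after putting both on the same scale. The sketch below assumes personal and IMDB ratings out of 10 and Rotten Tomatoes scores as percentages; the five rows are invented placeholders, not values from the project's dataset, and the metric itself is an assumption about how the comparison could be done.

```r
# Hypothetical ratings for five unnamed movies; NOT the project's dataset.
ratings <- data.frame(
  mine = c(8, 6, 9, 4, 7),            # personal rating, out of 10
  imdb = c(7.8, 6.5, 8.6, 5.1, 7.2),  # assumed 0-10 scale
  rt   = c(95, 40, 88, 20, 90)        # assumed 0-100 percentage
)

# Put Rotten Tomatoes on the same 0-10 scale as the other columns.
ratings$rt <- ratings$rt / 10

# Mean absolute difference between my ratings and each site's score.
mad_by_site <- c(
  imdb = mean(abs(ratings$mine - ratings$imdb)),
  rt   = mean(abs(ratings$mine - ratings$rt))
)

closest_site <- names(which.min(mad_by_site))
print(round(mad_by_site, 2))
print(closest_site)
```

A smaller value means a site agrees more closely with the personal ratings; correlation could be reported alongside it to separate agreement in ranking from agreement in level.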