Data Preprocessing Project Unit 1
After every step done to read and preprocess a dataset is the time to apply all the knowledge.
Exploratory Data Analysis: Game reviews from user on Steam
Steam is the world's most popular PC Gaming hub, with over 6,000 games and a community of millions of gamers. With a massive collection that includes everything from AAA blockbusters to small indie titles, great discovery tools are a highly valuable asset for Steam. How can we make them better? YES! Reviewing their games
First we are going to analyze the data set in order to understand it.
Here, we describe and see if there is some null variable on the data set
Data cleaning
In the review section, we can see that the only problem that this dataset has is the null values on the review column. Let's make something about it.
Let's make tidy this dataset
Conclusions
After this multiple processes, we can conclude that the user trend to judge the games during the early access games, and even if they are released they do not change the comment (based on the commentary date and the release date of the game), making report difficult and with a lack of sufficient information in order to deliver to the developer the information. Nevertheless, is important to understand what does the community wants and try to improve the user experience of each game, and which games have the biggest problem.
Project Part II: Visualization
After eliminated the null data, is time to get some order in the data set and make understandable the information.
This data set only have 48 games of the 6k+ that the platform has. Nevertheless, it is important to see if which game has the biggest