This notebook cleans and analyzes data from our first Mechanical Turk user test.
It uses data found here as a CSV. Much more needs to be done to streamline the cleaning and analysis of this data, but this notebook serves as a starting point for further investigation.
example
Inspecting
Our dataframe columns are now in the format Step #, Slide # (e.g. s4s3).
We're missing an s4s4 column, which we should have. Let's see why.
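One way to confirm which columns are absent is to compare the dataframe against the names we expect. This is a minimal sketch: the helper `missing_columns`, the toy dataframe, and the assumption that step 4 has four slides are all illustrative and not taken from the real data.

```python
import pandas as pd

def missing_columns(df, expected):
    """Return the expected column names that are absent from df, in order."""
    return [name for name in expected if name not in df.columns]

# Toy frame standing in for the real data; the values are made up.
df = pd.DataFrame({"s4s1": ["a"], "s4s2": ["b"], "s4s3": ["c"]})

# Assumed naming scheme: s<step>s<slide>, with four slides in step 4.
expected = [f"s4s{slide}" for slide in range(1, 5)]
missing = missing_columns(df, expected)
print(missing)
```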
Looks like s4s3 contains titles from both s4s3 and s4s4. We can unpack that later.
Cleaning
Step 1
- Coerce values from strings to floats
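The coercion can be sketched with `pd.to_numeric`. The helper name `coerce_to_float` and the sample values are assumptions for illustration. Passing `errors="coerce"` turns anything that cannot be parsed into NaN rather than raising, which may or may not be the behavior wanted for the real data.

```python
import pandas as pd

def coerce_to_float(series):
    """Convert a Series of strings to floats; unparseable values become NaN."""
    return pd.to_numeric(series, errors="coerce").astype(float)

# Made-up sample values; the last one is deliberately not a number.
raw = pd.Series(["1", "2.5", "4", "oops"])
cleaned = coerce_to_float(raw)
print(cleaned)
```

Counting the NaN values afterwards (`cleaned.isna().sum()`) gives a quick check on how many responses failed to parse.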
Steps 2-5
- Remove slide 4 titles from s4s3 results
- Build a new Series for slide 4 results from the lists of titles that should not be in slide 3
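The two steps above could look roughly like the following. This is a sketch under stated assumptions: each cell of s4s3 is assumed to hold a list of title strings, and the set `slide4_titles` is a hypothetical lookup of the titles that belong to slide 4. The real notebook may identify slide 4 titles differently.

```python
import pandas as pd

def split_slides(cells, slide4_titles):
    """Split each row's list of titles into slide 3 and slide 4 Series."""
    # Titles that are not slide 4 titles stay with slide 3.
    slide3 = pd.Series(
        [[t for t in titles if t not in slide4_titles] for titles in cells],
        index=cells.index,
    )
    # Titles that are slide 4 titles move to the new Series.
    slide4 = pd.Series(
        [[t for t in titles if t in slide4_titles] for titles in cells],
        index=cells.index,
    )
    return slide3, slide4

# Made-up rows: each cell mixes slide 3 and slide 4 titles.
df = pd.DataFrame(
    {"s4s3": [["Title A", "Title X"], ["Title B"], ["Title Y", "Title A"]]}
)
slide4_titles = {"Title X", "Title Y"}  # hypothetical slide 4 titles

df["s4s3"], df["s4s4"] = split_slides(df["s4s3"], slide4_titles)
print(df)
```

Keeping the original index on both Series means the new s4s4 column lines up row by row with the cleaned s4s3 column.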