Assignment 1 Group 1
Throughout the code SQL queries are used to filter and select data. Also pandas is used to convert the data type of the columns with data from text to numeric and to filter/sort data to get insight in max/min/count of a dataset. The code fragments are repeated with different queries. Certain accounts are explored in more depth considering their relations. Where there were interesting findings it has been documented.
Connect to database file
Unique senders and receivers
It can be seen that the amount of senders and receivers differ from each other in the range of 4 million. Thus there are more senders than receivers.
Frequency of receiving per destination
Inspecting the data of C52983754
Frequency of sending per origin
Inspecting the outlier (C1286084959)
First the outlier is identified,