Thank you for joining me today. I've been working on a comprehensive data analysis, and I'm excited to present the insights and findings I've uncovered. I'll be sharing the results and key takeaways towards the end of this presentation, so please stay tuned for the EDA results.
Regards, Bryan Ronquillo
Information
ORDERNUMBER: A unique identifier for each order. QUANTITYORDERED: The quantity of items ordered in each transaction. PRICEEACH: The price of each item in the order. ORDERLINENUMBER: A line number indicating the item's position within the order. SALES: The total sales amount for the order. ORDERDATE: The date on which the order was placed. DAYS_SINCE_LASTORDER: The number of days since the last order was placed. STATUS: The current status of the order (e.g., Shipped, Pending). PRODUCTLINE: The category or line of the product ordered. MSRP: Manufacturer's Suggested Retail Price for the product. PRODUCTCODE: A unique code identifying the product. CUSTOMERNAME: The name of the customer who placed the order. PHONE: The phone number of the customer. ADDRESSLINE1: The primary address line for the customer's location. CITY: The city in which the customer is located. POSTALCODE: The postal code for the customer's address. COUNTRY: The country of the customer. CONTACTLASTNAME: The last name of the contact person for the customer. CONTACTFIRSTNAME: The first name of the contact person for the customer. DEALSIZE: The size of the deal (e.g., Small, Medium, Large).
Importing Data
Data
Now we will check how many rows and columns and what type of Dtype we have
Numerical columns
Categorical columns
Descriptive Summary
Descriptive Numerical
Descriptive Category
Cleaning
Missing Values
Duplicated Values
We have observed that there are no missing values or duplicate values in the dataset.
Univariate Analysis
Now, we'll perform univariate analysis to extract insights from each column in our data frame.
Univariate Analysis for Categories
Univariate Analysis for Numericals
Bivariate Analysis
Correlation
Yearly, Quarterly, Monthly, weekly Sales Trend
by Year
by Month
by Quarterly
by Weekly
Results and Insights
Key Statistics: - On average, each order consists of 35 units. - The typical price for one unit is about $101. - The average revenue per order amounts to around $3,553.
Missing Data: - No data is missing or incomplete in any of the dataset columns.
Top-Performing Product Categories: - Classic cars are the top-selling items, with total sales nearing $3.84 million. - Following closely are vintage cars, trucks, and buses.
Sales Patterns Over Time: - Notable increases in sales occurred in November 2018 and November 2019, indicating a possible seasonal pattern.
Best-Selling Products: - The highest-selling product is 'S18_3232,' generating approximately $284,249 in sales. - Other popular items include 'S10_1949,' 'S12_1108,' 'S10_4698,' and 'S18_2238.'
Customer Distribution by Country: - The United States has the most customers, followed by Spain, France, Australia, and the United Kingdom.
Average Sales per Order Line: - Average sales vary based on the order line number, with line 17 having the highest average sales ($3,800.60) and line 18 the lowest ($2,443.72).
Average Unit Price by Product Category: - Classic cars have the highest average price per unit ($115.20), while trains have the lowest average price ($84.11).
Sales Distribution by Deal Size: - Medium-sized deals contribute the most to sales, totaling $5,931,231.47, followed by small and large deals.
Average Days Since Last Order by Country: - The Philippines has the longest average time between orders (2080 days), while Switzerland has the shortest average (1362.94 days).
Number of Unique Customers by Country: - The USA has the highest count of unique customers (32), followed by France (12). Ireland, the Philippines, and Switzerland each have only one unique customer.
Order Status Breakdown: - Most orders are marked as "Shipped" (2541), with "Cancelled" (60) and "Resolved" (47) being less common.
Top Products in Each Category: - For instance, in the classic cars product line, 'S18_3232' is the best-selling item, with sales reaching $284,249.02.
Correlation between Quantity Ordered and Sales: - There is a moderate correlation of 0.55 between the quantity of products ordered and the total sales amount.
Recommendations to the stakeholders
Optimize Sales During Peak Months: Given the observed significant increases in sales in November 2018 and November 2019, it may be beneficial to plan special promotions or marketing campaigns during these months to capitalize on the potential seasonal trend. This can help maximize revenue during the high-demand periods.
Product Focus: Since classic cars are the top-selling product line, consider expanding the product range within this category or promoting related products to further boost sales. Additionally, continue to monitor and analyze the performance of 'S18_3232,' as it's the best-selling product, and explore strategies to maintain or enhance its sales.
Market Expansion: While the United States has the highest number of customers, consider further expanding into Spain, France, Australia, and the United Kingdom, which also have significant customer bases. Targeted marketing efforts and customer engagement strategies in these regions can help increase sales.
Deal Size Strategy: Since medium deals constitute the majority of sales, you might want to explore ways to increase the average deal size. This could involve bundling related products or offering discounts for larger orders to encourage customers to buy more.
Product Pricing: Given the variation in average price per unit across product lines, consider evaluating pricing strategies for products with lower average prices to potentially increase profit margins. Conversely, for high-priced products like classic cars, ensure that pricing remains competitive to maintain sales.
Customer Engagement: Monitor customer engagement and satisfaction, especially in countries with longer average days since the last order, like the Philippines. Implement customer retention strategies to encourage repeat business and reduce the time between orders.
Order Status Analysis: Investigate the reasons behind the relatively high number of "Cancelled" orders (60). Identify common issues leading to cancellations and implement measures to reduce them, such as improved customer support or order fulfillment processes.
Data-Driven Decision-Making: Continue to analyze correlations and trends in your data to make informed decisions. Invest in predictive analytics to forecast future sales and identify potential growth opportunities or areas requiring improvement.
Inventory Management: Given the moderate correlation between quantity ordered and sales, consider optimizing inventory levels to match demand. Overstocking or understocking can impact both revenue and costs.
Data Security and Privacy: Ensure that data security and customer privacy are maintained, especially as you expand into new markets. Comply with local data protection regulations to build trust with customers.