Assessment: Module "Moving from VBA to Python Pandas"
Assessment:
For this assessment, you will be provided with a large dataset containing sales transactions from a retail company. Your task is to perform various data analysis tasks using Pandas to provide insights into the company's sales performance.
Tasks:
- Load the dataset into a Pandas DataFrame.
- Perform data cleaning and preprocessing as necessary.
- Calculate the total revenue for each month.
- Calculate the average revenue per transaction for each month.
- Calculate the total revenue for each product category.
- Identify the top-selling products and product categories.
- Create visualizations to present your findings.
Dataset:
The dataset contains the following columns:
- TransactionID: unique ID for each transaction
- CustomerID: ID for the customer who made the transaction
- ProductID: ID for the product sold
- ProductCategory: category of the product sold
- TransactionDate: date of the transaction
- TransactionAmount: total amount of the transaction
You can download the dataset from the following link: https://www.kaggle.com/kyanyoga/sample-sales-data
Note: The dataset contains 1 million rows, so make sure your computer has enough memory to handle it. If your computer is not powerful enough, you can use a smaller subset of the dataset for your analysis
Comments
Post a Comment