This project demonstrates an end-to-end Data Analytics workflow using Python, MySQL, and Power BI. The project includes data loading, data cleaning, exploratory data analysis (EDA), SQL-based analysis, dashboard creation, and business reporting.
The goal of this project is to extract meaningful insights from raw data and present them through interactive dashboards and reports for better decision-making.
-
Dataset Source: (customer_shopping_behavior.csv)
-
File Format: CSV / Excel
-
Total Records: (3900)
-
Features Included:
- Customer Information
- Product Details
- Sales Data
- Transaction History
- Ratings & Reviews
- Python
- Pandas
- MySQL
- Power BI
- Gamma (for presentation creation)
- Jupyter Notebook
- MySQL Workbench
- Imported dataset using Python
- Loaded data into Pandas DataFrame
- Checked data structure and column details
- Removed duplicate records
- Handled missing/null values
- Standardized column names
- Fixed inconsistent values
Performed detailed analysis to identify:
- Sales trends
- Customer behavior
- Product performance
- Revenue insights
- Correlation between variables
Used visualizations such as:
- Bar Charts
- Column Charts
- Donut Charts
Executed SQL queries for:
- Aggregations
- Subqueries
- Window Functions
- Customer segmentation
- Top-performing products analysis
Built interactive dashboards including:
- KPI Cards
- Sales Overview
- Customer Analysis
- Product Insights
- Trend Analysis
- Filters & Slicers
- Created business reports
- Generated presentation slides using Gamma
- Summarized key insights and recommendations
The Power BI dashboard provides:
- Interactive visualizations
- Real-time filtering
- Business performance tracking
- Data-driven insights for decision-making
Key Metrics:
- Total Sales
- Total Profit
- Customer Count
- Top Products
Some key insights identified from the analysis:
- Top-performing product categories
- High-value customer segments
- Seasonal sales trends
- Revenue growth opportunities
- Customer purchasing behavior patterns
The project helps businesses make informed decisions using data-driven insights.
Data-Analytics-Project/
│
├── Dataset/
├── Python Scripts/
├── SQL Queries/
├── Power BI Dashboard/
├── Reports/
├── Presentation/
└── README.mdgit clone <repository-link>pip install pandas numpy matplotlib seaborn mysql-connector-python- Open Jupyter Notebook
- Execute EDA and data cleaning scripts
- Create database in MySQL
- Import cleaned dataset
- Run SQL query files
- Open
.pbixfile in Power BI Desktop - Refresh data connections if required
- Add machine learning models
- Automate ETL pipeline
- Deploy dashboard online
- Integrate real-time data
Kapil Sanjay Mali Data Analyst | Python | SQL | Excel | Power BI LinkedIn: https://www.linkedin.com/in/kapil-mali/ GitHub: https://github.com/Kapilmali07