This fast-track roadmap is designed for focused learners who want job-ready Data Analyst + Data Science skills in just 1.5 months. It blends theory with hands-on mini and major projects to build a strong portfolio.
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 1β2 | Python Basics | Syntax, Loops, Lists, Functions | Python | π§ͺ CLI Calculator |
| 3β4 | Python Intermediate | OOP, File Handling | Write Python scripts | π§ͺ CSV Data Cleaner |
| 5β6 | SQL Basics | SELECT, JOIN, Aggregations | SQL | π§ͺ Employee DB Queries |
| 7 | Linux CLI + Math | Bash Commands, Maths, Probability, Stats | Linux & Maths Basics | π§ͺ Linux Automation Script |
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 8β9 | NumPy | Arrays, Math, Indexing | NumPy | π§ͺ Weather Matrix |
| 10β11 | Pandas | DataFrames, Merging | Pandas | π§ͺ EDA on Titanic |
| 12β13 | Data Cleaning | Missing values, Outliers | Clean Housing Dataset | π§ͺ Clean Housing Data |
| 14 | Feature Engg. | Encoding, Scaling | sklearn.preprocessing | π§ͺ Feature Engg. for ML |
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 15β16 | Matplotlib/Seaborn | Bar Charts, Heatmaps, etc. | Matplotlib | π§ͺ Student Score Heatmap |
| 17β18 | Power BI / Tableau | Data modeling, Filters, Cards | Power BI | π§ͺ Sales Dashboard |
| 19β21 | Final Viz Project | Combine EDA + Viz | Build Presentation | β Retail Sales Insights Dashboard |
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 22β23 | Regression | Linear, Logistic | ML Course | π§ͺ House Price Prediction |
| 24β25 | Classification | KNN, Decision Tree | scikit-learn | π§ͺ Iris Classifier |
| 26β27 | Clustering | K-Means | Unsupervised Learning | π§ͺ Customer Segmentation |
| 28 | Eval & Tuning | Accuracy, ROC, GridSearchCV | Model Evaluation, Cross Validation | π§ͺ Model Evaluation on Fraud Dataset |
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 29β30 | Neural Nets | Activation, Layers, Forward | DL | π§ͺ Custom 3-layer NN |
| 31β32 | CNN | Image Classification | TensorFlow | π§ͺ MNIST Digit Classifier |
| 33β34 | LLMs/Transformers | Prompting, Tokenization | HuggingFace | π§ͺ Mini Q&A Chatbot |
| 35 | Final ML Project | Fraud Detection w/ Autoenc. | Build end-to-end model | β Credit Card Fraud Detection |
| Days | Focus Area | Topics / Tools | Resources / Task | Project |
|---|---|---|---|---|
| 36β37 | Flask | Create ML API | Flask | π§ͺ Flask App for Classifier |
| 38β39 | Docker | Build + Containerize | Docker | π§ͺ Dockerized ML App |
| 40β42 | MLOps + Cloud | MLFlow, AWS / GCP | MLOps | π§ͺ Track Model with MLFlow |
| 43β45 | Capstone Project | Domain: Retail/Health | Full E2E Pipeline | β Final Capstone Project Deployed |
| Type | Count |
|---|---|
| Mini Projects | 7β9 |
| Major Projects | 3 |
| Dashboards (BI) | 1 |
| Deployed Projects | 2 |
-
Credit Card Fraud Detection using Autoencoders in Keras | TensorFlow for Hackers Assess credit risk by applying probability distributions and statistical analysis on credit card data.This project employs autoencodersβa type of neural networkβto detect fraudulent credit card transactions. By learning the patterns of normal transactions, the autoencoder can identify anomalies that may indicate fraud. This approach leverages unsupervised learning, making it effective even when labeled fraudulent data is scarce.
-
Predictive Maintenance using Machine Learning by Medini Kumar Bora This project focuses on predicting equipment failures before they occur, allowing for timely maintenance and reduced downtime. By analyzing historical sensor data and operational metrics, machine learning models can forecast potential issues, optimizing maintenance schedules and extending equipment lifespan.
-
Customer Segmentation using K Means: A Step by Step Guide by Leonardo A This project utilizes K-Means clustering to segment customers based on shared characteristics. By identifying distinct customer groups, businesses can tailor their marketing strategies, improve customer satisfaction, and enhance overall profitability. The iterative process of training, testing, and tweaking ensures the model's effectiveness.
-
Harvestify Github Harvestify is a machine learning-based website that assists farmers by recommending the best crops to cultivate, suitable fertilizers, and diagnosing crop diseases. By inputting specific parameters, users receive data-driven suggestions, promoting efficient farming practices and potentially increasing yields.
-
Machine Learning for Retail Demand Forecasting by Samir Saci This project compares different methods for forecasting retail demand, specifically contrasting XGBoost models with rolling mean approaches. Accurate demand forecasting enables retailers to optimize inventory levels, reduce stockouts, and enhance customer satisfaction. The study provides insights into the effectiveness of machine learning techniques in retail settings.
- Follow the roadmap step by step π₯
- Work on real-world projects & open source
- Deploy your models on cloud platforms
- Keep learning MLOps & scaling techniques
π Happy Learning!