This project builds a backend AI-based model to classify network traffic as either benign or attack using supervised machine learning techniques. It processes CSV network flow data, trains a Random Forest model, and outputs a trained model for further use.
| File | Description |
|---|---|
app.py |
Complete code for data cleaning, preprocessing, model training, evaluation, and saving the trained model |
cleaned_dataset.csv |
Cleaned dataset used for model training |
ai_threat_intelligence_model.pkl |
Trained machine learning model |
scaler.pkl |
Trained scaler object used for feature scaling |
.gitignore |
Files/directories excluded from version control |
requirements.txt |
Required dependencies to run the project |
The dataset used in this project is publicly available on Kaggle:
π IDS Intrusion CSV Dataset - Kaggle
File used: 02-14-2018.csv
After downloading, place
02-14-2018.csvinside the project root directory.
- Load and clean the dataset
- Handle missing values and irrelevant columns
- Perform label encoding
- Conduct exploratory data analysis (EDA)
- Train a Random Forest Classifier
- Evaluate model performance
- Save trained model and scaler
- Python
- pandas
- numpy
- seaborn
- matplotlib
- scikit-learn
- joblib
git clone https://github.com/yourusername/ai-threat-intelligence.git
cd ai-threat-intelligence