🛡️ Insurance Fraud Detection using Machine Learning

This project is focused on building a machine learning model to detect fraudulent financial transactions based on patterns in the data.

📌 Problem Statement

To detect and classify insurance claims as fraudulent or genuine using supervised learning techniques. Insurance frauds are often hidden and lead to huge financial losses in the insurance sector.
The goal is to train a machine learning model to recognize patterns that typically indicate fraud.

🧠 Technologies & Libraries Used

Python
Pandas
NumPy
Scikit-learn
Jupyter Notebook
Matplotlib / Seaborn (for data visualization)

📊 Project Workflow

Data Loading & Preprocessing
- Handled missing/null values
- Feature selection and encoding
- Scaling and normalization
Exploratory Data Analysis
- Distribution plots
- Class imbalance analysis
- Correlation matrix
Model Building
- Used Logistic Regression for classification
- Split data into training and test sets
- Fit the model and evaluated results
Evaluation Metrics
- Accuracy Score
- Confusion Matrix
- Precision & Recall

🚀 How to Run

Clone this repository:

git clone https://github.com/Adilkhan6465/fraud_detection_project.git

📈 Results

Accuracy Achieved: 93.12%
The model shows strong performance in identifying fraudulent transactions.
Further tuning and use of ensemble models may improve performance slightly.

folder structure

fraud-detection-project/

├── data/ --> all datasets │ ├── insurance_data.csv │ ├── fraud data FY 2023-24.csv │ └── feature_engineered_fraud_data.csv │ ├── models/ --> save trained model │ └── fraud_detection_model.pkl │ ├── scripts/ --> (training, preprocessing etc.) │ ├── model_training.py │ ├── data_preprocessing.py │ ├── feature_engineering.py │ └── app.py │ ├── test_api.py --> API test ├── requirements.txt --> Python libraries list ├── README.md -->

🔮 Future Improvements

Implement advanced models like Random Forest, XGBoost, or SVM
Handle class imbalance using SMOTE or undersampling
Create a web interface using Flask/Streamlit for real-time predictions
Add model explainability (SHAP/LIME)

👨‍💻 Author

Adil Khan
GitHub: github.com/Adilkhan6465

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.vscode		.vscode
data		data
docs		docs
models		models
notebooks		notebooks
scripts		scripts
Fraud data FY 2023-24 for B&CC.csv		Fraud data FY 2023-24 for B&CC.csv
README.md		README.md
app.log		app.log
import pandas as pd.py		import pandas as pd.py
insurance_data.csv		insurance_data.csv
requirements.txt		requirements.txt
test_api.py		test_api.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ Insurance Fraud Detection using Machine Learning

📌 Problem Statement

🧠 Technologies & Libraries Used

📊 Project Workflow

🚀 How to Run

📈 Results

folder structure

🔮 Future Improvements

👨‍💻 Author

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🛡️ Insurance Fraud Detection using Machine Learning

📌 Problem Statement

🧠 Technologies & Libraries Used

📊 Project Workflow

🚀 How to Run

📈 Results

folder structure

🔮 Future Improvements

👨‍💻 Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages