Domain: Finance & Cybersecurity | Tools: Python, XGBoost, Scikit-learn, Matplotlib, Seaborn
This machine learning project investigates the detection of fraudulent credit card transactions using a real-world-style dataset. The goal was to develop and compare classification models and provide insights into transaction patterns and fraud detection techniques.
Three models were compared:
Model | Accuracy | ROC-AUC | F1 Score |
---|---|---|---|
Logistic Regression | 64% | 0.66 | 0.64 |
Random Forest | 97% | 0.997 | 0.97 |
Tuned XGBoost | 92% | 0.976 | 0.92 |
Top predictors included transaction type, amount, merchant ID, and time-of-day features.
The Random Forest and XGBoost models were serialized using Joblib for deployment in application as a Flask API. Future enhancements include real-time fraud scoring and user feedback loops.