Skip to content

gorkbravo/Credit-Risk-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Credit-Risk-Analysis

This project aims to build a predictive model to assess credit risk using a dataset of financial and demographic variables. The notebook walks through the full pipeline from data preprocessing to model evaluation.

Project Overview

This analysis aims to evaluate the probability of default or financial distress of clients using machine learning techniques. The Data Used for this project can be found in Kaggle: https://www.kaggle.com/datasets/ranadeep/credit-risk-dataset/data

Features

  • Data loading and cleaning

  • Exploratory Data Analysis (EDA)

  • Feature engineering

  • Model Implementation (Binary Target)

    • Model training and evaluation (Logistic Regression, XGBoost)
    • Model performance metrics (Accuracy, ROC AUC, Confusion Matrix)
  • Model Implementation (Multi-class Target)

    • Model training and evaluation (XGBoost)
    • Model performance metrics (Accuracy, ROC AUC, Confusion Matrix)

Libraries used

  • Pandas
  • Scikit-learn
  • Matplotlib / seaborn
  • SHAP
  • Others

Future Improvements

  • Exploration of other models (LightGBM)
  • Hyperparameter Tuning
  • Others

Author

Gorka - @gorkbravo

About

Credit Risk Classification, using XGBoost primarily

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published