This is a Pytorch implementation of the model in the paper "Forecasting Credit Default Risk with Graph Attention Networks".
pytorch
pytorch-geometric
-
Download the datasets at here.
-
Run the data processing code:
python process.py
-
Run the credit default risk prediction code:
python myGAT.py
process.py
: This file is used to process the three raw datasets and output relevant attributes for future credit default risk prediction tasks.
- Input: "application_train.csv", "bureau.csv", "credit_card_balance.csv"
- Output: "one.csv"、"binary.csv", "r1_onehot.csv", "d1_onehot.csv", "l1_onehot.csv", "r1.csv", "l1.csv"
A&D_distance.Rmd
: This file is used to calculate distances between mixed data by Ahmad & Dey method.
- Input: "r1.csv", "l1.csv"
- Output: "ahmad_r1.csv", "ahmadl1.csv"
myGAT.py
: This file is the core implementation of the prediction model.
- Input: "r1_onehot.csv", "d1_onehot.csv", "l1_onehot.csv", "ahmad_r1.csv", "ahmadl1.csv", "binary.csv"
- Output: "pre.csv"