Building graphs from credit card transactions
The dataset used in this chapter is the Credit Card Transactions Fraud Detection dataset available on Kaggle at the following URL: https://www.kaggle.com/kartik2112/fraud-detection?select=fraudTrain.csv. We will build two approaches for fraud detection, based on bipartite and tripartite graphs.
Overview of the dataset
The dataset is made up of simulated credit card transactions containing legitimate and fraudulent transactions for the period January 1, 2019 to December 31, 2020. It includes the credit cards of 1,000 customers performing transactions with a pool of 800 merchants. The dataset was generated using Sparkov data generation. More information about the generation algorithm is available at the following URL: https://github.com/namebrandon/Sparkov_Data_Generation.
For each transaction, the dataset contains 23 different features. In the following table, we will show only the information that will be used in this chapter...