ARTICLE
TITLE

Methods of Handling Unbalanced Datasets in Credit Card Fraud Detection

SUMMARY

Nowadays fraudulent transactions of every type represent a major concern in the financial industry due to the total amount of money that are lost every year. Manually analyzing fraudulent transactions is unfeasible if we think at the huge amount of data and the complexity of bank fraud in the digitization era. In this context, the problem to detect the fraud can be achieved by machine-learning algorithms due to their ability of detecting small anomalies in very large datasets. The problem that arise here is that the datasets are highly unbalanced meaning that the non-fraudulent cases heavily dominates the fraudulent ones. In this paper, we are going to present three ways of handling unbalanced datasets by: resampling methods (undersampling and oversampling), cost-sensitive training and tree algorithms (decision tree, random forest and Naïve Bayes), emphasizing the idea of why the Receiver Operating Characteristics curve (ROC) should not be used on this type of datasets when measuring the performance of the algorithm. The experimental test was applied on a number of 890,977 banking transactions in order to observe the performance metrics of all the three methods mentioned above.

 Articles related

I.K.A. Atmika,I.N. Sutantra,Agus S. Pramono    

Motorcycle in its operation needs high stability. To improve a stability of motorcycle especially in turning direction is done by many methods. The handling turn inclination angle of motorcycle with addition gyroscopic component will be discussed in this... see more



Appropriate airport ground handling service (AGHS) equipment vendor selection (AGHSEVS) can prevent aircraft damage and delays in airlines schedules, and ensure reliable and high-quality ground handling service. Previous research has seldom integrated mu... see more

Revista: Sustainability

Pujan K. Desai, Hubert Tseng and Glauco R. Souza    

There is a significant need for in vitro methods to study drug-induced liver injury that are rapid, reproducible, and scalable for existing high-throughput systems. However, traditional monolayer and suspension cultures of hepatocytes are difficult to ha... see more


Phil Glatz, Zhihong Miao and Belinda Rodda    

A literature review was undertaken to identify methods being used to handle and treat hatchery waste. Hatchery waste can be separated into solid waste and liquid waste by centrifuging or by using screens. Potential methods for treating hatchery waste on ... see more

Revista: Sustainability

Susilawati Susilawati,Eljawati Eljawati,Gradiana Tefa,Siti Nuraisyah Suwanda,Dadang Suwanda    

Garbage as a elementary problem of human life in forward territory, raises the handling urgency through providing performance of public service in hygine which is the success depends on leadership of a leader. This research uses quantitative methods with... see more