Class balancing framework for credit card fraud detection based on clustering and similarity-based selection (SBS)

Ahmad, Hadeel H.; Kasasbeh, Bassam; Aldabaybah, Balqees; Rawashdeh, Enas

doi:10.1007/s41870-022-00987-w

research-article

Author(s): Hadeel Ahmad ¹, Bassam Kasasbeh ¹, Balqees Aldabaybah ¹, Enas Rawashdeh ²

Publication date (Electronic): 21 June 2022

Journal: International Journal of Information Technology

Publisher: Springer Nature Singapore

Keywords: Under-sampling technique, Fuzzy C-means, Credit card fraud detection, Machine learning, Unbalanced dataset

Abstract

Credit card fraud is a growing problem, and it escalated during the COVID-19 pandemic as authorities in many countries required people to use cashless transactions. Every year, billions of euros are lost to fraudulent credit card transactions, so fraud detection systems are essential for financial institutions. Because the classes are not equally represented in credit card datasets, machine learning algorithms fit the model mainly to the majority class, which leads to inaccurate fraud predictions. In this research, we therefore focus on processing unbalanced data with an under-sampling technique to obtain more accurate results with different machine learning algorithms. We propose a framework that clusters the dataset using fuzzy C-means and then selects similar fraud and normal instances that share the same features, which guarantees integrity across the data features.
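The abstract does not state the number of clusters, the similarity measure, or the exact selection rule, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes Euclidean distance as the similarity measure, hard cluster assignment by maximum fuzzy membership, and one-to-one matching of each fraud instance with its closest unused normal instance in the same cluster. The function names `fuzzy_c_means` and `similarity_based_undersample` are hypothetical, and the data is synthetic.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters=2, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Plain fuzzy C-means. Returns (centers, U), where U has shape (n, c)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)  # memberships of each point sum to 1
    for _ in range(n_iter):
        W = U ** m
        # Cluster centers are membership-weighted means of the points.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U


def similarity_based_undersample(X, y, n_clusters=2, seed=0):
    """Keep every fraud row (y == 1) plus, for each one, the closest unused
    normal row (y == 0) from the same cluster. Assumed rule, see lead-in."""
    _, U = fuzzy_c_means(X, n_clusters=n_clusters, seed=seed)
    labels = U.argmax(axis=1)  # hard assignment from fuzzy memberships
    keep = []
    for c in range(n_clusters):
        fraud = np.flatnonzero((labels == c) & (y == 1))
        normal = np.flatnonzero((labels == c) & (y == 0))
        available = np.ones(normal.size, dtype=bool)
        for i in fraud:
            keep.append(i)
            if not available.any():
                continue  # no normal rows left in this cluster to pair with
            d = np.linalg.norm(X[normal] - X[i], axis=1)
            d[~available] = np.inf  # each normal row is used at most once
            j = d.argmin()
            available[j] = False
            keep.append(normal[j])
    keep = np.array(sorted(keep))
    return X[keep], y[keep]


# Synthetic, illustrative data: 200 normal rows and 10 fraud rows.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0, 1, (200, 3)), rng.normal(2, 1, (10, 3))])
y = np.array([0] * 200 + [1] * 10)
X_bal, y_bal = similarity_based_undersample(X, y)
print(np.bincount(y_bal))  # class counts after under-sampling
```

Under these assumptions every fraud instance is retained and at most one normal instance is kept per fraud instance, so the resulting subset is balanced or close to it. A different similarity measure or matching rule would change which normal instances survive.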

Author and article information

Contributors

Bassam Kasasbeh:

ORCID: http://orcid.org/0000-0002-3240-3002

b_kasasbeh@asu.edu.jo

Journal

Journal ID (nlm-ta): Int J Inf Technol

Journal ID (iso-abbrev): Int J Inf Technol

Title: International Journal of Information Technology

Publisher: Springer Nature Singapore (Singapore)

ISSN (Print): 2511-2104

ISSN (Electronic): 2511-2112

Publication date (Electronic): 21 June 2022

Pages: 1-9

Affiliations

[1] Department of Computer Science, Applied Science Private University, Amman, 11931 Jordan (GRID grid.411423.1; ISNI 0000 0004 0622 534X)

[2] Department of Management Information Systems, Albalqa’ Applied University, Amman, 11931 Jordan (GRID grid.443749.9; ISNI 0000 0004 0623 1491)

Article

Publisher ID: 987

DOI: 10.1007/s41870-022-00987-w

PMC ID: 9209320

PubMed ID: 35757149

SO-VID: 2ae9b381-eefe-4d8a-a144-ab56d6bcd514

License:

This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

History

Date received : 14 February 2022

Date accepted : 2 May 2022

Funding

Funded by: Applied Science Private University (FundRef http://dx.doi.org/10.13039/100016624)
