16
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Class balancing framework for credit card fraud detection based on clustering and similarity-based selection (SBS)

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Credit card fraud is a growing problem nowadays and it has escalated during COVID-19 due to the authorities in many countries requiring people to use cashless transactions. Every year, billions of Euros are lost due to credit card fraud transactions, therefore, fraud detection systems are essential for financial institutions. As the classes’ distribution is not equally represented in the credit card dataset, the machine learning trains the model according to the majority class which leads to inaccurate fraud predictions. For that, in this research, we mainly focus on processing unbalanced data by using an under-sampling technique to get more accurate and better results with different machine learning algorithms. We propose a framework that is based on clustering the dataset using fuzzy C-means and selecting similar fraud and normal instances that have the same features, which guarantees the integrity between the data features.

          Related collections

          Most cited references29

          • Record: found
          • Abstract: found
          • Article: not found

          SMOTE: Synthetic Minority Over-sampling Technique

          An approach to the construction of classifiers from imbalanced datasets is described. A dataset is imbalanced if the classification categories are not approximately equally represented. Often real-world data sets are predominately composed of ``normal'' examples with only a small percentage of ``abnormal'' or ``interesting'' examples. It is also the case that the cost of misclassifying an abnormal (interesting) example as a normal example is often much higher than the cost of the reverse error. Under-sampling of the majority (normal) class has been proposed as a good means of increasing the sensitivity of a classifier to the minority class. This paper shows that a combination of our method of over-sampling the minority (abnormal) class and under-sampling the majority (normal) class can achieve better classifier performance (in ROC space) than only under-sampling the majority class. This paper also shows that a combination of our method of over-sampling the minority class and under-sampling the majority class can achieve better classifier performance (in ROC space) than varying the loss ratios in Ripper or class priors in Naive Bayes. Our method of over-sampling the minority class involves creating synthetic minority class examples. Experiments are performed using C4.5, Ripper and a Naive Bayes classifier. The method is evaluated using the area under the Receiver Operating Characteristic curve (AUC) and the ROC convex hull strategy.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            A study of the behavior of several methods for balancing machine learning training data

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              MWMOTE--Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning

                Bookmark

                Author and article information

                Contributors
                b_kasasbeh@asu.edu.jo
                Journal
                Int J Inf Technol
                Int J Inf Technol
                International Journal of Information Technology
                Springer Nature Singapore (Singapore )
                2511-2104
                2511-2112
                21 June 2022
                : 1-9
                Affiliations
                [1 ]GRID grid.411423.1, ISNI 0000 0004 0622 534X, Department of Computer Science, , Applied Science Private University, ; Amman, 11931 Jordan
                [2 ]GRID grid.443749.9, ISNI 0000 0004 0623 1491, Department of Management Information Systems, , Albalqa’ Applied University, ; Amman, 11931 Jordan
                Author information
                http://orcid.org/0000-0002-3240-3002
                Article
                987
                10.1007/s41870-022-00987-w
                9209320
                35757149
                2ae9b381-eefe-4d8a-a144-ab56d6bcd514
                © The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management 2022

                This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

                History
                : 14 February 2022
                : 2 May 2022
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100016624, Applied Science Private University;
                Categories
                Original Research

                under-sampling technique,fuzzy c-means,credit card fraud detection,machine learning,unbalanced dataset

                Comments

                Comment on this article