2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An autonomous mixed data oversampling method for AIOT-based churn recognition and personalized recommendations using behavioral segmentation

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The telecom sector is currently undergoing a digital transformation by integrating artificial intelligence (AI) and Internet of Things (IoT) technologies. Customer retention in this context relies on the application of autonomous AI methods for analyzing IoT device data patterns in relation to the offered service packages. One significant challenge in existing studies is treating churn recognition and customer segmentation as separate tasks, which diminishes overall system accuracy. This study introduces an innovative approach by leveraging a unified customer analytics platform that treats churn recognition and segmentation as a bi-level optimization problem. The proposed framework includes an Auto Machine Learning (AutoML) oversampling method, effectively handling three mixed datasets of customer churn features while addressing imbalanced-class distribution issues. To enhance performance, the study utilizes the strength of oversampling methods like synthetic minority oversampling technique for nominal and continuous features (SMOTE-NC) and synthetic minority oversampling with encoded nominal and continuous features (SMOTE-ENC). Performance evaluation, using 10-fold cross-validation, measures accuracy and F1-score. Simulation results demonstrate that the proposed strategy, particularly Random Forest (RF) with SMOTE-NC, outperforms standard methods with SMOTE. It achieves accuracy rates of 79.24%, 94.54%, and 69.57%, and F1-scores of 65.25%, 81.87%, and 45.62% for the IBM, Kaggle Telco and Cell2Cell datasets, respectively. The proposed method autonomously determines the number and density of clusters. Factor analysis employing Bayesian logistic regression identifies influential factors for accurate customer segmentation. Furthermore, the study segments consumers behaviorally and generates targeted recommendations for personalized service packages, benefiting decision-makers.

          Related collections

          Most cited references38

          • Record: found
          • Abstract: found
          • Article: not found

          A Systematic Review on Imbalanced Data Challenges in Machine Learning : Applications and Solutions

          In machine learning, the data imbalance imposes challenges to perform data analytics in almost all areas of real-world research. The raw primary data often suffers from the skewed perspective of data distribution of one class over the other as in the case of computer vision, information security, marketing, and medical science. The goal of this article is to present a comparative analysis of the approaches from the reference of data pre-processing, algorithmic and hybrid paradigms for contemporary imbalance data analysis techniques, and their comparative study in lieu of different data distribution and their application areas.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Customer retention, loyalty, and satisfaction in the German mobile cellular telecommunications market

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Comparing Oversampling Techniques to Handle the Class Imbalance Problem: A Customer Churn Prediction Case Study

                Bookmark

                Author and article information

                Contributors
                Journal
                PeerJ Comput Sci
                PeerJ Comput Sci
                peerj-cs
                PeerJ Computer Science
                PeerJ Inc. (San Diego, USA )
                2376-5992
                2 January 2024
                2024
                : 10
                : e1756
                Affiliations
                [1 ]Department of Computer Science, Comsats University Islamabad, Attock Campus Pakistan , Attock, Punjab, Pakistan
                [2 ]Big Data Research Center, Jeju National University , Jeju, Korea
                [3 ]Department of Computer Engineering, Jeju National University , Jeju Special Self-Governing Province, South Korea
                [4 ]Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University , Riyadh, Saudi Arabia
                Author information
                http://orcid.org/0000-0001-8737-2154
                Article
                cs-1756
                10.7717/peerj-cs.1756
                10773761
                933ab931-1845-48aa-83d1-275966f31515
                © 2024 Fatima et al.

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

                History
                : 9 June 2023
                : 23 November 2023
                Funding
                Funded by: Princess Nourah bint Abdulrahman University Researchers Supporting Project number
                Award ID: PNURSP2024R407
                Funded by: Princess Nourah bint Abdulrahman University
                Funded by: National Research Foundation of Korea (NRF)
                Award ID: 2022H1D3A2A02055024
                Funded by: Creative Research Project
                Award ID: RS-2023-00248526
                The work is supported and funded by the Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R407) and the Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Dr. Salabat is working for National Research Foundation of Korea (NRF) under the Brain Pool Program (Grant No. 2022H1D3A2A02055024) and Creative Research Project (ID: RS-2023-00248526). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Artificial Intelligence
                Data Mining and Machine Learning
                Data Science
                Internet of Things

                customer segmentation and churn prediction,hyper-parameters optimization,mixed data over-sampling,personalized recommendations,automl based oversampling,aiot

                Comments

                Comment on this article