2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A novel and innovative cancer classification framework through a consecutive utilization of hybrid feature selection

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Cancer prediction in the early stage is a topic of major interest in medicine since it allows accurate and efficient actions for successful medical treatments of cancer. Mostly cancer datasets contain various gene expression levels as features with less samples, so firstly there is a need to eliminate similar features to permit faster convergence rate of classification algorithms. These features (genes) enable us to identify cancer disease, choose the best prescription to prevent cancer and discover deviations amid different techniques. To resolve this problem, we proposed a hybrid novel technique CSSMO-based gene selection for cancer classification. First, we made alteration of the fitness of spider monkey optimization (SMO) with cuckoo search algorithm (CSA) algorithm viz., CSSMO for feature selection, which helps to combine the benefit of both metaheuristic algorithms to discover a subset of genes which helps to predict a cancer disease in early stage. Further, to enhance the accuracy of the CSSMO algorithm, we choose a cleaning process, minimum redundancy maximum relevance (mRMR) to lessen the gene expression of cancer datasets. Next, these subsets of genes are classified using deep learning (DL) to identify different groups or classes related to a particular cancer disease. Eight different benchmark microarray gene expression datasets of cancer have been utilized to analyze the performance of the proposed approach with different evaluation matrix such as recall, precision, F1-score, and confusion matrix. The proposed gene selection method with DL achieves much better classification accuracy than other existing DL and machine learning classification models with all large gene expression dataset of cancer.

          Related collections

          Most cited references62

          • Record: found
          • Abstract: found
          • Article: not found
          Is Open Access

          Cancer statistics, 2023

          Each year, the American Cancer Society estimates the numbers of new cancer cases and deaths in the United States and compiles the most recent data on population-based cancer occurrence and outcomes using incidence data collected by central cancer registries and mortality data collected by the National Center for Health Statistics. In 2023, 1,958,310 new cancer cases and 609,820 cancer deaths are projected to occur in the United States. Cancer incidence increased for prostate cancer by 3% annually from 2014 through 2019 after two decades of decline, translating to an additional 99,000 new cases; otherwise, however, incidence trends were more favorable in men compared to women. For example, lung cancer in women decreased at one half the pace of men (1.1% vs. 2.6% annually) from 2015 through 2019, and breast and uterine corpus cancers continued to increase, as did liver cancer and melanoma, both of which stabilized in men aged 50 years and older and declined in younger men. However, a 65% drop in cervical cancer incidence during 2012 through 2019 among women in their early 20s, the first cohort to receive the human papillomavirus vaccine, foreshadows steep reductions in the burden of human papillomavirus-associated cancers, the majority of which occur in women. Despite the pandemic, and in contrast with other leading causes of death, the cancer death rate continued to decline from 2019 to 2020 (by 1.5%), contributing to a 33% overall reduction since 1991 and an estimated 3.8 million deaths averted. This progress increasingly reflects advances in treatment, which are particularly evident in the rapid declines in mortality (approximately 2% annually during 2016 through 2020) for leukemia, melanoma, and kidney cancer, despite stable/increasing incidence, and accelerated declines for lung cancer. In summary, although cancer mortality rates continue to decline, future progress may be attenuated by rising incidence for breast, prostate, and uterine corpus cancers, which also happen to have the largest racial disparities in mortality.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Genetic effects on gene expression across human tissues

            (2017)
            Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Gene expression profiling predicts clinical outcome of breast cancer.

              Breast cancer patients with the same stage of disease can have markedly different treatment responses and overall outcome. The strongest predictors for metastases (for example, lymph node status and histological grade) fail to classify accurately breast tumours according to their clinical behaviour. Chemotherapy or hormonal therapy reduces the risk of distant metastases by approximately one-third; however, 70-80% of patients receiving this treatment would have survived without it. None of the signatures of breast cancer gene expression reported to date allow for patient-tailored therapy strategies. Here we used DNA microarray analysis on primary breast tumours of 117 young patients, and applied supervised classification to identify a gene expression signature strongly predictive of a short interval to distant metastases ('poor prognosis' signature) in patients without tumour cells in local lymph nodes at diagnosis (lymph node negative). In addition, we established a signature that identifies tumours of BRCA1 carriers. The poor prognosis signature consists of genes regulating cell cycle, invasion, metastasis and angiogenesis. This gene expression profile will outperform all currently used clinical parameters in predicting disease outcome. Our findings provide a strategy to select patients who would benefit from adjuvant therapy.
                Bookmark

                Author and article information

                Contributors
                priyanka.roy@vitbhopal.ac.in
                sauravmtech2@gmail.com , smallik@hsph.harvard.edu , smallik@arizona.edu
                drmohdasifshah@kdu.edu.et
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                15 December 2023
                15 December 2023
                2023
                : 24
                : 479
                Affiliations
                [1 ]School of Computing Science and Engineering, VIT Bhopal University, ( https://ror.org/02ax13658) Kothrikalan, Sehore, Madhya Pradesh 46611 India
                [2 ]School of Advanced Sciences and Language, VIT Bhopal University, ( https://ror.org/02ax13658) Kothrikalan, Sehore, Madhya Pradesh 46611 India
                [3 ]GRID grid.38142.3c, ISNI 000000041936754X, Molecular and Integrative Physiological Sciences, Department of Environmental Health, , Harvard T. H. Chan School of Public Health, ; Boston, MA 02115 USA
                [4 ]Department of Pharmacology and Toxicology, University of Arizona, ( https://ror.org/03m2x1q45) Tucson, AZ 85721 USA
                [5 ]Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, ( https://ror.org/03gds6c39) Houston, TX 77030 USA
                [6 ]School of Computer Science and Engineering, Xi’an University of Technology, ( https://ror.org/038avdt50) Shaanxi, 710048 China
                [7 ]Department of Economics, Kebri Dehar University, ( https://ror.org/00r6xxj20) Kebri Dehar, 250 Somali Ethiopia
                [8 ] Division of Research and Development, Lovely Professional University, ( https://ror.org/00et6q107) Phagwara, Punjab 144001 India
                [9 ] Centre for Research Impact & Outcome, Chitkara University, ( https://ror.org/057d6z539) Rajpura, Punjab 140401 India
                Article
                5605
                10.1186/s12859-023-05605-5
                10724960
                38102551
                3e329523-8006-4c21-9c56-94eaf6aae984
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 18 August 2023
                : 6 December 2023
                Categories
                Research
                Custom metadata
                © BioMed Central Ltd., part of Springer Nature 2023

                Bioinformatics & Computational biology
                deep learning (dl),cuckoo search algorithm (csa),spider monkey optimization (sm),minimum redundancy maximum relevance (mrmr),cancer classification

                Comments

                Comment on this article