23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Iterative ensemble feature selection for multiclass classification of imbalanced microarray data

      research-article
      , , , ,
      Journal of Biological Research
      BioMed Central
      2014 International Conference on Intelligent Computing (ICIC2014) (ICIC2014)
      3-6 August 2014

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Microarray technology allows biologists to monitor expression levels of thousands of genes among various tumor tissues. Identifying relevant genes for sample classification of various tumor types is beneficial to clinical studies. One of the most widely used classification strategies for multiclass classification data is the One-Versus-All (OVA) schema that divides the original problem into multiple binary classification of one class against the rest. Nevertheless, multiclass microarray data tend to suffer from imbalanced class distribution between majority and minority classes, which inevitably deteriorates the performance of the OVA classification.

          Results

          In this study, we propose a novel iterative ensemble feature selection (IEFS) framework for multiclass classification of imbalanced microarray data. In particular, filter feature selection and balanced sampling are performed iteratively and alternatively to boost the performance of each binary classification in the OVA schema. The proposed framework is tested and compared with other representative state-of-the-art filter feature selection methods using six benchmark multiclass microarray data sets. The experimental results show that IEFS framework provides superior or comparable performance to the other methods in terms of both classification accuracy and area under receiver operating characteristic curve. The more number of classes the data have, the better performance of IEFS framework achieves.

          Conclusions

          Balanced sampling and feature selection together work well in improving the performance of multiclass classification of imbalanced microarray data. The IEFS framework is readily applicable to other biological data analysis tasks facing the same problem.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: not found

          Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses.

          We have generated a molecular taxonomy of lung carcinoma, the leading cause of cancer death in the United States and worldwide. Using oligonucleotide microarrays, we analyzed mRNA expression levels corresponding to 12,600 transcript sequences in 186 lung tumor samples, including 139 adenocarcinomas resected from the lung. Hierarchical and probabilistic clustering of expression data defined distinct subclasses of lung adenocarcinoma. Among these were tumors with high relative expression of neuroendocrine genes and of type II pneumocyte genes, respectively. Retrospective analysis revealed a less favorable outcome for the adenocarcinomas with neuroendocrine gene expression. The diagnostic potential of expression profiling is emphasized by its ability to discriminate primary lung adenocarcinomas from metastases of extra-pulmonary origin. These results suggest that integration of expression profile data with clinical parameters could aid in diagnosis of lung cancer patients.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Identification and consequences of miRNA-target interactions--beyond repression of gene expression.

            Comparative genomics analyses and high-throughput experimental studies indicate that a microRNA (miRNA) binds to hundreds of sites across the transcriptome. Although the knockout of components of the miRNA biogenesis pathway has profound phenotypic consequences, most predicted miRNA targets undergo small changes at the mRNA and protein levels when the expression of the miRNA is perturbed. Alternatively, miRNAs can establish thresholds in and increase the coherence of the expression of their target genes, as well as reduce the cell-to-cell variability in target gene expression. Here, we review the recent progress in identifying miRNA targets and the emerging paradigms of how miRNAs shape the dynamics of target gene expression.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.

              Treatment of pediatric acute lymphoblastic leukemia (ALL) is based on the concept of tailoring the intensity of therapy to a patient's risk of relapse. To determine whether gene expression profiling could enhance risk assignment, we used oligonucleotide microarrays to analyze the pattern of genes expressed in leukemic blasts from 360 pediatric ALL patients. Distinct expression profiles identified each of the prognostically important leukemia subtypes, including T-ALL, E2A-PBX1, BCR-ABL, TEL-AML1, MLL rearrangement, and hyperdiploid >50 chromosomes. In addition, another ALL subgroup was identified based on its unique expression profile. Examination of the genes comprising the expression signatures provided important insights into the biology of these leukemia subgroups. Further, within some genetic subgroups, expression profiles identified those patients that would eventually fail therapy. Thus, the single platform of expression profiling should enhance the accurate risk stratification of pediatric ALL patients.
                Bookmark

                Author and article information

                Contributors
                yangjunshan@szu.edu.cn
                zhoujiarui@hitsz.edu.cn
                zhuzx@szu.edu.cn
                maxiaoliang@yeah.net
                jizhen@szu.edu.cn
                Conference
                J Biol Res (Thessalon)
                J Biol Res (Thessalon)
                Journal of Biological Research
                BioMed Central (London )
                1790-045X
                2241-5793
                4 July 2016
                4 July 2016
                May 2016
                : 23
                Issue : Suppl 1 Issue sponsor : Publication of this supplement has not been supported by sponsorship. Information about the source of funding for publication charges can be found in the individual articles. The articles have undergone the journal's standard peer review process for supplements. The Supplement Editors declare that they have no competing interests.
                : 13
                Affiliations
                [ ]College of Engineering and Information, Shenzhen University, Shenzhen, People’s Republic of China
                [ ]School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, People’s Republic of China
                [ ]College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, People’s Republic of China
                Article
                45
                10.1186/s40709-016-0045-8
                4943507
                27437198
                6b81da9b-65fa-48a0-8645-c1a20e60c533
                © Yang et al. 2016

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                2014 International Conference on Intelligent Computing (ICIC2014)
                ICIC2014
                Taiyuan, China
                3-6 August 2014
                History
                Categories
                Research
                Custom metadata
                © The Author(s) 2016

                Comments

                Comment on this article