63
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          The extensive use of DNA microarray technology in the characterization of the cell transcriptome is leading to an ever increasing amount of microarray data from cancer studies. Although similar questions for the same type of cancer are addressed in these different studies, a comparative analysis of their results is hampered by the use of heterogeneous microarray platforms and analysis methods.

          Results

          In contrast to a meta-analysis approach where results of different studies are combined on an interpretative level, we investigate here how to directly integrate raw microarray data from different studies for the purpose of supervised classification analysis. We use median rank scores and quantile discretization to derive numerically comparable measures of gene expression from different platforms. These transformed data are then used for training of classifiers based on support vector machines. We apply this approach to six publicly available cancer microarray gene expression data sets, which consist of three pairs of studies, each examining the same type of cancer, i.e. breast cancer, prostate cancer or acute myeloid leukemia. For each pair, one study was performed by means of cDNA microarrays and the other by means of oligonucleotide microarrays. In each pair, high classification accuracies (> 85%) were achieved with training and testing on data instances randomly chosen from both data sets in a cross-validation analysis. To exemplify the potential of this cross-platform classification analysis, we use two leukemia microarray data sets to show that important genes with regard to the biology of leukemia are selected in an integrated analysis, which are missed in either single-set analysis.

          Conclusion

          Cross-platform classification of multiple cancer microarray data sets yields discriminative gene expression signatures that are found and validated on a large number of microarray samples, generated by different laboratories and microarray technologies. Predictive models generated by this approach are better validated than those generated on a single data set, while showing high predictive power and improved generalization performance.

          Related collections

          Most cited references41

          • Record: found
          • Abstract: found
          • Article: not found

          Creation of human tumour cells with defined genetic elements.

          During malignant transformation, cancer cells acquire genetic mutations that override the normal mechanisms controlling cellular proliferation. Primary rodent cells are efficiently converted into tumorigenic cells by the coexpression of cooperating oncogenes. However, similar experiments with human cells have consistently failed to yield tumorigenic transformants, indicating a fundamental difference in the biology of human and rodent cells. The few reported successes in the creation of human tumour cells have depended on the use of chemical or physical agents to achieve immortalization, the selection of rare, spontaneously arising immortalized cells, or the use of an entire viral genome. We show here that the ectopic expression of the telomerase catalytic subunit (hTERT) in combination with two oncogenes (the simian virus 40 large-T oncoprotein and an oncogenic allele of H-ras) results in direct tumorigenic conversion of normal human epithelial and fibroblast cells. These results demonstrate that disruption of the intracellular pathways regulated by large-T, oncogenic ras and telomerase suffices to create a human tumor cell.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Prognostically useful gene-expression profiles in acute myeloid leukemia.

            In patients with acute myeloid leukemia (AML) a combination of methods must be used to classify the disease, make therapeutic decisions, and determine the prognosis. However, this combined approach provides correct therapeutic and prognostic information in only 50 percent of cases. We determined the gene-expression profiles in samples of peripheral blood or bone marrow from 285 patients with AML using Affymetrix U133A GeneChips containing approximately 13,000 unique genes or expression-signature tags. Data analyses were carried out with Omniviz, significance analysis of microarrays, and prediction analysis of microarrays software. Statistical analyses were performed to determine the prognostic significance of cases of AML with specific molecular signatures. Unsupervised cluster analyses identified 16 groups of patients with AML on the basis of molecular signatures. We identified the genes that defined these clusters and determined the minimal numbers of genes needed to identify prognostically important clusters with a high degree of accuracy. The clustering was driven by the presence of chromosomal lesions (e.g., t(8;21), t(15;17), and inv(16)), particular genetic mutations (CEBPA), and abnormal oncogene expression (EVI1). We identified several novel clusters, some consisting of specimens with normal karyotypes. A unique cluster with a distinctive gene-expression signature included cases of AML with a poor treatment outcome. Gene-expression profiling allows a comprehensive classification of AML that includes previously identified genetically defined subgroups and a novel cluster with an adverse prognosis. Copyright 2004 Massachusetts Medical Society
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression.

              Many studies have used DNA microarrays to identify the gene expression signatures of human cancer, yet the critical features of these often unmanageably large signatures remain elusive. To address this, we developed a statistical method, comparative metaprofiling, which identifies and assesses the intersection of multiple gene expression signatures from a diverse collection of microarray data sets. We collected and analyzed 40 published cancer microarray data sets, comprising 38 million gene expression measurements from >3,700 cancer samples. From this, we characterized a common transcriptional profile that is universally activated in most cancer types relative to the normal tissues from which they arose, likely reflecting essential transcriptional features of neoplastic transformation. In addition, we characterized a transcriptional profile that is commonly activated in various types of undifferentiated cancer, suggesting common molecular mechanisms by which cancer cells progress and avoid differentiation. Finally, we validated these transcriptional profiles on independent data sets.
                Bookmark

                Author and article information

                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                2005
                4 November 2005
                : 6
                : 265
                Affiliations
                [1 ]Department of Theoretical Bioinformatics, German Cancer Research Center, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
                [2 ]Department of Bioinformatics and Functional Genomics, Institute for Pharmacy and Molecular Biology, University of Heidelberg, Im Neuenheimer Feld 364, D-69120 Heidelberg, Germany
                Article
                1471-2105-6-265
                10.1186/1471-2105-6-265
                1312314
                16271137
                42809079-57b2-48db-a413-d578a10d5f6e
                Copyright © 2005 Warnat et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 29 March 2005
                : 4 November 2005
                Categories
                Methodology Article

                Bioinformatics & Computational biology
                cancer,gene expression profiling,cross-platform analysis,classification,dna microarray

                Comments

                Comment on this article