92
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Data Mining Methods for Omics and Knowledge of Crude Medicinal Plants toward Big Data Biology

      review-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Molecular biological data has rapidly increased with the recent progress of the Omics fields, e.g., genomics, transcriptomics, proteomics and metabolomics that necessitates the development of databases and methods for efficient storage, retrieval, integration and analysis of massive data. The present study reviews the usage of KNApSAcK Family DB in metabolomics and related area, discusses several statistical methods for handling multivariate data and shows their application on Indonesian blended herbal medicines (Jamu) as a case study. Exploration using Biplot reveals many plants are rarely utilized while some plants are highly utilized toward specific efficacy. Furthermore, the ingredients of Jamu formulas are modeled using Partial Least Squares Discriminant Analysis (PLS-DA) in order to predict their efficacy. The plants used in each Jamu medicine served as the predictors, whereas the efficacy of each Jamu provided the responses. This model produces 71.6% correct classification in predicting efficacy. Permutation test then is used to determine plants that serve as main ingredients in Jamu formula by evaluating the significance of the PLS-DA coefficients. Next, in order to explain the role of plants that serve as main ingredients in Jamu medicines, information of pharmacological activity of the plants is added to the predictor block. Then N-PLS-DA model, multiway version of PLS-DA, is utilized to handle the three-dimensional array of the predictor block. The resulting N-PLS-DA model reveals that the effects of some pharmacological activities are specific for certain efficacy and the other activities are diverse toward many efficacies. Mathematical modeling introduced in the present study can be utilized in global analysis of big data targeting to reveal the underlying biology.

          Related collections

          Most cited references132

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          MetaboLights—an open-access general-purpose repository for metabolomics studies and associated meta-data

          MetaboLights (http://www.ebi.ac.uk/metabolights) is the first general-purpose, open-access repository for metabolomics studies, their raw experimental data and associated metadata, maintained by one of the major open-access data providers in molecular biology. Metabolomic profiling is an important tool for research into biological functioning and into the systemic perturbations caused by diseases, diet and the environment. The effectiveness of such methods depends on the availability of public open data across a broad range of experimental methods and conditions. The MetaboLights repository, powered by the open source ISA framework, is cross-species and cross-technique. It will cover metabolite structures and their reference spectra as well as their biological roles, locations, concentrations and raw data from metabolic experiments. Studies automatically receive a stable unique accession number that can be used as a publication reference (e.g. MTBLS1). At present, the repository includes 15 submitted studies, encompassing 93 protocols for 714 assays, and span over 8 different species including human, Caenorhabditis elegans, Mus musculus and Arabidopsis thaliana. Eight hundred twenty-seven of the metabolites identified in these studies have been mapped to ChEBI. These studies cover a variety of techniques, including NMR spectroscopy and mass spectrometry.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The use of metabolomics to dissect plant responses to abiotic stresses

            Plant metabolism is perturbed by various abiotic stresses. As such the metabolic network of plants must be reconfigured under stress conditions in order to allow both the maintenance of metabolic homeostasis and the production of compounds that ameliorate the stress. The recent development and adoption of metabolomics and systems biology approaches enable us not only to gain a comprehensive overview, but also a detailed analysis of crucial components of the plant metabolic response to abiotic stresses. In this review we introduce the analytical methods used for plant metabolomics and describe their use in studies related to the metabolic response to water, temperature, light, nutrient limitation, ion and oxidative stresses. Both similarity and specificity of the metabolic responses against diverse abiotic stress are evaluated using data available in the literature. Classically discussed stress compounds such as proline, γ-amino butyrate and polyamines are reviewed, and the widespread importance of branched chain amino acid metabolism under stress condition is discussed. Finally, where possible, mechanistic insights into metabolic regulatory processes are discussed. Electronic supplementary material The online version of this article (doi:10.1007/s00018-012-1091-5) contains supplementary material, which is available to authorized users.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Partial least squares: a versatile tool for the analysis of high-dimensional genomic data.

              Partial least squares (PLS) is an efficient statistical regression technique that is highly suited for the analysis of genomic and proteomic data. In this article, we review both the theory underlying PLS as well as a host of bioinformatics applications of PLS. In particular, we provide a systematic comparison of the PLS approaches currently employed, and discuss analysis problems as diverse as, e.g. tumor classification from transcriptome data, identification of relevant genes, survival analysis and modeling of gene networks and transcription factor activities.
                Bookmark

                Author and article information

                Journal
                Comput Struct Biotechnol J
                Comput Struct Biotechnol J
                CSBJ
                Computational and Structural Biotechnology Journal
                Research Network of Computational and Structural Biotechnology (RNCSB) Organization
                2001-0370
                23 March 2013
                2013
                : 4
                : e201301010
                Affiliations
                [a ]Graduate School of Information Science, Nara Institute of Science and Technology, Nara 630-0101, Ikoma, Japan
                [b ]Department of Statistics, Bogor Agricultural University, Jln. Meranti, Kampus IPB Darmaga, Bogor 16680, Indonesia
                [c ]Biopharmaca Research Center, Bogor Agricultural University, Kampas IPB Taman Kencana, Jln. Taman Kencana No. 3 Bogor 16151, Indonesia
                [d ]Maebashi Institute of technology, 450-1 Kamisadori, Maebashi-shi, Gunma, 371-0816 Japan
                [e ]Department of Medicinal Resources, Institute of Natural Medicine, University of Toyama, 2630 Toyama, 930-0194, Japan
                [f ]Department of Electronic and Computer Engineering, Ibaraki National College of Technology, 866 Nakane, Hitachinaka, Ibaraki 312-8508, Japan
                Author notes
                Citation
                Afendi FM, Ono N, Nakamura Y, Nakamura K, Darusman LK, Kibinge N, Morita AH, Tanaka K, Horai H, Altaf-Ul-Amin M, Kanaya S (2013) Data Mining Methods for Omics and Knowledge of Crude Medicinal Plants toward Big Data Biology. Computational and Structural Biotechnology Journal. 4 (5): e201301010.
                [* ] Corresponding author: E-mail address: skanaya@ 123456gtc.naist.jp (Shigehiko Kanaya)
                Article
                CSBJ-4-e201301010
                10.5936/csbj.201301010
                3962233
                24688691
                9a3cfdfc-1400-41a6-b125-93459b8b5b49
                © Afendi et al.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly cited.

                History
                : 08 December 2012
                : 09 March 2013
                : 09 March 2013
                Categories
                Mini Reviews

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content27

                Cited by13

                Most referenced authors1,049