51
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Caipirini: using gene sets to rank literature

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Keeping up-to-date with bioscience literature is becoming increasingly challenging. Several recent methods help meet this challenge by allowing literature search to be launched based on lists of abstracts that the user judges to be 'interesting'. Some methods go further by allowing the user to provide a second input set of 'uninteresting' abstracts; these two input sets are then used to search and rank literature by relevance. In this work we present the service 'Caipirini' ( http://caipirini.org) that also allows two input sets, but takes the novel approach of allowing ranking of literature based on one or more sets of genes.

          Results

          To evaluate the usefulness of Caipirini, we used two test cases, one related to the human cell cycle, and a second related to disease defense mechanisms in Arabidopsis thaliana. In both cases, the new method achieved high precision in finding literature related to the biological mechanisms underlying the input data sets.

          Conclusions

          To our knowledge Caipirini is the first service enabling literature search directly based on biological relevance to gene sets; thus, Caipirini gives the research community a new way to unlock hidden knowledge from gene sets derived via high-throughput experiments.

          Related collections

          Most cited references20

          • Record: found
          • Abstract: not found
          • Article: not found

          A survey of current work in biomedical text mining.

          A. Cohen (2005)
          The volume of published biomedical research, and therefore the underlying biomedical knowledge base, is expanding at an increasing rate. Among the tools that can aid researchers in coping with this information overload are text mining and knowledge extraction. Significant progress has been made in applying text mining to named entity recognition, text classification, terminology extraction, relationship extraction and hypothesis generation. Several research groups are constructing integrated flexible text-mining systems intended for multiple uses. The major challenge of biomedical text mining over the next 5-10 years is to make these systems useful to biomedical researchers. This will require enhanced access to full text, better understanding of the feature space of biomedical literature, better methods for measuring the usefulness of systems to users, and continued cooperation with the biomedical research community to ensure that their needs are addressed.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Understanding the functions of plant disease resistance proteins.

            Many disease resistance (R) proteins of plants detect the presence of disease-causing bacteria, viruses, or fungi by recognizing specific pathogen effector molecules that are produced during the infection process. Effectors are often pathogen proteins that probably evolved to subvert various host processes for promotion of the pathogen life cycle. Five classes of effector-specific R proteins are known, and their sequences suggest roles in both effector recognition and signal transduction. Although some R proteins may act as primary receptors of pathogen effector proteins, most appear to play indirect roles in this process. The functions of various R proteins require phosphorylation, protein degradation, or specific localization within the host cell. Some signaling components are shared by many R gene pathways whereas others appear to be pathway specific. New technologies arising from the genomics and proteomics revolution will greatly expand our ability to investigate the role of R proteins in plant disease resistance.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Induction of systemic acquired disease resistance in plants by chemicals.

                Bookmark

                Author and article information

                Journal
                BioData Min
                BioData Mining
                BioMed Central
                1756-0381
                2012
                1 February 2012
                : 5
                : 1
                Affiliations
                [1 ]Structural and Computational Biology Unit, European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
                [2 ]Computational Biology and Data Mining Group, Max-Delbrück Center for Molecular Medicine, Berlin, Germany
                [3 ]Bioinformatics Graduate Program, Federal University of Paraná - UFPR (SEPT). Curitiba - PR, Brazil
                [4 ]Departamento de Genética, Laboratório de Genética e Biotecnologia Vegetal, Centro de CiênciasBiológicas, Universidade Federal de Pernambuco, Recife, PE, Brasil
                [5 ]ESAT-SCD/IBBT-K.U. Leuven Future Health Department, KatholiekeUniversiteit Leuven, Leuven, Belgium
                [6 ]LIFE Biosystems GmbH, Heidelberg, Germany
                [7 ]Garvan Institute of Medical Research, Sydney, Australia
                [8 ]Division of Mathematics, Informatics, and Statistics, CSIRO, Sydney, Australia
                [9 ]Luxembourg Center for Systems Biomedicine, University of Luxembourg, Luxembourg
                Article
                1756-0381-5-1
                10.1186/1756-0381-5-1
                3307494
                22297131
                9162d6e6-6059-424a-ad7f-daeb4f77e093
                Copyright ©2012 Soldatos et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 12 October 2010
                : 1 February 2012
                Categories
                Software Article

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article