28
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Cache Domains That are Homologous to, but Different from PAS Domains Comprise the Largest Superfamily of Extracellular Sensors in Prokaryotes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Cellular receptors usually contain a designated sensory domain that recognizes the signal. Per/Arnt/Sim (PAS) domains are ubiquitous sensors in thousands of species ranging from bacteria to humans. Although PAS domains were described as intracellular sensors, recent structural studies revealed PAS-like domains in extracytoplasmic regions in several transmembrane receptors. However, these structurally defined extracellular PAS-like domains do not match sequence-derived PAS domain models, and thus their distribution across the genomic landscape remains largely unknown. Here we show that structurally defined extracellular PAS-like domains belong to the Cache superfamily, which is homologous to, but distinct from the PAS superfamily. Our newly built computational models enabled identification of Cache domains in tens of thousands of signal transduction proteins including those from important pathogens and model organisms. Furthermore, we show that Cache domains comprise the dominant mode of extracellular sensing in prokaryotes.

          Author Summary

          Cell-surface receptors control multiple cellular functions and are attractive targets for drug design. These receptors often have dedicated extracellular domains that bind signaling molecules, such as hormones and nutrients. Computational identification of these ligand-binding domains in genomic sequences is a pre-requisite for their further experimental characterization. Using available three-dimensional structures of several bacterial cell-surface receptors, we built computational models that enabled identification of the Cache domain, as the most common extracellular sensor module in prokaryotes, including many important pathogens. We also demonstrated that the Cache domain is homologous to, but sufficiently different from the most common intracellular sensor module, the PAS domain. These findings provide a unified view on molecular principles of signal recognition by extra- and intracellular receptors.

          Related collections

          Most cited references57

          • Record: found
          • Abstract: found
          • Article: not found

          Protein homology detection by HMM-HMM comparison.

          Protein homology detection and sequence alignment are at the basis of protein structure prediction, function prediction and evolution. We have generalized the alignment of protein sequences with a profile hidden Markov model (HMM) to the case of pairwise alignment of profile HMMs. We present a method for detecting distant homologous relationships between proteins based on this approach. The method (HHsearch) is benchmarked together with BLAST, PSI-BLAST, HMMER and the profile-profile comparison tools PROF_SIM and COMPASS, in an all-against-all comparison of a database of 3691 protein domains from SCOP 1.63 with pairwise sequence identities below 20%.Sensitivity: When the predicted secondary structure is included in the HMMs, HHsearch is able to detect between 2.7 and 4.2 times more homologs than PSI-BLAST or HMMER and between 1.44 and 1.9 times more than COMPASS or PROF_SIM for a rate of false positives of 10%. Approximately half of the improvement over the profile-profile comparison methods is attributable to the use of profile HMMs in place of simple profiles. Alignment quality: Higher sensitivity is mirrored by an increased alignment quality. HHsearch produced 1.2, 1.7 and 3.3 times more good alignments ('balanced' score >0.3) than the next best method (COMPASS), and 1.6, 2.9 and 9.4 times more than PSI-BLAST, at the family, superfamily and fold level, respectively.Speed: HHsearch scans a query of 200 residues against 3691 domains in 33 s on an AMD64 2GHz PC. This is 10 times faster than PROF_SIM and 17 times faster than COMPASS.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Two-component signal transduction.

            Most prokaryotic signal-transduction systems and a few eukaryotic pathways use phosphotransfer schemes involving two conserved components, a histidine protein kinase and a response regulator protein. The histidine protein kinase, which is regulated by environmental stimuli, autophosphorylates at a histidine residue, creating a high-energy phosphoryl group that is subsequently transferred to an aspartate residue in the response regulator protein. Phosphorylation induces a conformational change in the regulatory domain that results in activation of an associated domain that effects the response. The basic scheme is highly adaptable, and numerous variations have provided optimization within specific signaling systems. The domains of two-component proteins are modular and can be integrated into proteins and pathways in a variety of ways, but the core structures and activities are maintained. Thus detailed analyses of a relatively small number of representative proteins provide a foundation for understanding this large family of signaling proteins.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              SMART: recent updates, new developments and status in 2015

              SMART (Simple Modular Architecture Research Tool) is a web resource (http://smart.embl.de/) providing simple identification and extensive annotation of protein domains and the exploration of protein domain architectures. In the current version, SMART contains manually curated models for more than 1200 protein domains, with ∼200 new models since our last update article. The underlying protein databases were synchronized with UniProt, Ensembl and STRING, bringing the total number of annotated domains and other protein features above 100 million. SMART's ‘Genomic’ mode, which annotates proteins from completely sequenced genomes was greatly expanded and now includes 2031 species, compared to 1133 in the previous release. SMART analysis results pages have been completely redesigned and include links to several new information sources. A new, vector-based display engine has been developed for protein schematics in SMART, which can also be exported as high-resolution bitmap images for easy inclusion into other documents. Taxonomic tree displays in SMART have been significantly improved, and can be easily navigated using the integrated search engine.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput. Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, CA USA )
                1553-734X
                1553-7358
                6 April 2016
                April 2016
                : 12
                : 4
                : e1004862
                Affiliations
                [1 ]Genome Science and Technology Graduate Program, University of Tennessee–Oak Ridge National Laboratory, Knoxville, Tennessee, United States of America
                [2 ]Department of Microbiology, University of Tennessee, Knoxville, Tennessee, United States of America
                [3 ]Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, United States of America
                [4 ]European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
                Icahn School of Medicine at Mount Sinai, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Conceived and designed the experiments: IBZ AAU. Performed the experiments: AAU. Analyzed the data: AAU ADF OA RDF IBZ. Contributed reagents/materials/analysis tools: RDF. Wrote the paper: AAU ADF IBZ.

                Article
                PCOMPBIOL-D-15-01830
                10.1371/journal.pcbi.1004862
                4822843
                27049771
                a1ecf4bd-f44d-469e-9504-bd05bca4b4a4
                © 2016 Upadhyay et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 30 October 2015
                : 10 March 2016
                Page count
                Figures: 7, Tables: 1, Pages: 21
                Funding
                This work was supported in part by the National Institute of General Medical Sciences under award number R01GM0722285. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Biochemistry
                Proteins
                Protein Domains
                Research and Analysis Methods
                Database and Informatics Methods
                Biological Databases
                Sequence Databases
                Biology and Life Sciences
                Molecular Biology
                Molecular Biology Techniques
                Sequencing Techniques
                Sequence Analysis
                Sequence Databases
                Research and Analysis Methods
                Molecular Biology Techniques
                Sequencing Techniques
                Sequence Analysis
                Sequence Databases
                Research and Analysis Methods
                Database and Informatics Methods
                Database Searching
                Biology and Life Sciences
                Cell Biology
                Signal Transduction
                Biology and Life Sciences
                Molecular Biology
                Molecular Biology Techniques
                Sequencing Techniques
                Sequence Analysis
                Sequence Alignment
                Research and Analysis Methods
                Molecular Biology Techniques
                Sequencing Techniques
                Sequence Analysis
                Sequence Alignment
                Biology and Life Sciences
                Cell Biology
                Cellular Types
                Prokaryotic Cells
                Research and Analysis Methods
                Database and Informatics Methods
                Database Searching
                Sequence Similarity Searching
                Biology and Life Sciences
                Neuroscience
                Sensory Perception
                Sensory Receptors
                Biology and Life Sciences
                Psychology
                Sensory Perception
                Sensory Receptors
                Social Sciences
                Psychology
                Sensory Perception
                Sensory Receptors
                Biology and Life Sciences
                Cell Biology
                Signal Transduction
                Sensory Receptors
                Custom metadata
                All relevant data are within the paper and its Supporting Information files.

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article