0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Protein Condensate Atlas from predictive models of heteromolecular condensate composition

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Biomolecular condensates help cells organise their content in space and time. Cells harbour a variety of condensate types with diverse composition and many are likely yet to be discovered. Here, we develop a methodology to predict the composition of biomolecular condensates. We first analyse available proteomics data of cellular condensates and find that the biophysical features that determine protein localisation into condensates differ from known drivers of homotypic phase separation processes, with charge mediated protein-RNA and hydrophobicity mediated protein-protein interactions playing a key role in the former process. We then develop a machine learning model that links protein sequence to its propensity to localise into heteromolecular condensates. We apply the model across the proteome and find many of the top-ranked targets outside the original training data to localise into condensates as confirmed by orthogonal immunohistochemical staining imaging. Finally, we segment the condensation-prone proteome into condensate types based on an overlap with biomolecular interaction profiles to generate a Protein Condensate Atlas. Several condensate clusters within the Atlas closely match the composition of experimentally characterised condensates or regions within them, suggesting that the Atlas can be valuable for identifying additional components within known condensate systems and discovering previously uncharacterised condensates.

          Abstract

          Biomolecular condensates help cells organise their content in space and time. Here the authors report a machine learning driven methodology to predict the composition of biomolecular condensates and they then validate their predictions against the composition of known biomolecular condensates.

          Related collections

          Most cited references55

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The STRING database in 2021: customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets

          Abstract Cellular life depends on a complex web of functional associations between biomolecules. Among these associations, protein–protein interactions are particularly important due to their versatility, specificity and adaptability. The STRING database aims to integrate all known and predicted associations between proteins, including both physical interactions as well as functional associations. To achieve this, STRING collects and scores evidence from a number of sources: (i) automated text mining of the scientific literature, (ii) databases of interaction experiments and annotated complexes/pathways, (iii) computational interaction predictions from co-expression and from conserved genomic context and (iv) systematic transfers of interaction evidence from one organism to another. STRING aims for wide coverage; the upcoming version 11.5 of the resource will contain more than 14 000 organisms. In this update paper, we describe changes to the text-mining system, a new scoring-mode for physical interactions, as well as extensive user interface features for customizing, extending and sharing protein networks. In addition, we describe how to query STRING with genome-wide, experimental data, including the automated detection of enriched functionalities and potential biases in the user's query data. The STRING resource is available online, at https://string-db.org/.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The Gene Ontology resource: enriching a GOld mine

            Abstract The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              UMAP: Uniform Manifold Approximation and Projection

                Bookmark

                Author and article information

                Contributors
                ksaar@transitionbio.com
                tpjk2@cam.ac.uk
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                10 July 2024
                10 July 2024
                2024
                : 15
                : 5418
                Affiliations
                [1 ]Transition Bio Ltd, Cambridge, UK
                [2 ]Yusuf Hamied Department of Chemistry, University of Cambridge, ( https://ror.org/013meh722) Cambridge, CB2 1EW UK
                [3 ]Department of Chemistry, University of Oxford, ( https://ror.org/052gg0110) Oxford, OX1 3TA UK
                [4 ]GRID grid.94365.3d, ISNI 0000 0001 2297 5165, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, , National Institutes of Health, ; Bethesda, MD 20892 USA
                [5 ]Cavendish Laboratory, Department of Physics, University of Cambridge, ( https://ror.org/013meh722) Cambridge, CB3 0HE UK
                [6 ]Wellcome Sanger Institute, ( https://ror.org/05cy4wa09) Wellcome Genome Campus, Hinxton, Cambridge, UK
                Author information
                http://orcid.org/0000-0002-5926-3628
                http://orcid.org/0009-0008-0012-0703
                http://orcid.org/0000-0001-5308-8542
                http://orcid.org/0000-0002-9616-3108
                http://orcid.org/0000-0002-6294-6366
                http://orcid.org/0000-0002-7879-0140
                Article
                48496
                10.1038/s41467-024-48496-7
                11237133
                38987300
                d2764be1-416a-4bed-b113-9fc890bf582a
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 17 June 2023
                : 2 May 2024
                Funding
                Funded by: European Research Council
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2024

                Uncategorized
                molecular biophysics,computational biophysics,proteome informatics
                Uncategorized
                molecular biophysics, computational biophysics, proteome informatics

                Comments

                Comment on this article