12
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      GPS-SUMO 2.0: an updated online service for the prediction of SUMOylation sites and SUMO-interacting motifs

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Small ubiquitin-like modifiers (SUMOs) are tiny but important protein regulators involved in orchestrating a broad spectrum of biological processes, either by covalently modifying protein substrates or by noncovalently interacting with other proteins. Here, we report an updated server, GPS-SUMO 2.0, for the prediction of SUMOylation sites and SUMO-interacting motifs (SIMs). For predictor training, we adopted three machine learning algorithms, penalized logistic regression (PLR), a deep neural network (DNN), and a transformer, and used 52 404 nonredundant SUMOylation sites in 8262 proteins and 163 SIMs in 102 proteins. To further increase the accuracy of predicting SUMOylation sites, a pretraining model was first constructed using 145 545 protein lysine modification sites, followed by transfer learning to fine-tune the model. GPS-SUMO 2.0 exhibited greater accuracy in predicting SUMOylation sites than did other existing tools. For users, one or multiple protein sequences or identifiers can be input, and the prediction results are shown in a tabular list. In addition to the basic statistics, we integrated knowledge from 35 public resources to annotate SUMOylation sites or SIMs. The GPS-SUMO 2.0 server is freely available at https://sumo.biocuckoo.cn/. We believe that GPS-SUMO 2.0 can serve as a useful tool for further analysis of SUMOylation and SUMO interactions.

          Graphical Abstract

          Graphical Abstract

          Related collections

          Most cited references62

          • Record: found
          • Abstract: found
          • Article: not found

          Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

          Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets

            Abstract Proteins and their functional interactions form the backbone of the cellular machinery. Their connectivity network needs to be considered for the full understanding of biological phenomena, but the available information on protein–protein associations is incomplete and exhibits varying levels of annotation granularity and reliability. The STRING database aims to collect, score and integrate all publicly available sources of protein–protein interaction information, and to complement these with computational predictions. Its goal is to achieve a comprehensive and objective global network, including direct (physical) as well as indirect (functional) interactions. The latest version of STRING (11.0) more than doubles the number of organisms it covers, to 5090. The most important new feature is an option to upload entire, genome-wide datasets as input, allowing users to visualize subsets as interaction networks and to perform gene-set enrichment analysis on the entire input. For the enrichment analysis, STRING implements well-known classification systems such as Gene Ontology and KEGG, but also offers additional, new classification systems based on high-throughput text-mining as well as on a hierarchical clustering of the association network itself. The STRING resource is available online at https://string-db.org/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              CD-HIT: accelerated for clustering the next-generation sequencing data

              Summary: CD-HIT is a widely used program for clustering biological sequences to reduce sequence redundancy and improve the performance of other sequence analyses. In response to the rapid increase in the amount of sequencing data produced by the next-generation sequencing technologies, we have developed a new CD-HIT program accelerated with a novel parallelization strategy and some other techniques to allow efficient clustering of such datasets. Our tests demonstrated very good speedup derived from the parallelization for up to ∼24 cores and a quasi-linear speedup for up to ∼8 cores. The enhanced CD-HIT is capable of handling very large datasets in much shorter time than previous versions. Availability: http://cd-hit.org. Contact: liwz@sdsc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
                Bookmark

                Author and article information

                Contributors
                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                05 July 2024
                06 May 2024
                06 May 2024
                : 52
                : W1
                : W238-W247
                Affiliations
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Computer Network Information Center, Chinese Academy of Sciences , Beijing100190, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Key Laboratory of Molecular Biophysics of Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology , Wuhan430074, China
                Nanjing University Institute of Artificial Intelligence Biomedicine , Nanjing210031, China
                Author notes
                To whom correspondence should be addressed. Tel: +86 27 87793903; Email: xueyu@ 123456hust.edu.cn
                Correspondence may also be addressed to Di Peng. Email: pengdi@ 123456hust.edu.cn
                Correspondence may also be addressed to Teng Lu. Email: luteng@ 123456sccas.cn

                The first two authors should be regarded as Joint First Authors.

                Author information
                https://orcid.org/0000-0002-9403-6869
                Article
                gkae346
                10.1093/nar/gkae346
                11223847
                38709873
                dab7b5aa-bfb4-4fce-a9d5-548723235451
                © The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License ( https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@ 123456oup.com

                History
                : 18 April 2024
                : 08 April 2024
                : 31 January 2024
                Page count
                Pages: 10
                Funding
                Funded by: National Key R&D Program of China, DOI 10.13039/501100012166;
                Award ID: 2022YFC2704304
                Award ID: 2021YFF0702000
                Funded by: Natural Science Foundation of China, DOI 10.13039/501100001809;
                Award ID: 32341020
                Award ID: 32341021
                Award ID: 31930021
                Funded by: Hubei Innovation Group;
                Award ID: 2021CFA005
                Funded by: Hubei Province Postdoctoral Outstanding Talent Tracking Support Program, Strategic Priority Research Program of CAS;
                Award ID: XDB0500101
                Funded by: Research Core Facilities for Life Science;
                Categories
                AcademicSubjects/SCI00010
                Web Server Issue

                Genetics
                Genetics

                Comments

                Comment on this article