69
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      STALLION: a stacking-based ensemble learning framework for prokaryotic lysine acetylation site prediction

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Protein post-translational modification (PTM) is an important regulatory mechanism that plays a key role in both normal and disease states. Acetylation on lysine residues is one of the most potent PTMs owing to its critical role in cellular metabolism and regulatory processes. Identifying protein lysine acetylation (Kace) sites is a challenging task in bioinformatics. To date, several machine learning-based methods for the in silico identification of Kace sites have been developed. Of those, a few are prokaryotic species-specific. Despite their attractive advantages and performances, these methods have certain limitations. Therefore, this study proposes a novel predictor STALLION (STacking-based Predictor for ProkAryotic Lysine AcetyLatION), containing six prokaryotic species-specific models to identify Kace sites accurately. To extract crucial patterns around Kace sites, we employed 11 different encodings representing three different characteristics. Subsequently, a systematic and rigorous feature selection approach was employed to identify the optimal feature set independently for five tree-based ensemble algorithms and built their respective baseline model for each species. Finally, the predicted values from baseline models were utilized and trained with an appropriate classifier using the stacking strategy to develop STALLION. Comparative benchmarking experiments showed that STALLION significantly outperformed existing predictor on independent tests. To expedite direct accessibility to the STALLION models, a user-friendly online predictor was implemented, which is available at: http://thegleelab.org/STALLION.

          Related collections

          Most cited references99

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Random Forests

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Support-vector networks

                Bookmark

                Author and article information

                Contributors
                Journal
                Brief Bioinform
                Brief Bioinform
                bib
                Briefings in Bioinformatics
                Oxford University Press
                1467-5463
                1477-4054
                January 2022
                17 September 2021
                17 September 2021
                : 23
                : 1
                : bbab376
                Affiliations
                Department of Physiology, Ajou University School of Medicine , Republic of Korea
                Department of Molecular Science and Technology, Ajou University , Suwon 16499, Republic of Korea
                Department of Physiology, Ajou University School of Medicine , Republic of Korea
                Author notes
                Corresponding authors. Gwang Lee, Department of Molecular Science and Technology, Ajou University, Suwon 16499, Republic of Korea. Tel.: +82-31-219-4555; Fax: +82-31-219-5049; E-mail: glee@ 123456ajou.ac.kr ; Balachandran Manavalan, Department of Physiology, Ajou University School of Medicine, Suwon 16499, Republic of Korea. Tel.: +82-31-219-4913; Fax: +82-31-219-5049; E-mail: bala@ 123456ajou.ac.kr
                Author information
                https://orcid.org/0000-0002-1299-9478
                https://orcid.org/0000-0003-0697-9419
                Article
                bbab376
                10.1093/bib/bbab376
                8769686
                34532736
                4c0e4a0d-1b9c-4bd6-899c-a0bf6cd28e9a
                © The Author(s) 2021. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 4 June 2021
                : 22 August 2021
                : 24 August 2021
                Page count
                Pages: 15
                Funding
                Funded by: National Research Foundation of Korea, DOI 10.13039/501100003725;
                Award ID: 2021R1A2C1014338
                Award ID: 2019R1I1A1A01062260
                Award ID: 2020R1A4A4079722
                Categories
                Problem Solving Protocol
                AcademicSubjects/SCI01060

                Bioinformatics & Computational biology
                lysine acetylation sites,bioinformatics,stacking strategy,machine learning,feature optimization,performance assessment

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content438

                Cited by26

                Most referenced authors1,587