47
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MAFFT-DASH: integrated protein sequence and structural alignment

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Here, we describe a web server that integrates structural alignments with the MAFFT multiple sequence alignment (MSA) tool. For this purpose, we have prepared a web-based Database of Aligned Structural Homologs (DASH), which provides structural alignments at the domain and chain levels for all proteins in the Protein Data Bank (PDB), and can be queried interactively or by a simple REST-like API. MAFFT-DASH integration can be invoked with a single flag on either the web ( https://mafft.cbrc.jp/alignment/server/) or command-line versions of MAFFT. In our benchmarks using 878 cases from the BAliBase, HomFam, OXFam, Mattbench and SISYPHUS datasets, MAFFT-DASH showed 10–20% improvement over standard MAFFT for MSA problems with weak similarity, in terms of Sum-of-Pairs (SP), a measure of how well a program succeeds at aligning input sequences in comparison to a reference alignment. When MAFFT alignments were supplemented with homologous sequences, further improvement was observed. Potential applications of DASH beyond MSA enrichment include functional annotation through detection of remote homology and assembly of template libraries for homology modeling.

          Related collections

          Most cited references23

          • Record: found
          • Abstract: found
          • Article: not found

          T-Coffee: a web server for the multiple sequence alignment of protein and RNA sequences using structural information and homology extension

          This article introduces a new interface for T-Coffee, a consistency-based multiple sequence alignment program. This interface provides an easy and intuitive access to the most popular functionality of the package. These include the default T-Coffee mode for protein and nucleic acid sequences, the M-Coffee mode that allows combining the output of any other aligners, and template-based modes of T-Coffee that deliver high accuracy alignments while using structural or homology derived templates. These three available template modes are Expresso for the alignment of protein with a known 3D-Structure, R-Coffee to align RNA sequences with conserved secondary structures and PSI-Coffee to accurately align distantly related sequences using homology extension. The new server benefits from recent improvements of the T-Coffee algorithm and can align up to 150 sequences as long as 10 000 residues and is available from both http://www.tcoffee.org and its main mirror http://tcoffee.crg.cat.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            An improved algorithm for matching biological sequences.

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Emerging methods in protein co-evolution.

              Co-evolution is a fundamental component of the theory of evolution and is essential for understanding the relationships between species in complex ecological networks. A wide range of co-evolution-inspired computational methods has been designed to predict molecular interactions, but it is only recently that important advances have been made. Breakthroughs in the handling of phylogenetic information and in disentangling indirect relationships have resulted in an improved capacity to predict interactions between proteins and contacts between different protein residues. Here, we review the main co-evolution-based computational approaches, their theoretical basis, potential applications and foreseeable developments.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                02 July 2019
                07 May 2019
                07 May 2019
                : 47
                : W1
                : W5-W10
                Affiliations
                [1 ]Department of Genome Informatics, Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita 565-0871, Japan
                [2 ]Systems Immunology Laboratory, Immunology Frontier Research Center, Osaka University, 3-1 Yamadaoka, Suita 565-0871, Japan
                Author notes
                To whom correspondence should be addressed. Tel: +81 6 6879 8367; Fax: +81 6 6879 8368; Email: katoh@ 123456ifrec.osaka-u.ac.jp
                Correspondence may also be addressed to Daron M. Standley. Email: standley@ 123456ifrec.osaka-u.ac.jp
                Author information
                http://orcid.org/0000-0003-4133-8393
                Article
                gkz342
                10.1093/nar/gkz342
                6602451
                31062021
                cfc23838-e2a7-4b1d-87be-826fd61a78c4
                © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@ 123456oup.com

                History
                : 25 April 2019
                : 07 April 2019
                : 12 February 2019
                Page count
                Pages: 6
                Funding
                Funded by: AMED 10.13039/100009619
                Award ID: 18am0101108j0002
                Funded by: JSPS 10.13039/501100001691
                Award ID: 16K07464
                Categories
                Web Server Issue

                Genetics
                Genetics

                Comments

                Comment on this article