I-TASSER: a unified platform for automated protein structure and function prediction

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

The iterative threading assembly refinement (I-TASSER) server is an integrated platform for automated protein structure and function prediction based on the sequence-to-structure-to-function paradigm. Starting from an amino acid sequence, I-TASSER first generates three-dimensional (3D) atomic models from multiple threading alignments and iterative structural assembly simulations. The function of the protein is then inferred by structurally matching the 3D models with other known proteins. The output from a typical server run contains full-length secondary and tertiary structure predictions, and functional annotations on ligand-binding sites, Enzyme Commission numbers and Gene Ontology terms. An estimate of accuracy of the predictions is provided based on the confidence score of the modeling. This protocol provides new insights and guidelines for designing of online server systems for the state-of-the-art protein structure and function predictions. The server is available at http://zhanglab.ccmb.med.umich.edu/I-TASSER.

Related collections

Most cited references 68

Record: found
Abstract: found
Article: not found

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

S Altschul (1997)

The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.

0 comments Cited 4311 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The Protein Data Bank.

H M Berman, J Westbrook, Z. Feng … (2000)

The Protein Data Bank (PDB; http://www.rcsb.org/pdb/ ) is the single worldwide archive of structural data of biological macromolecules. This paper describes the goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and near-term plans for the future development of the resource.

0 comments Cited 3929 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Comparative protein modelling by satisfaction of spatial restraints.

A. Sali, T L Blundell (1993)

We describe a comparative protein modelling method designed to find the most probable structure for a sequence given its alignment with related structures. The three-dimensional (3D) model is obtained by optimally satisfying spatial restraints derived from the alignment and expressed as probability density functions (pdfs) for the features restrained. For example, the probabilities for main-chain conformations of a modelled residue may be restrained by its residue type, main-chain conformation of an equivalent residue in a related protein, and the local similarity between the two sequences. Several such pdfs are obtained from the correlations between structural features in 17 families of homologous proteins which have been aligned on the basis of their 3D structures. The pdfs restrain C alpha-C alpha distances, main-chain N-O distances, main-chain and side-chain dihedral angles. A smoothing procedure is used in the derivation of these relationships to minimize the problem of a sparse database. The 3D model of a protein is obtained by optimization of the molecular pdf such that the model violates the input restraints as little as possible. The molecular pdf is derived as a combination of pdfs restraining individual spatial features of the whole molecule. The optimization procedure is a variable target function method that applies the conjugate gradients algorithm to positions of all non-hydrogen atoms. The method is automated and is illustrated by the modelling of trypsin from two other serine proteinases.

0 comments Cited 1039 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Title: Nature Protocols

Abbreviated Title: Nat Protoc

Publisher: Springer Science and Business Media LLC

ISSN (Print): 1754-2189

ISSN (Electronic): 1750-2799

Publication date Created: April 2010

Publication date (Electronic): March 25 2010

Publication date (Print): April 2010

Volume: 5

Issue: 4

Pages: 725-738

Article

DOI: 10.1038/nprot.2010.5

PMC ID: 2849174

PubMed ID: 20360767

SO-VID: 927ed7f6-7536-4bbd-a487-9f37a36a8761

License:

http://www.springer.com/tdm

History

Data availability:

I-TASSER: a unified platform for automated protein structure and function prediction

Read this article at

Abstract

Related collections

The Journal of Prediction Markets

Most cited references 68

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

The Protein Data Bank.

Comparative protein modelling by satisfaction of spatial restraints.

Author and article information

Journal

Article

History

Comments

Comment on this article

Similar content 1,994

Cited by 2,082