Genome-Based Drug Target Identification in Human Pathogen Streptococcus gallolyticus

Abstract

Streptococcus gallolyticus (Sg) is an opportunistic, Gram-positive, non-motile bacterium that causes infective endocarditis, an inflammation of the inner lining of the heart. Because Sg has acquired resistance to the available antibiotics, there is a pressing need to find new therapeutic targets and potent drugs to prevent and treat this disease. In the current study, an in silico approach is used to link genomic data of Sg with its proteome and identify putative therapeutic targets. A total of 1,138 core proteins were identified using a pan-genomic approach. Subtractive proteomic analysis then identified a set of 18 proteins that are essential for the bacterium and non-homologous to the human host. Of these 18 proteins, 12 cytoplasmic proteins were selected as potential drug targets. The selected proteins were subjected to molecular docking against drug-like compounds retrieved from the ZINC database, and the top docked compounds with lower binding energies were identified. In this work, we have identified novel drug and vaccine targets against Sg, some of which have already been reported and validated in other species. Owing to this experimental validation, we believe our methodology and results are a significant contribution to drug/vaccine target identification against Sg-caused infective endocarditis.
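
The filtering cascade described in the abstract (core proteins, then essential and non-host-homologous proteins, then cytoplasmic candidates) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `Protein` record, its boolean fields, and the `subtractive_filter` function are hypothetical names introduced here, and the sketch assumes that upstream tools have already annotated each protein for core-genome membership, essentiality, human homology, and subcellular localization.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Protein:
    """Hypothetical record; every field is assumed to come from upstream tools."""
    protein_id: str
    in_all_strains: bool   # present in every analysed genome (core proteome)
    essential: bool        # flagged as essential for the bacterium
    human_homolog: bool    # has a significant hit against the human proteome
    localization: str      # predicted subcellular localization


def subtractive_filter(
    proteins: List[Protein],
) -> Tuple[List[Protein], List[Protein], List[Protein]]:
    """Return the protein set remaining after each filtering stage."""
    # Stage 1: keep only proteins shared by all strains.
    core = [p for p in proteins if p.in_all_strains]
    # Stage 2: keep essential proteins with no human homolog.
    essential_non_host = [
        p for p in core if p.essential and not p.human_homolog
    ]
    # Stage 3: keep cytoplasmic proteins as candidate drug targets.
    cytoplasmic = [
        p for p in essential_non_host if p.localization == "cytoplasmic"
    ]
    return core, essential_non_host, cytoplasmic


# Toy input with made-up identifiers, for illustration only.
demo = [
    Protein("P1", True, True, False, "cytoplasmic"),
    Protein("P2", True, True, False, "membrane"),
    Protein("P3", True, True, True, "cytoplasmic"),
    Protein("P4", True, False, False, "cytoplasmic"),
    Protein("P5", False, True, False, "cytoplasmic"),
]

core, essential_non_host, targets = subtractive_filter(demo)
print(len(core), len(essential_non_host), [p.protein_id for p in targets])
```

Returning every intermediate stage, rather than only the final list, makes it easy to report how many proteins survive each step, in the way the abstract reports 1,138, 18, and 12.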

Author and article information

Contributors

Nosheen Afzal Qureshi: URI : http://loop.frontiersin.org/people/999354/overview

Syeda Marriam Bakhtiar: URI : http://loop.frontiersin.org/people/95668/overview

Syed Babar Jamal: URI : http://loop.frontiersin.org/people/478599/overview

Journal

Journal ID (nlm-ta): Front Genet

Journal ID (iso-abbrev): Front Genet

Journal ID (publisher-id): Front. Genet.

Title: Frontiers in Genetics

Publisher: Frontiers Media S.A.

ISSN (Electronic): 1664-8021

Publication date (Electronic): 25 March 2021

Publication date Collection: 2021

Volume: 12

Electronic Location Identifier: 564056

Affiliations

[1] Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan

[2] Department of Biological Sciences, National University of Medical Sciences, Rawalpindi, Pakistan

[3] Department of Biochemistry, Bahauddin Zakariya University, Multan, Pakistan

[4] Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia

[5] Department of Pharmacology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia

[6] Department of Soil Science, College of Food and Agriculture Sciences, King Saud University, Riyadh, Saudi Arabia

[7] Department of Pharmacognosy (MAPPRC), College of Pharmacy, King Saud University, Riyadh, Saudi Arabia

Author notes

Edited by: Debmalya Barh, Institute of Integrative Omics and Applied Biotechnology (IIOAB), India

Reviewed by: Ashutosh Mani, Motilal Nehru National Institute of Technology Allahabad, India; Muhammad Tariq, University of Tabuk, Saudi Arabia; Nurnabi Azad Jewel, Shahjalal University of Science and Technology, Bangladesh; Muhammad Ilyas, Islamia College University, Pakistan

*Correspondence: Riaz Ullah, rullah@ksu.edu.sa

Syed Babar Jamal, babar.jamal@numspak.edu.pk; syedbabar.jamal@gmail.com

This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics

Article

DOI: 10.3389/fgene.2021.564056

PMC ID: 8027347

PubMed ID: 33841489

SO-VID: 13a79958-3d45-432a-9e74-1e5d966af3c8

License:

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

History

Date received : 20 May 2020

Date accepted : 16 February 2021

Page count

Figures: 14, Tables: 16, Equations: 0, References: 63, Pages: 20, Words: 0
