Advancing Drug Discovery through Integrative Computational Models and AI Technologies

Piotto, Stefano; Sessa, Lucía; Sottile, Eugenio; Sarkar, Arkadeep; Concilio, Simona

doi:10.58647/DRUGREPO.25.1.0001

Record: found
Abstract: found
Article: found

Is Open Access

Advancing Drug Discovery through Integrative Computational Models and AI Technologies

Published

research-article

Author(s): Stefano Piotto ¹ ^, ² ^, ³ ^, , Lucia Sessa ¹ ^, ² , Eugenio Sottile ² , Arkadeep Sarkar ² , Simona Concilio ² ^, ³

Publication date (Electronic): 13 March 2025

Journal: Drug Repurposing

Publisher: ScienceOpen

Keywords: drug discovery, molecular similarity, electrotopological descriptors

Abstract

The intricate, costly, and time-intensive nature of traditional drug discovery processes delays the development of novel pharmaceuticals. We proposed a drug repurposing workflow by integrating computational models, artificial intelligence, and molecular biology techniques to streamline drug discovery and enhance pharmacological research. This workflow moves beyond conventional methods, focusing on protein interactions and multiscale molecular analyses. Our approach addresses critical limitations in current methodologies: the oversimplification of receptor–ligand interactions, static representation of protein structures, and neglect of the complex electronic distributions in molecular interactions. Central to our methodology is enriching a comprehensive knowledge graph, integrating data from scientific literature and multiple databases such as UniProt for genes, PDB for protein structures, and ChEMBL for molecules. This knowledge graph is further enhanced by incorporating predicted drug–target and protein–protein interaction scores derived from structural similarity analyses. To demonstrate its potential, we applied this workflow to a specific use case, uncovering new insights into the mechanisms of action for repurposed drugs. This integration provides profound insights into the mechanistic processes underlying these interactions, establishing a new framework for biomedical research. Ultimately, this research enables a more accurate modeling of biological systems’ complex nature, facilitating the discovery of more effective and tailored medical treatments.

Main article text

INTRODUCTION

The classical concept of structure-based drug design, which depends on the concept of targeting a single receptor and its interaction with a ligand, has been increasingly criticized for presenting a simplistic view of cellular systems. This approach, which directs all attention on a single-target protein, fails to account for the complexity of biological networks where multiple proteins interact within cellular pathways.¹ Consequently, the single-target model overlooks the broader context of disease mechanisms, leading to incomplete drug efficacy and safety assessments. This narrow focus also ignores off-target effects that can be detrimental or beneficial to therapeutic outcomes.² A more holistic understanding of cellular processes is essential in complex diseases such as cancer, neurological disorders, and autoimmune diseases, where entire protein networks and pathways are disrupted. There is an increasing demand for advanced methodologies that can model the interconnected nature of these systems, enabling the identification of multiple drug targets and the prediction of system-wide effects, ultimately improving treatment efficacy and reducing side effects.³

To overcome the limitations of traditional protein models, which oversimplified the protein interactions, developing interaction networks is crucial.⁴ These networks offer a more comprehensive and realistic view of the cell, enabling researchers to capture the intricate dependencies and regulatory mechanisms between proteins. By simulating these processes, scientists can gain deeper insights into how drugs affect individual targets and entire protein networks, which is critical for understanding the broader implications of therapeutic interventions.⁵ These simulations are particularly valuable for modeling diseases driven by complex protein interactions, such as cancer or neurodegenerative disorders, where multiple signaling pathways are disrupted.^6,7

Given the limitations of current target-based models and their inability to fully capture the complexity of cellular networks, drug repurposing offers a practical alternative. By utilizing drugs with known safety profiles, repurposing allows researchers to bypass some of the challenges posed by incomplete models, offering a faster and more efficient approach to discovering treatments that can target complex diseases more effectively. Drug repurposing has become an essential strategy in modern pharmacology, offering a more cost-effective pathway to drug development. This approach leverages existing data on a drug’s safety, pharmacokinetics, and pharmacodynamics, significantly reducing the time and resources needed to bring treatments to market.^8,9 Additionally, drug repurposing has proven particularly valuable in addressing urgent medical needs, such as rare diseases, cancer, and emerging infectious diseases, where time is critical.¹⁰ The pre-existing regulatory approval of repurposed drugs also facilitates quicker clinical trials and regulatory processes, enhancing their appeal in research and industry settings. Recent successes in repurposing, such as the use of sildenafil for pulmonary hypertension and thalidomide for multiple myeloma, illustrate the growing importance of this strategy in modern drug discovery.¹¹

Advanced modeling techniques, such as multistate Markov models, can further enhance the understanding of drug–protein interactions. These models address the limitations of traditional approaches by capturing the dynamic and stochastic nature of protein behavior. Unlike static models, which capture only a single state, these models simulate the transitions between various molecular states over time, making them ideal for capturing the dynamic behavior of biological systems.^12–14 By allowing for real-time mapping of protein conformational changes and drug effects, multistate Markov models offer a more accurate understanding of how drugs interact with their targets within the cellular environment.¹⁵ This continuous updating of molecular characteristics reflects the transient and fluctuating nature of protein structures, which is essential for predicting drug efficacy in real-world biological systems. The flexibility of this approach enables the identification of intermediate states that are often missed in static models, providing valuable insights into drug–protein interactions that are critical for more precise and effective drug discovery.^16–18 To construct comprehensive protein interaction networks and improve the accuracy of drug–protein interaction models, we integrate two computational approaches: an electrotopological descriptor for molecular similarity and a tool for binding pocket comparison. The electrotopological descriptor calculates molecular similarity by analyzing structural and topological features, leveraging three-dimensional (3D) atomic arrangements to enable precise comparisons of molecular geometries—an advantage particularly valuable in dynamic environments like binding sites.^19,20 Meanwhile, the binding pocket comparison tool provides a robust analysis of binding site similarity through water-mapping-based molecular dynamics (MD), exploring the spatial and volumetric properties of binding pockets with a focus on hydrophilic and hydrophobic regions often overlooked by static methods.²¹ The integration of these methods supports the construction of dynamic protein interaction networks that capture real-time molecular behavior. By combining molecular similarity assessments with binding site analysis, we improve the precision of drug repurposing efforts, providing deeper insights into protein–ligand interactions and facilitating the identification of new therapeutic opportunities.²²

METHODS

Molecular Similarity Calculations

The electrotopological descriptor is a novel method designed to compute molecular similarity by considering molecules’ atomistic topology and geometry. Unlike traditional 2D-based methods, this approach uses 3D representations of molecules, allowing for a more accurate assessment of molecular similarities, particularly in dynamic environments like protein binding sites. We encode both the atomistic topology and geometry of each atom to precisely measure intermolecular distances. The algorithm starts by computing the atomic properties and positioning the molecule at the center of a sphere. A projection is then cast onto the sphere’s surface, creating a 2D shadow of each atom on the opposite side. The location of each shadow point is determined by converting the atom’s Cartesian coordinates (x, y, z) into spherical coordinates (see Figure 1 ). The algorithm calculates pairwise distances between atoms of two molecules, adjusting these distances based on atomistic properties such as charge and atomic weight. This flexibility is essential when molecules undergo conformational changes, especially in dynamic binding environments. The electrotopological descriptor ensures consistent similarity assessments across different molecular topologies by incorporating a hyperbolic tangent function-based normalization.²³

Atomic positions mapped from 3D coordinates, encoding properties and spatial relationships in a 2D projection.

Figure 1.

Representation of mapped atomic positions derived from 3D Cartesian coordinates. Each atom’s properties (x_a ,x_b ,x_c , etc.) are mapped onto specific atomic points, encoding key atomic attributes. The angles (ϕa and ψa) represent the radial coordinates of the atoms within the molecular structure, capturing their spatial relationships. The atom a′ is a 2D mapping of the original atom a. 2D, two-dimensional; 3D, three-dimensional.

Binding Site Similarity Analysis

The binding pocket comparison tool uses MD simulations to explore protein binding pockets’ spatial and volumetric properties. As shown in Figure 2 , by introducing water molecules into the MD simulation, this tool maps the dynamic behavior of binding sites,²⁴ including hydrophilic regions often overlooked by static methods. The algorithm iteratively calculates the binding pocket’s volume and hydrophilic characteristics, comprehensively analyzing the binding site’s geometry and flexibility.

Binding pocket analysis via MD simulation, highlighting hydration zones and interactions for drug-binding evaluation.

Figure 2.

The binding pocket comparison tool’s process for analyzing binding sites using molecular dynamics. Initial setup with water molecules in the binding pocket, followed by a 10-ns MD simulation (top left). The network of hydrogen bonds among water molecules within the pocket highlights hydrophilic interactions (center left). A refined water distribution map identifying key hydration zones (center right). Final binding pocket structure, showing hydrophilic and hydrophobic regions essential for evaluating binding affinity and drug interactions (far right).

The binding pocket comparison tool employs MD simulations to map receptors’ active sites comprehensively. By introducing water molecules into the system, the method fully exploits their ability to occupy cavities through cohesive hydrogen-bonding networks. These networks capture essential details about the geometry and topology of the binding site, providing a richer characterization than simple volumetric analyses.

Multiple water networks are generated during all-atom MD simulations of the hydrated receptor, corresponding to different snapshots along the simulation trajectory. It has been proved that 10 ns is sufficient to map the binding site. Averaging the probability of water occupancy across these snapshots allows the tool to integrate geometric, topological, and dynamic information about the binding pocket. This approach characterizes the receptor’s binding site not as a static structure but as a dynamic entity, reflecting its flexibility and interaction potential.

In addition, the tool introduces a novel volume- and topology-based similarity metric for comparing binding pockets across proteins. This innovation enables the identification of structurally similar binding sites that may accommodate existing drugs, offering new opportunities for drug repurposing and therapeutic innovation.

Hydrophobic interactions, often associated with regions favoring the binding of nonpolar molecules, are frequently misunderstood. These interactions are best described as van der Waals forces arising from temporary and induced dipoles. In aqueous environments, where water molecules exhibit a strong dipole moment, these interactions are enhanced due to significant dipole-induced effects.

The preference of nonpolar molecules for hydrophobic sites does not stem solely from direct interaction strength but is primarily driven by the entropic contribution of bulk water molecules. Upon binding, water molecules are displaced from the cavity, resulting in an entropic gain stabilizing the interaction. Mapping binding cavities with water molecules provides a detailed and exhaustive exploration of their geometry and enables the creation of cohesive water networks. These networks can be represented and analyzed as graphs, offering a robust framework to understand and compare binding site properties across different proteins.

Integrating Heterogeneous Knowledge Graphs for Protein Interaction Networks

To capture the complex nature of protein interactions, we constructed a network by integrating data from a wide range of sources, including literature and molecular, genomic, structural, and ontological databases. The following databases were utilized in this framework: MeSH,²⁵ Drugs@FDA,²⁶ RxList,²⁷ DrugBank,²⁸ PubChem,²⁹ MedlinePlus,³⁰ PubMed,³¹ UMLS,³² NCBI,³³ UniProt,³⁴ GO Ontology, GO Gene, GO Annotation,³⁵ Reactome,³⁶ KEGG,³⁷ and RCSB PDB.³⁸ This comprehensive integration allows for the identification of deep correlations among proteins, their structures, and associated pathways. By combining heterogeneous knowledge graphs with tools for molecular similarity and binding pocket comparison, the network integrates various perspectives on protein function and interaction.

The workflow begins with a semantic literature analysis to create a disease-specific knowledge graph encompassing relationships among diseases, genes, drugs, and other biological entities. This foundation is enhanced with molecular similarity data and binding pocket comparison analyses, enabling the refinement of protein interaction networks. For example, edges in the graph are reinforced for proteins with structurally similar active sites, emphasizing their functional relevance, while the overall interaction weights are normalized to maintain network consistency.

This comprehensive approach leverages over 60 million publications from PubMed and data from molecular and genomic repositories, creating a unified framework to explore the intricate interplay of molecular and cellular components associated with a given disease.

We integrate molecular similarity data from the electrotopological descriptor and binding pocket comparison tools alongside gene expression data to refine the protein interaction network. Gene expression levels are obtained from publicly available sources, such as TCGA, to establish baseline activity for genes encoding the proteins of interest. The electrotopological descriptor computes pairwise molecular similarities based on proteins’ 3D atomic topology and geometry, generating a similarity matrix. The molecular similarity data from the electrotopological descriptor are used to strengthen interactions between proteins that share strong molecular interactions with common molecules, applying a similar boosting approach. This iterative process of reinforcement and normalization yields a biologically relevant, pruned protein interaction network. We filter out weaker interactions by applying a threshold to this matrix, focusing on the most significant relationships, which are normalized for consistency.

RESULTS

As a proof of concept, we applied the integrated approach of electrotopological descriptors, a binding pocket comparison tool, and heterogeneous knowledge graph integration to a set of established drug–disease systems. We considered hepatocellular carcinoma and diabetes mellitus, along with their associated drugs, as the established drug–disease systems. This approach allowed us to dynamically map disease-associated protein networks and evaluate the impact of various drug interactions on cellular processes. The electrotopological descriptors enabled precise molecular similarity calculations, while the binding pocket comparison tool provided detailed insights into binding site dynamics. In an initial case study, we focused on a known drug repurposing candidate for cancer treatment. Using the electrotopological descriptors, we identified a structurally similar compound with potential therapeutic effects. It was implemented on a set of clinically approved 153 antineoplastic small molecules retrieved from ChEMBL database³⁹ to perform a similarity-based drug repurposing against tyrosine kinase that plays a pivotal role in the development of hepatocellular carcinoma, renal, and thyroid cancer.⁴⁰ It was reported that drug-like sorafenib actively inhibits the tyrosine kinase receptor.⁴¹ It suppresses tumor growth by inhibiting the RAF/MEK/ERK signaling pathway, which is crucial for cell proliferation and survival. Additionally, it reduces tumor angiogenesis by blocking vascular endothelial growth factor (VEGFR) and platelet-derived growth factor (PDGFR), key receptors involved in the formation and stabilization of blood vessels.⁴² To identify a similar drug to sorafenib from the retrieved molecules, we performed the electrotopological descriptor-based similarity assessment by considering sorafenib as the reference molecule. We used atomistic formal charge and mass properties for the intermolecular distance calculation. The obtained result suggested that regorafenib depicted maximum similarity with sorafenib with a similarity value of 0.97 ( Figure 3 ). The literature source also validated regorafenib as an inhibitor of tyrosine kinase^45,46 and is very similar to sorafenib.⁴⁷

Scatter plot of molecular similarity to sorafenib based on charge and mass, identifying key tyrosine kinase inhibitors.

Figure 3.

Scatter plot illustrating molecular similarity (similarity > 0.60) based on atomic formal charge and atomic mass for compounds compared to sorafenib in an in-vacuum state. Regorafenib exhibited the highest similarity to sorafenib. Notably, capmatinib and alpelisib demonstrated similarity scores exceeding 0.80. The literature evidence further indicates that both drugs are reported inhibitors of tyrosine kinase.^43,44

The binding pocket comparison tool confirmed the compound’s ability to bind to the target protein’s binding site effectively, and the integration of heterogeneous knowledge graphs illustrated the complex interactions between the drug and various proteins within the network. Traditional drug discovery approaches often rely on identifying known target protein inhibitors and searching molecular databases for structurally similar compounds to evaluate their potency against the same target. However, this approach overlooks a significant portion of the chemical space, potentially missing promising molecules with therapeutic potential. The proposed protocol overcomes these limitations by mapping proteins with similar binding pockets and their associated known inhibitors. These inhibitors can then be leveraged to analyze their binding potential and activity against the target of interest. This strategy broadens the exploration of chemical libraries, enabling the identification of novel compounds that may have been overlooked using conventional methods, thereby enhancing the discovery of new therapeutic candidates. Compared to traditional static models such as CASTp,⁴⁸ the integration of protein dynamics-based network analysis provides a more comprehensive understanding of the drug’s effects within the complex network of cellular interactions.

Upon establishing this knowledge graph, a tool for binding pocket comparison is employed to analyze target proteins with structural or functional similarities, focusing on their binding sites to identify interaction patterns. This is complemented by electrotopological descriptors to calculate molecular similarities, facilitating the identification of compounds with analogous therapeutic potential based on their structural attributes. The final graph ( Figure 4 ) integrates gene expression data and structural and functional similarities, offering a more accurate model of protein interactions within cellular systems. This refined network is instrumental in elucidating disease mechanisms, identifying drug targets, and predicting the broader effects of therapeutic interventions.

Network graph linking genes, drugs, and diseases in hepatocellular carcinoma and diabetes treatment.

Figure 4.

The network graph illustrating key molecular and genetic components associated with hepatocellular carcinoma and diabetes mellitus. The central pink node represents the disease, while the cyan circles indicate genes prominently involved in the disease mechanism. Dark purple circles represent drugs used in therapeutic interventions. The thickness of the edges indicates the strength of interactions, highlighting the complex interconnectivity between genes, molecules, and drugs relevant to hepatocellular carcinoma and diabetes mellitus treatment. The mentioned graph can be accessed at https://newroad.biovista.com/#!bv_gid=179e4d5784d7407bd321faa1542120fb

CONCLUSION

This workflow highlights the transformative potential of computational methodologies for drug repurposing, with the development of a framework that integrates molecular similarity, binding site comparison, and semantic analysis of the literature. This integration leverages diverse sources and databases, such as molecular structures, binding site information, and literature-based insights, enabling a unified and comprehensive exploration of drug–disease interactions. This approach enables the systematic and data-driven exploration of drug–disease interactions, offering a significant advancement in computational tools for drug discovery. Beyond its immediate applications, this methodology underscores the evolving landscape of drug repurposing, where the integration of semantic analysis and artificial intelligence (AI) introduces both unprecedented opportunities and complex challenges. One of the most significant advancements is the ability to process and synthesize vast amounts of heterogeneous data, providing previously inaccessible insights. Additionally, this framework facilitates data accuracy and reliability validation by cross-referencing multiple sources and leveraging computational power for deeper analysis. However, this also raises challenges in ensuring such data’s accuracy, relevance, and contextual interpretation, mainly when it originates from unstructured sources like scientific literature. The interaction between semantic search and AI further shifts the role of the researcher. Rather than being a data consumer, the researcher must act as an interpreter and integrator, navigating rapidly expanding possibilities. This requires a deep understanding of the underlying scientific principles and the ability to critically evaluate and validate computational predictions. The need for interdisciplinary expertise becomes apparent, as bridging computational tools with experimental validation is essential to translate predictions into actionable insights.

Integrating diverse datasets and sophisticated algorithms demands significant computational power, raising questions about scalability and accessibility. Optimizing workflows to ensure seamless integration, computational efficiency, and data validation is crucial for overcoming these challenges and enhancing scalability. Finally, this study demonstrates that while computational methods significantly improve the scope and depth of drug repurposing efforts, their full potential is yet to be realized. The ability to uncover novel therapeutic applications hinges on the platform’s robustness and the researcher’s ability to adapt to this new paradigm. The interplay between advanced computational tools and human expertise will ultimately define the success of such integrative approaches, paving the way for more innovative and effective strategies in drug discovery.

DATA AND CODE AVAILABILITY

Data and code availability does not apply.

CONFLICTS OF INTEREST

The author declares no conflict of interest.

REFERENCES

Hopkins AL. Network pharmacology: The next paradigm in drug discovery. Nat Chem Biol. 2008. Vol. 4(11):682–690. [Cross Ref]
Knight ZA, Lin H, Shokat KM. Targeting the cancer kinome through polypharmacology. Nat Rev Cancer. 2010. Vol. 10(2):130–137. [Cross Ref]
Schadt EE. Molecular networks as sensors and drivers of common human diseases. Nature. 2009. Vol. 461(7261):218–223. [Cross Ref]
Barabási A-L, Oltvai ZN. Network biology: Understanding the cell’s functional organization. Nat Rev Genet. 2004. Vol. 5(2):101–113. [Cross Ref]
Vidal M, Cusick ME, Barabási A-L. Interactome networks and human disease. Cell. 2011. Vol. 144(6):986–998. [Cross Ref]
De Las Rivas J, Fontanillo C. Protein-protein interaction networks: Unraveling the wiring of molecular machines within the cell. Brief Funct Genomics. 2012. Vol. 11(6):489–496. [Cross Ref]
Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005. Vol. 4(1):17[Cross Ref]
Ashburn TT, Thor KB. Drug repositioning: Identifying and developing new uses for existing drugs. Nat Rev Drug Discov. 2004. Vol. 3(8):673–683. [Cross Ref]
Pushpakom S, Iorio F, Eyers PA, et al.. Drug repurposing: Progress, challenges and recommendations. Nat Rev Drug Discov. 2019. Vol. 18(1):41–58. [Cross Ref]
Nosengo N. Can you teach old drugs new tricks? Nature. 2016. Vol. 534(7607):314–316. [Cross Ref]
Palumbo A, Facon T, Sonneveld P, et al.. Thalidomide for treatment of multiple myeloma: 10 years later. Blood. 2008. Vol. 111(8):3968–3977. [Cross Ref]
Bowman GR, Pande VS, Noé F. An introduction to markov state models and their application to long timescale molecular simulation. Springer Science & Business Media. 2013. Vol. Vol. 797. [Cross Ref]
Chodera JD, Noé F. Markov state models of biomolecular conformational dynamics. Curr Opin Struct Biol. 2014. Vol. 25:135–144. [Cross Ref]
Piotto S, Sessa L, Santoro J, Di Biasi L. Artificial chemical neural network for drug discovery applicationsSchneider JJ, Weyland MS, Flumini D, Füchslin RM. Artificial life and evolutionary computation: Communications in computer and information science. Cham: Springer. 2022. p. 225–229. [Cross Ref]
Husic BE, Pande VS. Markov state models: From an art to a science. J Am Chem Soc. 2018. Vol. 140(7):2386–2396. [Cross Ref]
Noé F, Fischer S. Transition networks for modeling the kinetics of conformational change in macromolecules. Curr Opin Struct Biol. 2008. Vol. 18(2):154–162. [Cross Ref]
Prinz J-H, Wu H, Sarich M, et al.. Markov models of molecular kinetics: Generation and validation. J Chem Phys. 2011. Vol. 134(17):174105. [Cross Ref]
Piotto S, Nardiello AM, Di Biasi L, Sessa L. Encoding materials dynamics for machine learning applicationsPiotto S, Concilio S, Sessa L, Rossi F. Advances in bionanomaterials II. BIONAM 2019: Lecture Notes in Bioengineering. Cham: Springer. 2020. 128–136. [Cross Ref]
Maldonado AG, Doucet JP, Petitjean M, Fan B-T. Molecular similarity and diversity in chemoinformatics: From theory to applications. Mol Div. 2006. Vol. 10:39–79. [Cross Ref]
Hall LH, Kier LB. Electrotopological state indices for atom types: A novel combination of electronic, topological, and valence state information. J Chem Inform Comput Sci. 1995. Vol. 35(6):1039–1045
Malisi C, Schumann M, Toussaint NC, Kageyama J, Kohlbacher O, Höcker B. Binding pocket optimization by computational protein design. PLoS One. 2012. Vol. 7(12):e52505. [Cross Ref]
Wei B, Zhang Y, Gong X. DeepLPI: A novel deep learning-based model for protein–ligand interaction prediction for drug repurposing. Sci Rep. 2022. Vol. 12(1):18200. [Cross Ref]
Fan S-KS, Jen C-H, Lee T-Y. Modeling and monitoring the nonlinear profile of heat treatment process data by using an approach based on a hyperbolic tangent function. Qual Eng. 2017. Vol. 29(2):226–243. [Cross Ref]
Ciancetta A, Gill AK, Ding T, et al.. Probe confined dynamic mapping for G protein-coupled receptor allosteric site prediction. ACS Cent Sci. 2021. Vol. 7(11):1847–1862. [Cross Ref]
Tsuyuzaki K, Morota G, Ishii M, Nakazato T, Miyazaki S, Nikaido I. MeSH ORA framework: R/Bioconductor packages to support MeSH over-representation analysis. BMC Bioinformatics. 2015. Vol. 16:45[Cross Ref]
Schwartz LM, Woloshin S, Zheng E, Tse T, Zarin DA. ClinicalTrials.gov and Drugs@FDA: A comparison of results reporting for new drug approval trials. Ann Intern Med. 2016. Vol. 165(6):421–430. [Cross Ref]
Smalls D, Akaeme O, Hailemeskel B, Maneno M. Availability of various categories of drug-related information among free drug databases: Survey of first professional year students. Int J Pharma Care Health IJPCH. 2019. Vol. 103(10)[Cross Ref]
Wishart DS, Feunang YD, Guo AC, et al.. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018. Vol. 46(D1):D1074–D1082. [Cross Ref]
Kim S, Thiessen PA, Bolton EE, et al.. PubChem substance and compound databases. Nucleic acids Res. 2016. Vol. 44(D1):D1202–D1213. [Cross Ref]
National Library of Medicine. MedlinePlus [Internet]. Bethesda, MD: National Library of Medicine. https://medlineplus.gov
Canese K, Weis S. PubMed: The bibliographic database. The NCBI Handbook. 2013. Vol. 2(1)
Huanying G, Yehoshua P, James G, Michael H, Li-min L, James JC. Representing the UMLS as an object-oriented database: Modeling issues and advantages. J Am Med Inform Assoc. 2000. Vol. 7(1):66–80. [Cross Ref]
Federhen S. The NCBI taxonomy database. Nucleic Acids Res. 2012. Vol. 40(D1):D136–D143. [Cross Ref]
UniProt Consortium. UniProt: A hub for protein information. Nucleic Acids Res. 2015. Vol. 43(D1):D204–D212. [Cross Ref]
Harris MA, Clark J, Ireland A, et al.. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004. Vol. 32 Suppl 1:D258–D261. [Cross Ref]
Croft D, O’kelly G, Wu G, et al.. Reactome: A database of reactions, pathways and biological processes. Nucleic Acids Res. 2010. Vol. 39 Suppl 1:D691–D697. [Cross Ref]
Kanehisa M. The KEGG databaseBock G, Goode JA. ‘In silico’ simulation of biological processes: Novartis Foundation Symposium 247. Wiley Online Library. 2002. [Cross Ref]
Rose PW, Bi C, Bluhm WF, et al.. The RCSB Protein Data Bank: New resources for research and education. Nucleic Acids Res. 2013. Vol. 41(D1):D475–D482. [Cross Ref]
Gaulton A, Bellis LJ, Bento AP, et al.. ChEMBL: A large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012. Vol. 40(D1):D1100–D1107. [Cross Ref]
Du Z, Lovly CM. Mechanisms of receptor tyrosine kinase activation in cancer. Mol Cancer. 2018. Vol. 17:58[Cross Ref]
Yunus M, Jansson PJ, Kovacevic Z, Kalinowski DS, Richardson DR. Tumor-induced neoangiogenesis and receptor tyrosine kinases – Mechanisms and strategies for acquired resistance. Biochim Biophys Acta Gen Subj. 2019. Vol. 1863(7):1217–1225. [Cross Ref]
Chen R, Li Q, Xu S, et al.. Modulation of the tumour microenvironment in hepatocellular carcinoma by tyrosine kinase inhibitors: From modulation to combination therapy targeting the microenvironment. Cancer Cell Int. 2022. Vol. 22(1):73[Cross Ref]
Vansteenkiste JF, Van de Kerkhove C, Wauters E, Van Mol P. Capmatinib for the treatment of non-small cell lung cancer. Expert Rev Anticancer Ther. 2019. Vol. 19(8):659–671. [Cross Ref]
Mayer IA, Abramson VG, Formisano L, et al.. A phase Ib study of alpelisib (BYL719), a PI3Kα-specific inhibitor, with letrozole in ER+/HER2− metastatic breast cancer. Clin Cancer Res. 2017. Vol. 23(1):26–34. [Cross Ref]
Crona DJ, Keisler MD, Walko CM. Regorafenib: A novel multitargeted tyrosine kinase inhibitor for colorectal cancer and gastrointestinal stromal tumors. Ann Pharmacother. 2013. Vol. 47(12):1685–1696. [Cross Ref]
Wilhelm SM, Dumas J, Adnane L, et al.. Regorafenib (BAY 73-4506): A new oral multikinase inhibitor of angiogenic, stromal and oncogenic receptor tyrosine kinases with potent preclinical antitumor activity. Int J Cancer. 2011. Vol. 129(1):245–255. [Cross Ref]
Frenette CT. The role of regorafenib in hepatocellular carcinoma. Gastroenterol Hepatol (N Y). 2017. Vol. 13(2):122–124
Tian W, Chen C, Lei X, Zhao J, Liang J. CASTp 3.0: Computed atlas of surface topography of proteins. Nucleic Acids Res. 2018. Vol. 46(W1):W363–W367. [Cross Ref]

Author and article information

Journal

Journal ID (publisher-id): dr

Title: Drug Repurposing

Publisher: ScienceOpen (Berlin )

ISSN (Electronic): 2941-2528

Publication date (Electronic): 13 March 2025

Volume: 2

Issue: 1

Electronic Location Identifier: e20250001

Affiliations

[1 ] SoftMining Srl, Via Giovanni Paolo II 132, 84084 Fisciano, Italy ( https://ror.org/0192m2k53)

[2 ] Department of Pharmacy, University of Salerno, Via Giovanni Paolo II 132, 84084 Fisciano, Italy ( https://ror.org/0192m2k53)

[3 ] BIONAM – Interdepartmental Research Center for Biomaterials, University of Salerno, Via Giovanni Paolo II 132, 84084 Fisciano, Italy;

Author notes

*Correspondence to: Stefano Piotto, SoftMining Srl, Via Giovanni Paolo II 132, 84084 Fisciano, Italy. E-mail: stefano@ 123456softmining.it

Author information

Stefano Piotto https://orcid.org/0000-0002-3102-1918

Lucia Sessa https://orcid.org/0000-0002-5343-2777

Arkadeep Sarkar https://orcid.org/0009-0008-5720-1854

Simona Concilio https://orcid.org/0000-0002-1461-9301

Article

DOI: 10.58647/DRUGREPO.25.1.0001

SO-VID: 4366ead5-6031-4b0f-88f1-1990da258148

License:

This work has been published open access under Creative Commons Attribution License (CC BY) 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Conditions, terms of use and publishing policy can be found at www.scienceopen.com.

History

Date received : 12 November 2024

Date accepted : 14 February 2025

Page count

Figures: 4, References: 48, Pages: 8

Funding

Funded by: European Union, EU4Health Programme (EU4H)

Award ID: 101080024

This work was funded by the project NEWROAD, Grant Agreement No 101080024—co-funded by the European Union, EU4Health Programme (EU4H); and by COST Action CA21169 Information, Coding, and Biological Function: the Dynamics of Life (DYNALIFE).

Comments

Comment on this article

[1] Hopkins AL. Network pharmacology: The next paradigm in drug discovery. Nat Chem Biol. 2008. Vol. 4(11):682–690. [Cross Ref]

[2] Knight ZA, Lin H, Shokat KM. Targeting the cancer kinome through polypharmacology. Nat Rev Cancer. 2010. Vol. 10(2):130–137. [Cross Ref]

[3] Schadt EE. Molecular networks as sensors and drivers of common human diseases. Nature. 2009. Vol. 461(7261):218–223. [Cross Ref]

[4] Barabási A-L, Oltvai ZN. Network biology: Understanding the cell’s functional organization. Nat Rev Genet. 2004. Vol. 5(2):101–113. [Cross Ref]

[5] Vidal M, Cusick ME, Barabási A-L. Interactome networks and human disease. Cell. 2011. Vol. 144(6):986–998. [Cross Ref]

[6] De Las Rivas J, Fontanillo C. Protein-protein interaction networks: Unraveling the wiring of molecular machines within the cell. Brief Funct Genomics. 2012. Vol. 11(6):489–496. [Cross Ref]

[7] Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005. Vol. 4(1):17[Cross Ref]

[8] Ashburn TT, Thor KB. Drug repositioning: Identifying and developing new uses for existing drugs. Nat Rev Drug Discov. 2004. Vol. 3(8):673–683. [Cross Ref]

[9] Pushpakom S, Iorio F, Eyers PA, et al.. Drug repurposing: Progress, challenges and recommendations. Nat Rev Drug Discov. 2019. Vol. 18(1):41–58. [Cross Ref]

[10] Nosengo N. Can you teach old drugs new tricks? Nature. 2016. Vol. 534(7607):314–316. [Cross Ref]

[11] Palumbo A, Facon T, Sonneveld P, et al.. Thalidomide for treatment of multiple myeloma: 10 years later. Blood. 2008. Vol. 111(8):3968–3977. [Cross Ref]

[12] Bowman GR, Pande VS, Noé F. An introduction to markov state models and their application to long timescale molecular simulation. Springer Science & Business Media. 2013. Vol. Vol. 797. [Cross Ref]

[13] Chodera JD, Noé F. Markov state models of biomolecular conformational dynamics. Curr Opin Struct Biol. 2014. Vol. 25:135–144. [Cross Ref]

[14] Piotto S, Sessa L, Santoro J, Di Biasi L. Artificial chemical neural network for drug discovery applicationsSchneider JJ, Weyland MS, Flumini D, Füchslin RM. Artificial life and evolutionary computation: Communications in computer and information science. Cham: Springer. 2022. p. 225–229. [Cross Ref]

[15] Husic BE, Pande VS. Markov state models: From an art to a science. J Am Chem Soc. 2018. Vol. 140(7):2386–2396. [Cross Ref]

[16] Noé F, Fischer S. Transition networks for modeling the kinetics of conformational change in macromolecules. Curr Opin Struct Biol. 2008. Vol. 18(2):154–162. [Cross Ref]

[17] Prinz J-H, Wu H, Sarich M, et al.. Markov models of molecular kinetics: Generation and validation. J Chem Phys. 2011. Vol. 134(17):174105. [Cross Ref]

[18] Piotto S, Nardiello AM, Di Biasi L, Sessa L. Encoding materials dynamics for machine learning applicationsPiotto S, Concilio S, Sessa L, Rossi F. Advances in bionanomaterials II. BIONAM 2019: Lecture Notes in Bioengineering. Cham: Springer. 2020. 128–136. [Cross Ref]

[19] Maldonado AG, Doucet JP, Petitjean M, Fan B-T. Molecular similarity and diversity in chemoinformatics: From theory to applications. Mol Div. 2006. Vol. 10:39–79. [Cross Ref]

[20] Hall LH, Kier LB. Electrotopological state indices for atom types: A novel combination of electronic, topological, and valence state information. J Chem Inform Comput Sci. 1995. Vol. 35(6):1039–1045

[21] Malisi C, Schumann M, Toussaint NC, Kageyama J, Kohlbacher O, Höcker B. Binding pocket optimization by computational protein design. PLoS One. 2012. Vol. 7(12):e52505. [Cross Ref]

[22] Wei B, Zhang Y, Gong X. DeepLPI: A novel deep learning-based model for protein–ligand interaction prediction for drug repurposing. Sci Rep. 2022. Vol. 12(1):18200. [Cross Ref]

[23] Fan S-KS, Jen C-H, Lee T-Y. Modeling and monitoring the nonlinear profile of heat treatment process data by using an approach based on a hyperbolic tangent function. Qual Eng. 2017. Vol. 29(2):226–243. [Cross Ref]

[24] Ciancetta A, Gill AK, Ding T, et al.. Probe confined dynamic mapping for G protein-coupled receptor allosteric site prediction. ACS Cent Sci. 2021. Vol. 7(11):1847–1862. [Cross Ref]

[25] Tsuyuzaki K, Morota G, Ishii M, Nakazato T, Miyazaki S, Nikaido I. MeSH ORA framework: R/Bioconductor packages to support MeSH over-representation analysis. BMC Bioinformatics. 2015. Vol. 16:45[Cross Ref]

[26] Schwartz LM, Woloshin S, Zheng E, Tse T, Zarin DA. ClinicalTrials.gov and Drugs@FDA: A comparison of results reporting for new drug approval trials. Ann Intern Med. 2016. Vol. 165(6):421–430. [Cross Ref]

[27] Smalls D, Akaeme O, Hailemeskel B, Maneno M. Availability of various categories of drug-related information among free drug databases: Survey of first professional year students. Int J Pharma Care Health IJPCH. 2019. Vol. 103(10)[Cross Ref]

[28] Wishart DS, Feunang YD, Guo AC, et al.. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018. Vol. 46(D1):D1074–D1082. [Cross Ref]

[29] Kim S, Thiessen PA, Bolton EE, et al.. PubChem substance and compound databases. Nucleic acids Res. 2016. Vol. 44(D1):D1202–D1213. [Cross Ref]

[30] National Library of Medicine. MedlinePlus [Internet]. Bethesda, MD: National Library of Medicine. https://medlineplus.gov

[31] Canese K, Weis S. PubMed: The bibliographic database. The NCBI Handbook. 2013. Vol. 2(1)

[32] Huanying G, Yehoshua P, James G, Michael H, Li-min L, James JC. Representing the UMLS as an object-oriented database: Modeling issues and advantages. J Am Med Inform Assoc. 2000. Vol. 7(1):66–80. [Cross Ref]

[33] Federhen S. The NCBI taxonomy database. Nucleic Acids Res. 2012. Vol. 40(D1):D136–D143. [Cross Ref]

[34] UniProt Consortium. UniProt: A hub for protein information. Nucleic Acids Res. 2015. Vol. 43(D1):D204–D212. [Cross Ref]

[35] Harris MA, Clark J, Ireland A, et al.. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004. Vol. 32 Suppl 1:D258–D261. [Cross Ref]

[36] Croft D, O’kelly G, Wu G, et al.. Reactome: A database of reactions, pathways and biological processes. Nucleic Acids Res. 2010. Vol. 39 Suppl 1:D691–D697. [Cross Ref]

[37] Kanehisa M. The KEGG databaseBock G, Goode JA. ‘In silico’ simulation of biological processes: Novartis Foundation Symposium 247. Wiley Online Library. 2002. [Cross Ref]

[38] Rose PW, Bi C, Bluhm WF, et al.. The RCSB Protein Data Bank: New resources for research and education. Nucleic Acids Res. 2013. Vol. 41(D1):D475–D482. [Cross Ref]

[39] Gaulton A, Bellis LJ, Bento AP, et al.. ChEMBL: A large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012. Vol. 40(D1):D1100–D1107. [Cross Ref]

[40] Du Z, Lovly CM. Mechanisms of receptor tyrosine kinase activation in cancer. Mol Cancer. 2018. Vol. 17:58[Cross Ref]

[41] Yunus M, Jansson PJ, Kovacevic Z, Kalinowski DS, Richardson DR. Tumor-induced neoangiogenesis and receptor tyrosine kinases – Mechanisms and strategies for acquired resistance. Biochim Biophys Acta Gen Subj. 2019. Vol. 1863(7):1217–1225. [Cross Ref]

[42] Chen R, Li Q, Xu S, et al.. Modulation of the tumour microenvironment in hepatocellular carcinoma by tyrosine kinase inhibitors: From modulation to combination therapy targeting the microenvironment. Cancer Cell Int. 2022. Vol. 22(1):73[Cross Ref]

[43] Vansteenkiste JF, Van de Kerkhove C, Wauters E, Van Mol P. Capmatinib for the treatment of non-small cell lung cancer. Expert Rev Anticancer Ther. 2019. Vol. 19(8):659–671. [Cross Ref]

[44] Mayer IA, Abramson VG, Formisano L, et al.. A phase Ib study of alpelisib (BYL719), a PI3Kα-specific inhibitor, with letrozole in ER+/HER2− metastatic breast cancer. Clin Cancer Res. 2017. Vol. 23(1):26–34. [Cross Ref]

[45] Crona DJ, Keisler MD, Walko CM. Regorafenib: A novel multitargeted tyrosine kinase inhibitor for colorectal cancer and gastrointestinal stromal tumors. Ann Pharmacother. 2013. Vol. 47(12):1685–1696. [Cross Ref]

[46] Wilhelm SM, Dumas J, Adnane L, et al.. Regorafenib (BAY 73-4506): A new oral multikinase inhibitor of angiogenic, stromal and oncogenic receptor tyrosine kinases with potent preclinical antitumor activity. Int J Cancer. 2011. Vol. 129(1):245–255. [Cross Ref]

[47] Frenette CT. The role of regorafenib in hepatocellular carcinoma. Gastroenterol Hepatol (N Y). 2017. Vol. 13(2):122–124

[48] Tian W, Chen C, Lei X, Zhao J, Liang J. CASTp 3.0: Computed atlas of surface topography of proteins. Nucleic Acids Res. 2018. Vol. 46(W1):W363–W367. [Cross Ref]

Submit your manuscript to the new open access journal Drug Repurposing. Open for research articles, reviews, discussions, case studies, negative results across the whole spectrum of drug repurposing.

No article processing charges.

Drug Repurposing

Advancing Drug Discovery through Integrative Computational Models and AI Technologies

Abstract

Main article text

INTRODUCTION

METHODS

Molecular Similarity Calculations

Binding Site Similarity Analysis

Integrating Heterogeneous Knowledge Graphs for Protein Interaction Networks

RESULTS

CONCLUSION

DATA AND CODE AVAILABILITY

CONFLICTS OF INTEREST

REFERENCES

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Comments

Comment on this article