Applications of machine learning in metabolomics: Disease modeling and classification

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Metabolomics research has recently gained popularity because it enables the study of biological traits at the biochemical level and, as a result, can directly reveal what occurs in a cell or a tissue based on health or disease status, complementing other omics such as genomics and transcriptomics. Like other high-throughput biological experiments, metabolomics produces vast volumes of complex data. The application of machine learning (ML) to analyze data, recognize patterns, and build models is expanding across multiple fields. In the same way, ML methods are utilized for the classification, regression, or clustering of highly complex metabolomic data. This review discusses how disease modeling and diagnosis can be enhanced via deep and comprehensive metabolomic profiling using ML. We discuss the general layout of a metabolic workflow and the fundamental ML techniques used to analyze metabolomic data, including support vector machines (SVM), decision trees, random forests (RF), neural networks (NN), and deep learning (DL). Finally, we present the advantages and disadvantages of various ML methods and provide suggestions for different metabolic data analysis scenarios.

Related collections

Most cited references 137

Record: found
Abstract: found
Article: not found

Deep learning.

Yann LeCun, Yoshua Bengio, Geoffrey E Hinton (2015)

Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

0 comments Cited 10097 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Support-vector networks

Corinna Cortes, Vladimir Vapnik (1995)

0 comments Cited 3388 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Multilayer feedforward networks are universal approximators

Kurt Hornik, Maxwell Stinchcombe, Halbert L. White (1989)

0 comments Cited 2746 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Aya Galal: URI : https://loop.frontiersin.org/people/2002768/overview

Ahmed Moustafa: URI : https://loop.frontiersin.org/people/399317/overview

Journal

Journal ID (nlm-ta): Front Genet

Journal ID (iso-abbrev): Front Genet

Journal ID (publisher-id): Front. Genet.

Title: Frontiers in Genetics

Publisher: Frontiers Media S.A.

ISSN (Electronic): 1664-8021

Publication date (Electronic): 24 November 2022

Publication date Collection: 2022

Volume: 13

Electronic Location Identifier: 1017340

Affiliations

[1] ¹ Systems Genomics Laboratory , American University in Cairo , New Cairo, Egypt

[2] ² Institute of Global Health and Human Ecology , American University in Cairo , New Cairo, Egypt

[3] ³ Biotechnology Graduate Program , American University in Cairo , New Cairo, Egypt

[4] ⁴ Department of Biology , American University in Cairo , New Cairo, Egypt

Author notes

Edited by: Mehdi Pirooznia, Johnson & Johnson, United States

Reviewed by: Marco Vanoni, University of Milano-Bicocca, Italy

Jagadheshwar Balan, Mayo Clinic, United States

*Correspondence: Ahmed Moustafa, amoustafa@ 123456aucegypt.edu

This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics

[ † ]

These authors have contributed equally to this work

Article

Publisher ID: 1017340

DOI: 10.3389/fgene.2022.1017340

PMC ID: 9730048

PubMed ID: 36506316

SO-VID: 2474554c-1421-464f-8b28-923d5aab8ad4

License:

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

History

Date received : 11 August 2022

Date accepted : 07 November 2022

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Cited by 30

See all cited by

Most referenced authors 3,382

See all reference authors

Applications of machine learning in metabolomics: Disease modeling and classification

Read this article at

Abstract

Related collections

Annual Reviews AI, Machine Learning, and Society

Most cited references 137

Deep learning.

Support-vector networks

Multilayer feedforward networks are universal approximators

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 696

Cited by 30

Most referenced authors 3,382