An Efficient Deep Learning Approach for DNA-Binding Proteins Classification from Primary Sequences

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

As the number of identified proteins has expanded, the accurate identification of proteins has become a significant challenge in the field of biology. Various computational methods, such as Support Vector Machine (SVM), K-nearest neighbors (KNN), and convolutional neural network (CNN), have been proposed to recognize deoxyribonucleic acid (DNA)-binding proteins solely based on amino acid sequences. However, these methods do not consider the contextual information within amino acid sequences, limiting their ability to adequately capture sequence features. In this study, we propose a novel approach to identify DNA-binding proteins by integrating a CNN with bidirectional long-short-term memory (LSTM) and gated recurrent unit (GRU) as (CNN-BiLG). The CNN-BiLG model can explore the potential contextual relationships of amino acid sequences and obtain more features than traditional models. Our experimental results demonstrate a validation set prediction accuracy of 94% for the proposed CNN-BiLG, surpassing the accuracy of machine learning models and deep learning models. Furthermore, our model is both effective and efficient, exhibiting commendable classification accuracy based on comparative analysis.

Related collections

Most cited references 65

Record: found
Abstract: found
Article: not found

Long Short-Term Memory

Jürgen Schmidhuber, Jürgen Schmidhuber (2003)

Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.

0 comments Cited 7826 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Bidirectional recurrent neural networks

M. Schuster, K.K. Paliwal (1997)

0 comments Cited 804 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Conference Proceedings: not found

CNN architectures for large-scale audio classification

Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis … (2017)

0 comments Cited 198 times – based on 0 reviews

Bookmark

All references

Author and article information

Contributors

Khadija Mohammed Al-Aidarous: (View ORCID Profile)

Journal

Title: International Journal of Computational Intelligence Systems

Abbreviated Title: Int J Comput Intell Syst

Publisher: Springer Science and Business Media LLC

ISSN (Electronic): 1875-6883

Publication date Created: December 2024

Publication date (Electronic): April 11 2024

Volume: 17

Issue: 1

Article

DOI: 10.1007/s44196-024-00462-3

SO-VID: 47faf659-b0f4-4fbe-b222-00404d96219f

License:

https://creativecommons.org/licenses/by/4.0

History

Data availability:

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.