Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Deep learning (DL), a branch of machine learning (ML) and artificial intelligence (AI) is nowadays considered as a core technology of today’s Fourth Industrial Revolution (4IR or Industry 4.0). Due to its learning capabilities from data, DL technology originated from artificial neural network (ANN), has become a hot topic in the context of computing, and is widely applied in various application areas like healthcare, visual recognition, text analytics, cybersecurity, and many more. However, building an appropriate DL model is a challenging task, due to the dynamic nature and variations in real-world problems and data. Moreover, the lack of core understanding turns DL methods into black-box machines that hamper development at the standard level. This article presents a structured and comprehensive view on DL techniques including a taxonomy considering various types of real-world tasks like supervised or unsupervised. In our taxonomy, we take into account deep networks for supervised or discriminative learning, unsupervised or generative learning as well as hybrid learning and relevant others. We also summarize real-world application areas where deep learning techniques can be used. Finally, we point out ten potential aspects for future generation DL modeling with research directions. Overall, this article aims to draw a big picture on DL modeling that can be used as a reference guide for both academia and industry professionals.

Related collections

Most cited references 79

Record: found
Abstract: found
Article: not found

Deep learning.

Yann LeCun, Yoshua Bengio, Geoffrey E Hinton (2015)

Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

0 comments Cited 9112 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Long Short-Term Memory

Jürgen Schmidhuber, Jürgen Schmidhuber (2002)

Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.

0 comments Cited 6906 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Gradient-based learning applied to document recognition

Y Lecun, L. Bottou, Y Bengio … (1998)

0 comments Cited 3492 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Iqbal H. Sarker:

ORCID: http://orcid.org/0000-0003-1740-5517

msarker@swin.edu.au

Journal

Journal ID (nlm-ta): SN Comput Sci

Journal ID (iso-abbrev): SN Comput Sci

Title: Sn Computer Science

Publisher: Springer Singapore (Singapore )

ISSN (Print): 2662-995X

ISSN (Electronic): 2661-8907

Publication date (Electronic): 18 August 2021

Publication date (Print): 2021

Volume: 2

Issue: 6

Electronic Location Identifier: 420

Affiliations

[1 ]GRID grid.1027.4, ISNI 0000 0004 0409 2862, Swinburne University of Technology, ; Melbourne, VIC 3122 Australia

[2 ]GRID grid.442957.9, Chittagong University of Engineering & Technology, ; Chittagong, 4349 Bangladesh

Author information

Iqbal H. Sarker http://orcid.org/0000-0003-1740-5517

Article

Publisher ID: 815

DOI: 10.1007/s42979-021-00815-1

PMC ID: 8372231

PubMed ID: 34426802

SO-VID: 9539bbe5-ffac-497e-82b0-8800d12fe2bb

License:

This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

History

Date received : 29 May 2021

Date accepted : 7 August 2021

Custom metadata

Keywords: deep learning,artificial neural network,artificial intelligence,discriminative learning,generative learning,hybrid learning,intelligent systems

Data availability:

Keywords: deep learning, artificial neural network, artificial intelligence, discriminative learning, generative learning, hybrid learning, intelligent systems

Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions

Read this article at

Abstract

Related collections

Computer Vision, Deep Learning, Deep Reinforcement Learning, IoT

Most cited references 79

Deep learning.

Long Short-Term Memory

Gradient-based learning applied to document recognition

Author and article information

Contributors

Journal

Affiliations

Author information

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 105

Cited by 176