
      Representation Learning: A Review and New Perspectives


          Abstract

The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind the data. Although specific domain knowledge can be used to help design representations, learning with generic priors can also be used, and the quest for AI is motivating the design of more powerful representation-learning algorithms implementing such priors. This paper reviews recent work in the area of unsupervised feature learning and deep learning, covering advances in probabilistic models, autoencoders, manifold learning, and deep networks. This motivates longer-term unanswered questions about the appropriate objectives for learning good representations, for computing representations (i.e., inference), and the geometrical connections between representation learning, density estimation, and manifold learning.

Most cited references (198)

          Gradient-based learning applied to document recognition

            Reducing the dimensionality of data with neural networks.

            High-dimensional data can be converted to low-dimensional codes by training a multilayer neural network with a small central layer to reconstruct high-dimensional input vectors. Gradient descent can be used for fine-tuning the weights in such "autoencoder" networks, but this works well only if the initial weights are close to a good solution. We describe an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data.
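As a rough illustration of the architecture this abstract describes, here is a minimal sketch of a deep autoencoder with a small central code layer. It is written against PyTorch (an assumption made for illustration; the original work did not use it), the layer sizes only loosely follow the paper's 784-1000-500-250-30 encoder, and the layer-wise pretraining that supplies the good initial weights is omitted: only the architecture and the gradient-based reconstruction objective are shown.

# Minimal deep autoencoder sketch (PyTorch assumed; sizes illustrative).
import torch
import torch.nn as nn

class DeepAutoencoder(nn.Module):
    def __init__(self, dims=(784, 1000, 500, 250, 30)):
        super().__init__()
        enc, dec = [], []
        for d_in, d_out in zip(dims, dims[1:]):
            enc += [nn.Linear(d_in, d_out), nn.Sigmoid()]
        for d_in, d_out in zip(dims[::-1], dims[::-1][1:]):
            dec += [nn.Linear(d_in, d_out), nn.Sigmoid()]
        self.encoder = nn.Sequential(*enc)   # maps the input down to the small code
        self.decoder = nn.Sequential(*dec)   # maps the code back to the input

    def forward(self, x):
        code = self.encoder(x)               # low-dimensional code (central layer)
        return self.decoder(code), code

model = DeepAutoencoder()
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x = torch.rand(64, 784)                      # stand-in for a batch of images
opt.zero_grad()
recon, code = model(x)
loss = nn.functional.mse_loss(recon, x)      # squared reconstruction error
loss.backward()                              # gradient-based fine-tuning step
opt.step()

With randomly initialized weights, this fine-tuning alone tends to get stuck, which is exactly the abstract's point: the paper's contribution is the weight-initialization scheme that places deep autoencoders near a good solution before gradient descent begins.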

              A fast learning algorithm for deep belief nets.

              We show how to use "complementary priors" to eliminate the explaining-away effects that make inference difficult in densely connected belief nets that have many hidden layers. Using complementary priors, we derive a fast, greedy algorithm that can learn deep, directed belief networks one layer at a time, provided the top two layers form an undirected associative memory. The fast, greedy algorithm is used to initialize a slower learning procedure that fine-tunes the weights using a contrastive version of the wake-sleep algorithm. After fine-tuning, a network with three hidden layers forms a very good generative model of the joint distribution of handwritten digit images and their labels. This generative model gives better digit classification than the best discriminative learning algorithms. The low-dimensional manifolds on which the digits lie are modeled by long ravines in the free-energy landscape of the top-level associative memory, and it is easy to explore these ravines by using the directed connections to display what the associative memory has in mind.
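The greedy, layer-wise procedure this abstract summarizes can be sketched compactly. The following NumPy sketch (an illustration, not the authors' code) trains a stack of restricted Boltzmann machines with one-step contrastive divergence (CD-1), feeding each layer's hidden activations to the next; the hidden-layer sizes loosely echo the paper's digit model, while the learning rate, the random binary stand-in data, the label units, and the wake-sleep fine-tuning are all simplifications or omissions.

# Greedy layer-wise stacking of RBMs with CD-1 updates (NumPy sketch;
# hyperparameters and data are illustrative placeholders).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_rbm(data, n_hidden, lr=0.1, epochs=5, batch=100):
    n_visible = data.shape[1]
    W = 0.01 * rng.standard_normal((n_visible, n_hidden))
    b_v = np.zeros(n_visible)                    # visible biases
    b_h = np.zeros(n_hidden)                     # hidden biases
    for _ in range(epochs):
        for i in range(0, len(data), batch):
            v0 = data[i:i + batch]
            ph0 = sigmoid(v0 @ W + b_h)          # positive phase
            h0 = (rng.random(ph0.shape) < ph0).astype(float)
            pv1 = sigmoid(h0 @ W.T + b_v)        # one Gibbs step (CD-1)
            ph1 = sigmoid(pv1 @ W + b_h)         # negative phase
            W += lr * (v0.T @ ph0 - pv1.T @ ph1) / len(v0)
            b_v += lr * (v0 - pv1).mean(axis=0)
            b_h += lr * (ph0 - ph1).mean(axis=0)
    return W, b_h

# Train one layer at a time; each RBM models the hidden activations
# of the layer below, as in the greedy algorithm described above.
x = (rng.random((1000, 784)) < 0.5).astype(float)   # placeholder binary "images"
layer_input, params = x, []
for n_hidden in (500, 500, 2000):                   # sizes echo the paper's digit net
    W, b_h = train_rbm(layer_input, n_hidden)
    params.append((W, b_h))
    layer_input = sigmoid(layer_input @ W + b_h)    # drive the next layer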

                Author and article information

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE Trans. Pattern Anal. Mach. Intell.)
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
ISSN (print): 0162-8828
ISSN (electronic): 2160-9292
Publication date: August 2013
Volume: 35
Issue: 8
Pages: 1798-1828
DOI: 10.1109/TPAMI.2013.50
PMID: 23787338
Record ID: 73161184-2df8-4f89-b340-cf3ffbffc9e8
Copyright: © 2013
License: https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html

scite Smart Citations: 10,868 citing publications (supporting: 21, mentioning: 6,286, contrasting: 0). See how this article has been cited at scite.ai.

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Similar content: 3,406
Cited by: 2,077
Most referenced authors: 1,214