An analog-AI chip for energy-efficient speech recognition and transcription

Ambrogio, S; Narayanan, P.; Okazaki, A; Fasoli, A; Mackin, C.; Hosokawa, K; Nomura, A.; Yasuda, T; Chen, A.; Friz, A.; Ishii, M; Luquin, J.; Kohda, Y; Saulnier, N; Brew, K; Choi, S.; �ok, I.; Philip, T; Chan, V.; Silvestre, C.; Ahsan, I.; Narayanan, V.; Tsai, H.; Burr, G. W.

doi:10.1038/s41586-023-06337-5

ScienceOpen: research and publishing network

For Publishers

For Researchers

Blog
About

Search
Advanced search

views

recommends

Record: found
Abstract: found
Article: found

Is Open Access

An analog-AI chip for energy-efficient speech recognition and transcription

research-article

Author(s): S. Ambrogio ¹ ^, , P. Narayanan ¹ , A. Okazaki ² , A. Fasoli ¹ , C. Mackin ¹ , K. Hosokawa ² , A. Nomura ² , T. Yasuda ² , A. Chen ¹ , A. Friz ¹ , M. Ishii ² , J. Luquin ¹ , Y. Kohda ² , N. Saulnier ³ , K. Brew ³ , S. Choi ³ , I. Ok ³ , T. Philip ³ , V. Chan ³ , C. Silvestre ³ , I. Ahsan ³ , V. Narayanan ⁴ , H. Tsai ¹ , G. W. Burr ¹

Publication date (Electronic): 23 August 2023

Journal: Nature

Publisher: Nature Publishing Group UK

Keywords: Electrical and electronic engineering, Electronic devices, Information technology, Computational science

Read this article at

ScienceOpen Publisher PMC

Bookmark

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Models of artificial intelligence (AI) that have billions of parameters can achieve high accuracy across a range of tasks ^{1,
2}, but they exacerbate the poor energy efficiency of conventional general-purpose processors, such as graphics processing units or central processing units. Analog in-memory computing (analog-AI) ^{3–
7} can provide better energy efficiency by performing matrix–vector multiplications in parallel on ‘memory tiles’. However, analog-AI has yet to demonstrate software-equivalent (SW _eq) accuracy on models that require many such tiles and efficient communication of neural-network activations between the tiles. Here we present an analog-AI chip that combines 35 million phase-change memory devices across 34 tiles, massively parallel inter-tile communication and analog, low-power peripheral circuitry that can achieve up to 12.4 tera-operations per second per watt (TOPS/W) chip-sustained performance. We demonstrate fully end-to-end SW _eq accuracy for a small keyword-spotting network and near-SW _eq accuracy on the much larger MLPerf ⁸ recurrent neural-network transducer (RNNT), with more than 45 million weights mapped onto more than 140 million phase-change memory devices across five chips.

Abstract

A low-power chip that runs AI models using analog rather than digital computation shows comparable accuracy on speech-recognition tasks but is more than 14 times as energy efficient.

Related collections

Most cited references 34

Record: found
Abstract: found
Article: not found

Deep learning.

Yann LeCun, Yoshua Bengio, Geoffrey E Hinton (2015)

Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

0 comments Cited 10054 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Fully hardware-implemented memristor convolutional neural network

Peng Yao, Huaqiang Wu, Bin Gao … (2020)

0 comments Cited 428 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Conference Proceedings: not found

Speech recognition with deep recurrent neural networks

Alex Graves, Abdel-rahman Mohamed, Geoffrey E Hinton (2013)

0 comments Cited 326 times – based on 0 reviews

Bookmark

All references

Author and article information

Contributors

S. Ambrogio:

ORCID: http://orcid.org/0000-0002-5475-4209

stefano.ambrogio@ibm.com

Journal

Journal ID (nlm-ta): Nature

Journal ID (iso-abbrev): Nature

Title: Nature

Publisher: Nature Publishing Group UK (London )

ISSN (Print): 0028-0836

ISSN (Electronic): 1476-4687

Publication date (Electronic): 23 August 2023

Publication date PMC-release: 23 August 2023

Publication date (Print): 2023

Volume: 620

Issue: 7975

Pages: 768-775

Affiliations

[1 ]GRID grid.481551.c, IBM Research – Almaden, ; San Jose, CA USA

[2 ]GRID grid.420126.3, IBM Research – Tokyo, ; Kawasaki, Japan

[3 ]IBM Research – Albany NanoTech Center, Albany, NY USA

[4 ]GRID grid.481554.9, ISNI 0000 0001 2111 841X, IBM Thomas J. Watson Research Center, ; Yorktown Heights, NY USA

Author information

S. Ambrogio http://orcid.org/0000-0002-5475-4209

A. Okazaki http://orcid.org/0000-0002-5275-5224

A. Fasoli http://orcid.org/0000-0001-6892-5139

C. Mackin http://orcid.org/0000-0001-8413-5583

A. Nomura http://orcid.org/0000-0003-2354-867X

M. Ishii http://orcid.org/0000-0003-0794-7232

K. Brew http://orcid.org/0000-0002-2515-2882

G. W. Burr http://orcid.org/0000-0001-5717-2549

Article

Publisher ID: 6337

DOI: 10.1038/s41586-023-06337-5

PMC ID: 10447234

PubMed ID: 37612392

SO-VID: a2c91523-d893-408b-945b-db4d655b6671

License:

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

History

Date received : 13 December 2022

Date accepted : 16 June 2023

Custom metadata

ScienceOpen disciplines: Uncategorized

Keywords: electrical and electronic engineering,electronic devices,information technology,computational science

Data availability:

ScienceOpen disciplines: Uncategorized

Keywords: electrical and electronic engineering, electronic devices, information technology, computational science

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

An analog-AI chip for energy-efficient speech recognition and transcription

Read this article at

Abstract

Abstract

Related collections

Journal of Applied Computing and Information Technology

Most cited references 34

Deep learning.

Fully hardware-implemented memristor convolutional neural network

Speech recognition with deep recurrent neural networks

Author and article information

Contributors

Journal

Affiliations

Author information

Article

History

Categories

Custom metadata

Comments

Comment on this article

Similar content 10

Cited by 26

Most referenced authors 335