Inviting an author to review:
Find an author and click ‘Invite to review selected article’ near their name.
Search for authorsSearch for similar articles
13
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An analog-AI chip for energy-efficient speech recognition and transcription

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Models of artificial intelligence (AI) that have billions of parameters can achieve high accuracy across a range of tasks 1, 2 , but they exacerbate the poor energy efficiency of conventional general-purpose processors, such as graphics processing units or central processing units. Analog in-memory computing (analog-AI) 37 can provide better energy efficiency by performing matrix–vector multiplications in parallel on ‘memory tiles’. However, analog-AI has yet to demonstrate software-equivalent (SW eq) accuracy on models that require many such tiles and efficient communication of neural-network activations between the tiles. Here we present an analog-AI chip that combines 35 million phase-change memory devices across 34 tiles, massively parallel inter-tile communication and analog, low-power peripheral circuitry that can achieve up to 12.4 tera-operations per second per watt (TOPS/W) chip-sustained performance. We demonstrate fully end-to-end SW eq accuracy for a small keyword-spotting network and near-SW eq accuracy on the much larger MLPerf 8 recurrent neural-network transducer (RNNT), with more than 45 million weights mapped onto more than 140 million phase-change memory devices across five chips.

          Abstract

          A low-power chip that runs AI models using analog rather than digital computation shows comparable accuracy on speech-recognition tasks but is more than 14 times as energy efficient.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Deep learning.

          Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Fully hardware-implemented memristor convolutional neural network

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Speech recognition with deep recurrent neural networks

                Bookmark

                Author and article information

                Contributors
                stefano.ambrogio@ibm.com
                Journal
                Nature
                Nature
                Nature
                Nature Publishing Group UK (London )
                0028-0836
                1476-4687
                23 August 2023
                23 August 2023
                2023
                : 620
                : 7975
                : 768-775
                Affiliations
                [1 ]GRID grid.481551.c, IBM Research – Almaden, ; San Jose, CA USA
                [2 ]GRID grid.420126.3, IBM Research – Tokyo, ; Kawasaki, Japan
                [3 ]IBM Research – Albany NanoTech Center, Albany, NY USA
                [4 ]GRID grid.481554.9, ISNI 0000 0001 2111 841X, IBM Thomas J. Watson Research Center, ; Yorktown Heights, NY USA
                Author information
                http://orcid.org/0000-0002-5475-4209
                http://orcid.org/0000-0002-5275-5224
                http://orcid.org/0000-0001-6892-5139
                http://orcid.org/0000-0001-8413-5583
                http://orcid.org/0000-0003-2354-867X
                http://orcid.org/0000-0003-0794-7232
                http://orcid.org/0000-0002-2515-2882
                http://orcid.org/0000-0001-5717-2549
                Article
                6337
                10.1038/s41586-023-06337-5
                10447234
                37612392
                a2c91523-d893-408b-945b-db4d655b6671
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 13 December 2022
                : 16 June 2023
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2023

                Uncategorized
                electrical and electronic engineering,electronic devices,information technology,computational science

                Comments

                Comment on this article

                scite_
                0
                0
                0
                0
                Smart Citations
                0
                0
                0
                0
                Citing PublicationsSupportingMentioningContrasting
                View Citations

                See how this article has been cited at scite.ai

                scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

                Similar content10

                Cited by26

                Most referenced authors335