
      How Can We Know What Language Models Know?

      Zhengbao Jiang [1], Frank F. Xu [1], Jun Araki [2], Graham Neubig [1]
      Transactions of the Association for Computational Linguistics
      MIT Press


          Abstract

          Recent work has presented intriguing results examining the knowledge contained in language models (LMs) by having the LM fill in the blanks of prompts such as “Obama is a __ by profession”. These prompts are usually manually created, and quite possibly sub-optimal; another prompt such as “Obama worked as a __” may result in more accurately predicting the correct profession. Because of this, given an inappropriate prompt, we might fail to retrieve facts that the LM does know, and thus any given prompt provides only a lower-bound estimate of the knowledge contained in an LM. In this paper, we attempt to more accurately estimate the knowledge contained in LMs by automatically discovering better prompts to use in this querying process. Specifically, we propose mining-based and paraphrasing-based methods to automatically generate high-quality and diverse prompts, as well as ensemble methods to combine answers from different prompts. Extensive experiments on the LAMA benchmark for extracting relational knowledge from LMs demonstrate that our methods can improve accuracy from 31.1% to 39.6%, providing a tighter lower bound on what LMs know. We have released the code and the resulting LM Prompt And Query Archive (LPAQA) at https://github.com/jzbjyb/LPAQA.
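
          The querying-and-ensembling idea above can be made concrete with a short sketch: probe a masked LM with several paraphrased prompts for the same fact and average the predicted distributions over the blank. This is a minimal illustration, assuming the HuggingFace transformers library and the bert-base-cased checkpoint; the three prompt strings are ad-hoc paraphrases written for this example, not LPAQA's mined prompts, and the uniform average merely stands in for the paper's ensemble methods.

          import torch
          from transformers import AutoModelForMaskedLM, AutoTokenizer

          # Assumed setup: bert-base-cased as the probed LM (a hypothetical choice
          # for this sketch; the paper evaluates LMs on the LAMA benchmark).
          tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
          model = AutoModelForMaskedLM.from_pretrained("bert-base-cased")
          model.eval()

          subject = "Obama"
          # Ad-hoc paraphrases of the same fill-in-the-blank query; LPAQA's prompts
          # are instead mined from text or generated by paraphrasing models.
          prompts = [
              f"{subject} is a [MASK] by profession.",
              f"{subject} worked as a [MASK].",
              f"{subject}'s profession is [MASK].",
          ]

          avg_probs = None
          for prompt in prompts:
              inputs = tokenizer(prompt, return_tensors="pt")
              # Locate the [MASK] position in this particular prompt.
              mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]
              with torch.no_grad():
                  logits = model(**inputs).logits[0, mask_pos, :]  # shape (1, vocab_size)
              probs = torch.softmax(logits, dim=-1).squeeze(0)
              # Uniform ensemble: accumulate, then average the distributions.
              avg_probs = probs if avg_probs is None else avg_probs + probs
          avg_probs /= len(prompts)

          # Top candidates for the blank under the prompt ensemble.
          for token_id in avg_probs.topk(5).indices.tolist():
              print(tokenizer.convert_ids_to_tokens(token_id), float(avg_probs[token_id]))

          Averaging full distributions lets a fact surface even when no single prompt ranks it first, which is why better prompts and their combination tighten the lower-bound estimate of what the LM knows.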


                Author and article information

                Journal: Transactions of the Association for Computational Linguistics
                Publisher: MIT Press
                ISSN: 2307-387X
                Date: December 2020
                Volume: 8
                Pages: 423-438
                Affiliations
                [1] Language Technologies Institute, Carnegie Mellon University.
                [2] Bosch Research North America.
                Article
                DOI: 10.1162/tacl_a_00324
                © 2020
