ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08910
  4. Cited By
How Much Knowledge Can You Pack Into the Parameters of a Language Model?

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM
ArXivPDFHTML

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 217 papers shown
Title
Mind's Eye: Grounded Language Model Reasoning through Simulation
Mind's Eye: Grounded Language Model Reasoning through Simulation
Ruibo Liu
Jason W. Wei
S. Gu
Te-Yen Wu
Soroush Vosoughi
Claire Cui
Denny Zhou
Andrew M. Dai
ReLM
LRM
118
79
0
11 Oct 2022
Understanding and Improving Zero-shot Multi-hop Reasoning in Generative
  Question Answering
Understanding and Improving Zero-shot Multi-hop Reasoning in Generative Question Answering
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
LRM
35
10
0
09 Oct 2022
Measuring and Narrowing the Compositionality Gap in Language Models
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
52
561
0
07 Oct 2022
Calibrating Factual Knowledge in Pretrained Language Models
Calibrating Factual Knowledge in Pretrained Language Models
Qingxiu Dong
Damai Dai
Yifan Song
Jingjing Xu
Zhifang Sui
Lei Li
KELM
238
82
0
07 Oct 2022
A Unified Encoder-Decoder Framework with Entity Memory
A Unified Encoder-Decoder Framework with Entity Memory
Zhihan Zhang
W. Yu
Chenguang Zhu
Meng Jiang
39
11
0
07 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
253
1,073
0
05 Oct 2022
Generate rather than Retrieve: Large Language Models are Strong Context
  Generators
Generate rather than Retrieve: Large Language Models are Strong Context Generators
W. Yu
Dan Iter
Shuohang Wang
Yichong Xu
Mingxuan Ju
Soumya Sanyal
Chenguang Zhu
Michael Zeng
Meng Jiang
RALM
AIMat
235
322
0
21 Sep 2022
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training
  Dynamics
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics
Shoaib Ahmed Siddiqui
Nitarshan Rajkumar
Tegan Maharaj
David M. Krueger
Sara Hooker
44
27
0
20 Sep 2022
VIPHY: Probing "Visible" Physical Commonsense Knowledge
VIPHY: Probing "Visible" Physical Commonsense Knowledge
Shikhar Singh
Ehsan Qasemi
Muhao Chen
46
6
0
15 Sep 2022
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language
  Models for Sinhala Text Classification
BERTifying Sinhala -- A Comprehensive Analysis of Pre-trained Language Models for Sinhala Text Classification
Vinura Dhananjaya
Piyumal Demotte
Surangika Ranathunga
Sanath Jayasena
27
14
0
16 Aug 2022
RealTime QA: What's the Answer Right Now?
RealTime QA: What's the Answer Right Now?
Jungo Kasai
Keisuke Sakaguchi
Yoichi Takahashi
Ronan Le Bras
Akari Asai
Xinyan Velocity Yu
Dragomir R. Radev
Noah A. Smith
Yejin Choi
Kentaro Inui
KELM
45
167
0
27 Jul 2022
Rationale-Augmented Ensembles in Language Models
Rationale-Augmented Ensembles in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Denny Zhou
ReLM
LRM
35
124
0
02 Jul 2022
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking
  Experience
OPERA: Harmonizing Task-Oriented Dialogs and Information Seeking Experience
Miaoran Li
Baolin Peng
Jianfeng Gao
Zhu Zhang
74
9
0
24 Jun 2022
Memory-Based Model Editing at Scale
Memory-Based Model Editing at Scale
E. Mitchell
Charles Lin
Antoine Bosselut
Christopher D. Manning
Chelsea Finn
KELM
35
322
0
13 Jun 2022
QAMPARI: An Open-domain Question Answering Benchmark for Questions with
  Many Answers from Multiple Paragraphs
QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
S. Amouyal
Tomer Wolfson
Ohad Rubin
Ori Yoran
Jonathan Herzig
Jonathan Berant
RALM
VLM
30
21
0
25 May 2022
TALM: Tool Augmented Language Models
TALM: Tool Augmented Language Models
Aaron T Parisi
Yao-Min Zhao
Noah Fiedel
KELM
RALM
LLMAG
32
144
0
24 May 2022
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained
  Language Models
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models
Da Yin
Hritik Bansal
Masoud Monajatipoor
Liunian Harold Li
Kai-Wei Chang
49
28
0
24 May 2022
SKILL: Structured Knowledge Infusion for Large Language Models
SKILL: Structured Knowledge Infusion for Large Language Models
Fedor Moiseev
Zhe Dong
Enrique Alfonseca
Martin Jaggi
KELM
58
58
0
17 May 2022
Near-Negative Distinction: Giving a Second Life to Human Evaluation
  Datasets
Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
Philippe Laban
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
41
5
0
13 May 2022
Towards Answering Open-ended Ethical Quandary Questions
Towards Answering Open-ended Ethical Quandary Questions
Yejin Bang
Nayeon Lee
Tiezheng Yu
Leila Khalatbari
Yan Xu
...
Romain Barraud
Elham J. Barezi
Andrea Madotto
Hayden Kee
Pascale Fung
ELM
35
6
0
12 May 2022
Adaptable Adapters
Adaptable Adapters
N. Moosavi
Quentin Delfosse
Kristian Kersting
Iryna Gurevych
53
21
0
03 May 2022
LayoutBERT: Masked Language Layout Model for Object Insertion
LayoutBERT: Masked Language Layout Model for Object Insertion
Kerem Turgutlu
Sanatan Sharma
J. Kumar
VLM
DiffM
30
2
0
30 Apr 2022
TemporalWiki: A Lifelong Benchmark for Training and Evaluating
  Ever-Evolving Language Models
TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
Joel Jang
Seonghyeon Ye
Changho Lee
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Minjoon Seo
CLL
KELM
27
92
0
29 Apr 2022
Faithful to the Document or to the World? Mitigating Hallucinations via
  Entity-linked Knowledge in Abstractive Summarization
Faithful to the Document or to the World? Mitigating Hallucinations via Entity-linked Knowledge in Abstractive Summarization
Yue Dong
John Wieting
Pat Verga
HILM
24
24
0
28 Apr 2022
Plug-and-Play Adaptation for Continuously-updated QA
Plug-and-Play Adaptation for Continuously-updated QA
Kyungjae Lee
Wookje Han
Seung-won Hwang
Hwaran Lee
Joonsuk Park
Sang-Woo Lee
KELM
30
16
0
27 Apr 2022
On the Limitations of Dataset Balancing: The Lost Battle Against
  Spurious Correlations
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
37
26
0
27 Apr 2022
Autoregressive Search Engines: Generating Substrings as Document
  Identifiers
Autoregressive Search Engines: Generating Substrings as Document Identifiers
Michele Bevilacqua
G. Ottaviano
Patrick Lewis
Wen-tau Yih
Sebastian Riedel
Fabio Petroni
KELM
RALM
30
156
0
22 Apr 2022
Knowledgeable Salient Span Mask for Enhancing Language Models as
  Knowledge Base
Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base
Cunxiang Wang
Fuli Luo
Yanyang Li
Runxin Xu
Fei Huang
Yue Zhang
KELM
36
2
0
17 Apr 2022
Calibrating Trust of Multi-Hop Question Answering Systems with
  Decompositional Probes
Calibrating Trust of Multi-Hop Question Answering Systems with Decompositional Probes
Kaige Xie
Sarah Wiegreffe
Mark O. Riedl
ReLM
24
12
0
16 Apr 2022
Knowledge Infused Decoding
Knowledge Infused Decoding
Ruibo Liu
Guoqing Zheng
Shashank Gupta
Radhika Gaonkar
Chongyang Gao
Soroush Vosoughi
Milad Shokouhi
Ahmed Hassan Awadallah
KELM
25
14
0
06 Apr 2022
Data Augmentation for Intent Classification with Off-the-shelf Large
  Language Models
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models
Gaurav Sahu
Pau Rodríguez López
I. Laradji
Parmida Atighehchian
David Vazquez
Dzmitry Bahdanau
24
61
0
05 Apr 2022
Factual Consistency of Multilingual Pretrained Language Models
Factual Consistency of Multilingual Pretrained Language Models
Constanza Fierro
Anders Søgaard
HILM
27
15
0
22 Mar 2022
Sequence-to-Sequence Knowledge Graph Completion and Question Answering
Sequence-to-Sequence Knowledge Graph Completion and Question Answering
Apoorv Saxena
Adrian Kochsiek
Rainer Gemulla
AIMat
45
120
0
19 Mar 2022
Reasoning over Public and Private Data in Retrieval-Based Systems
Reasoning over Public and Private Data in Retrieval-Based Systems
Simran Arora
Patrick Lewis
Angela Fan
Jacob Kahn
Christopher Ré
23
23
0
14 Mar 2022
Read before Generate! Faithful Long Form Question Answering with Machine
  Reading
Read before Generate! Faithful Long Form Question Answering with Machine Reading
Dan Su
Xiaoguang Li
Jindi Zhang
Lifeng Shang
Xin Jiang
Qun Liu
Pascale Fung
HILM
19
59
0
01 Mar 2022
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification
  and Reasoning Abilities of Language Models
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models
Daniel Gao
Yantao Jia
Lei Li
Chengzhen Fu
Zhicheng Dou
Hao Jiang
Xinyu Zhang
Lei Chen
Bo Zhao
KELM
22
8
0
28 Feb 2022
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language
  Models Better
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language Models Better
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
25
58
0
24 Feb 2022
ST-MoE: Designing Stable and Transferable Sparse Expert Models
ST-MoE: Designing Stable and Transferable Sparse Expert Models
Barret Zoph
Irwan Bello
Sameer Kumar
Nan Du
Yanping Huang
J. Dean
Noam M. Shazeer
W. Fedus
MoE
24
182
0
17 Feb 2022
Deduplicating Training Data Mitigates Privacy Risks in Language Models
Deduplicating Training Data Mitigates Privacy Risks in Language Models
Nikhil Kandpal
Eric Wallace
Colin Raffel
PILM
MU
54
275
0
14 Feb 2022
Locating and Editing Factual Associations in GPT
Locating and Editing Factual Associations in GPT
Kevin Meng
David Bau
A. Andonian
Yonatan Belinkov
KELM
56
1,199
0
10 Feb 2022
Can Open Domain Question Answering Systems Answer Visual Knowledge
  Questions?
Can Open Domain Question Answering Systems Answer Visual Knowledge Questions?
Jiawen Zhang
Abhijit Mishra
Avinesh P.V.S
Siddharth Patwardhan
Sachin Agarwal
24
0
0
09 Feb 2022
Survey of Hallucination in Natural Language Generation
Survey of Hallucination in Natural Language Generation
Ziwei Ji
Nayeon Lee
Rita Frieske
Tiezheng Yu
D. Su
...
Delong Chen
Wenliang Dai
Ho Shu Chan
Andrea Madotto
Pascale Fung
HILM
LRM
70
2,243
0
08 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
48
247
0
03 Feb 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge
  for Embodied Agents
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
Wenlong Huang
Pieter Abbeel
Deepak Pathak
Igor Mordatch
LM&Ro
42
1,062
0
18 Jan 2022
Eliciting Knowledge from Pretrained Language Models for Prototypical
  Prompt Verbalizer
Eliciting Knowledge from Pretrained Language Models for Prototypical Prompt Verbalizer
Yinyi Wei
Tong Mo
Yong-jia Jiang
Weiping Li
Wen Zhao
VLM
34
15
0
14 Jan 2022
Zero-shot Commonsense Question Answering with Cloze Translation and
  Consistency Optimization
Zero-shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
Zi-Yi Dou
Nanyun Peng
ELM
15
26
0
01 Jan 2022
Knowledge-Grounded Dialogue Generation with a Unified Knowledge
  Representation
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation
Yu Li
Baolin Peng
Yelong Shen
Yi Mao
Lars Liden
Zhou Yu
Jianfeng Gao
19
53
0
15 Dec 2021
A Scoping Review of Publicly Available Language Tasks in Clinical
  Natural Language Processing
A Scoping Review of Publicly Available Language Tasks in Clinical Natural Language Processing
Yanjun Gao
Dmitriy Dligach
Leslie A. Christensen
S. Tesch
Ryan Laffin
Dongfang Xu
Timothy A. Miller
Özlem Uzuner
M. Churpek
Majid Afshar
LM&MA
33
25
0
07 Dec 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication
  with Drawings and Text
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
Christopher Clark
Jordi Salvador
Dustin Schwenk
Derrick Bonafilia
Mark Yatskar
...
Aaron Sarnat
Hannaneh Hajishirzi
Aniruddha Kembhavi
Oren Etzioni
Ali Farhadi
MLLM
20
3
0
01 Dec 2021
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context
  Images via Online Resources
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
Sahar Abdelnabi
Rakibul Hasan
Mario Fritz
26
74
0
30 Nov 2021
Previous
12345
Next