ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08910
  4. Cited By
How Much Knowledge Can You Pack Into the Parameters of a Language Model?

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM
ArXivPDFHTML

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 217 papers shown
Title
Plug-and-Play Document Modules for Pre-trained Models
Plug-and-Play Document Modules for Pre-trained Models
Chaojun Xiao
Zhengyan Zhang
Xu Han
Chi-Min Chan
Yankai Lin
Zhiyuan Liu
Xiangyang Li
Zhonghua Li
Bo Zhao
Maosong Sun
KELM
31
5
0
28 May 2023
A RelEntLess Benchmark for Modelling Graded Relations between Named
  Entities
A RelEntLess Benchmark for Modelling Graded Relations between Named Entities
Asahi Ushio
Jose Camacho-Collados
Steven Schockaert
29
1
0
24 May 2023
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence
  Scores from Language Models Fine-Tuned with Human Feedback
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
Katherine Tian
E. Mitchell
Allan Zhou
Archit Sharma
Rafael Rafailov
Huaxiu Yao
Chelsea Finn
Christopher D. Manning
57
289
0
24 May 2023
Mitigating Temporal Misalignment by Discarding Outdated Facts
Mitigating Temporal Misalignment by Discarding Outdated Facts
Michael J.Q. Zhang
Eunsol Choi
KELM
HILM
27
17
0
24 May 2023
Language Models with Rationality
Language Models with Rationality
Nora Kassner
Oyvind Tafjord
Ashish Sabharwal
Kyle Richardson
Hinrich Schütze
Peter Clark
ReLM
KELM
LRM
20
15
0
23 May 2023
Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in
  Foundation Models
Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in Foundation Models
Tim Schott
Daniel Furman
Shreshta Bhat
ELM
35
4
0
23 May 2023
Enhancing Small Medical Learners with Privacy-preserving Contextual
  Prompting
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting
Xinlu Zhang
Shiyang Li
Xianjun Yang
Chenxin Tian
Yao Qin
Linda R. Petzold
24
9
0
22 May 2023
Statistical Knowledge Assessment for Large Language Models
Statistical Knowledge Assessment for Large Language Models
Qingxiu Dong
Jingjing Xu
Lingpeng Kong
Zhifang Sui
Lei Li
HILM
47
6
0
17 May 2023
PaLM 2 Technical Report
PaLM 2 Technical Report
Rohan Anil
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
...
Ce Zheng
Wei Zhou
Denny Zhou
Slav Petrov
Yonghui Wu
ReLM
LRM
122
1,152
0
17 May 2023
Distinguish Before Answer: Generating Contrastive Explanation as
  Knowledge for Commonsense Question Answering
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering
Qianglong Chen
Guohai Xu
Mingshi Yan
Ji Zhang
Fei Huang
Luo Si
Yin Zhang
21
9
0
14 May 2023
Synergistic Interplay between Search and Large Language Models for
  Information Retrieval
Synergistic Interplay between Search and Large Language Models for Information Retrieval
Jiazhan Feng
Chongyang Tao
Xiubo Geng
Tao Shen
Can Xu
Guodong Long
Dongyan Zhao
Daxin Jiang
KELM
63
5
0
12 May 2023
Active Retrieval Augmented Generation
Active Retrieval Augmented Generation
Zhengbao Jiang
Frank F. Xu
Luyu Gao
Zhiqing Sun
Qian Liu
Jane Dwivedi-Yu
Yiming Yang
Jamie Callan
Graham Neubig
RALM
23
255
0
11 May 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad
Swarnadeep Saha
Xiang Zhou
Joey Tianyi Zhou
LRM
32
45
0
21 Apr 2023
UniMax: Fairer and more Effective Language Sampling for Large-Scale
  Multilingual Pretraining
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Hyung Won Chung
Noah Constant
Xavier Garcia
Adam Roberts
Yi Tay
Sharan Narang
Orhan Firat
29
50
0
18 Apr 2023
Salient Span Masking for Temporal Understanding
Salient Span Masking for Temporal Understanding
Jeremy R. Cole
Aditi Chaudhary
Bhuwan Dhingra
Partha P. Talukdar
52
11
0
22 Mar 2023
cTBLS: Augmenting Large Language Models with Conversational Tables
cTBLS: Augmenting Large Language Models with Conversational Tables
Anirudh S. Sundar
Larry Heck
LMTD
21
7
0
21 Mar 2023
Cross-Lingual Question Answering over Knowledge Base as Reading
  Comprehension
Cross-Lingual Question Answering over Knowledge Base as Reading Comprehension
Chen Zhang
Yuxuan Lai
Yansong Feng
Xingyu Shen
Haowei Du
Dongyan Zhao
21
3
0
26 Feb 2023
Can Pre-trained Vision and Language Models Answer Visual
  Information-Seeking Questions?
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
48
80
0
23 Feb 2023
Dynamic Named Entity Recognition
Dynamic Named Entity Recognition
Tristan Luiggi
Laure Soulier
Vincent Guigue
Siwar Jendoubi
Aurélien Baelde
36
0
0
16 Feb 2023
Capturing Topic Framing via Masked Language Modeling
Capturing Topic Framing via Masked Language Modeling
Xiaobo Guo
Weicheng Ma
Soroush Vosoughi
22
1
0
07 Feb 2023
You Truly Understand What I Need: Intellectual and Friendly Dialogue
  Agents grounding Knowledge and Persona
You Truly Understand What I Need: Intellectual and Friendly Dialogue Agents grounding Knowledge and Persona
J. Lim
Myunghoon Kang
Yuna Hur
Seung-Ju Jung
Jinsung Kim
...
Dongyub Lee
Hyesung Ji
Donghoon Shin
Seung Wook Kim
Heu-Jeoung Lim
HILM
25
8
0
06 Jan 2023
Contrastive Learning Reduces Hallucination in Conversations
Contrastive Learning Reduces Hallucination in Conversations
Weiwei Sun
Zhengliang Shi
Shen Gao
Pengjie Ren
Maarten de Rijke
Z. Ren
42
62
0
20 Dec 2022
DSI++: Updating Transformer Memory with New Documents
DSI++: Updating Transformer Memory with New Documents
Sanket Vaibhav Mehta
Jai Gupta
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
J. Rao
Marc Najork
Emma Strubell
Donald Metzler
CLL
32
39
0
19 Dec 2022
Evaluating Step-by-Step Reasoning through Symbolic Verification
Evaluating Step-by-Step Reasoning through Symbolic Verification
Yi-Fan Zhang
Hanlin Zhang
Li Erran Li
Eric P. Xing
ReLM
LRM
19
8
0
16 Dec 2022
Attributed Question Answering: Evaluation and Modeling for Attributed
  Large Language Models
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models
Bernd Bohnet
Vinh Q. Tran
Pat Verga
Roee Aharoni
D. Andor
...
Michael Collins
Dipanjan Das
Donald Metzler
Slav Petrov
Kellie Webster
43
59
0
15 Dec 2022
Event knowledge in large language models: the gap between the impossible
  and the unlikely
Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
37
67
0
02 Dec 2022
CREPE: Open-Domain Question Answering with False Presuppositions
CREPE: Open-Domain Question Answering with False Presuppositions
Xinyan Velocity Yu
Sewon Min
Luke Zettlemoyer
Hannaneh Hajishirzi
19
45
0
30 Nov 2022
TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense
  Question Answering
TSGP: Two-Stage Generative Prompting for Unsupervised Commonsense Question Answering
Yueqing Sun
Yu Zhang
Le Qi
Qi Shi
ReLM
RALM
LRM
21
5
0
24 Nov 2022
Can Open-Domain QA Reader Utilize External Knowledge Efficiently like
  Humans?
Can Open-Domain QA Reader Utilize External Knowledge Efficiently like Humans?
Neeraj Varshney
Man Luo
Chitta Baral
RALM
21
11
0
23 Nov 2022
Evaluating the Knowledge Dependency of Questions
Evaluating the Knowledge Dependency of Questions
Hyeongdon Moon
Yoonseok Yang
Jamin Shin
Hangyeol Yu
Seunghyun Lee
Myeongho Jeong
Juneyoung Park
Minsam Kim
Seungtaek Choi
AI4Ed
31
10
0
21 Nov 2022
Validating Large Language Models with ReLM
Validating Large Language Models with ReLM
Michael Kuchnik
Virginia Smith
George Amvrosiadis
36
27
0
21 Nov 2022
VER: Unifying Verbalizing Entities and Relations
VER: Unifying Verbalizing Entities and Relations
Jie Huang
Kevin Chen-Chuan Chang
17
1
0
20 Nov 2022
Knowledge Graph Generation From Text
Knowledge Graph Generation From Text
Igor Melnyk
Pierre L. Dognin
Payel Das
24
23
0
18 Nov 2022
PromptCap: Prompt-Guided Task-Aware Image Captioning
PromptCap: Prompt-Guided Task-Aware Image Captioning
Yushi Hu
Hang Hua
Zhengyuan Yang
Weijia Shi
Noah A. Smith
Jiebo Luo
51
101
0
15 Nov 2022
Large Language Models Struggle to Learn Long-Tail Knowledge
Large Language Models Struggle to Learn Long-Tail Knowledge
Nikhil Kandpal
H. Deng
Adam Roberts
Eric Wallace
Colin Raffel
RALM
KELM
56
388
0
15 Nov 2022
A Survey for Efficient Open Domain Question Answering
A Survey for Efficient Open Domain Question Answering
Qin Zhang
Shan Chen
Dongkuan Xu
Qingqing Cao
Xiaojun Chen
Trevor Cohn
Meng Fang
28
33
0
15 Nov 2022
Large Language Models with Controllable Working Memory
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
61
154
0
09 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in
  Diffusion Models
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
42
272
0
09 Nov 2022
ConsPrompt: Exploiting Contrastive Samples for Fewshot Prompt Learning
ConsPrompt: Exploiting Contrastive Samples for Fewshot Prompt Learning
Jinta Weng
Yifan Deng
D. D. Li
Hao You
Yue Hu
Heyan Huang
25
1
0
08 Nov 2022
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
COPEN: Probing Conceptual Knowledge in Pre-trained Language Models
Hao Peng
Xiaozhi Wang
Shengding Hu
Hailong Jin
Lei Hou
Juanzi Li
Zhiyuan Liu
Qun Liu
18
22
0
08 Nov 2022
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised
  Knowledge-Grounded Conversation
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation
Yanyang Li
Jianqiao Zhao
M. Lyu
Liwei Wang
24
15
0
03 Nov 2022
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating
  Models to Reflect Conflicting Evidence
Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence
Hung-Ting Chen
Michael J.Q. Zhang
Eunsol Choi
RALM
HILM
47
92
0
25 Oct 2022
Enhancing Tabular Reasoning with Pattern Exploiting Training
Enhancing Tabular Reasoning with Pattern Exploiting Training
Abhilash Shankarampeta
Vivek Gupta
Shuo Zhang
LMTD
RALM
ReLM
68
6
0
21 Oct 2022
Transcending Scaling Laws with 0.1% Extra Compute
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay
Jason W. Wei
Hyung Won Chung
Vinh Q. Tran
David R. So
...
Donald Metzler
Slav Petrov
N. Houlsby
Quoc V. Le
Mostafa Dehghani
LRM
44
68
0
20 Oct 2022
Pre-training Language Models with Deterministic Factual Knowledge
Pre-training Language Models with Deterministic Factual Knowledge
Shaobo Li
Xiaoguang Li
Lifeng Shang
Chengjie Sun
Bingquan Liu
Zhenzhou Ji
Xin Jiang
Qun Liu
KELM
47
11
0
20 Oct 2022
RARR: Researching and Revising What Language Models Say, Using Language
  Models
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM
KELM
41
257
0
17 Oct 2022
Mass-Editing Memory in a Transformer
Mass-Editing Memory in a Transformer
Kevin Meng
Arnab Sen Sharma
A. Andonian
Yonatan Belinkov
David Bau
KELM
VLM
59
527
0
13 Oct 2022
On the Explainability of Natural Language Processing Deep Models
On the Explainability of Natural Language Processing Deep Models
Julia El Zini
M. Awad
29
82
0
13 Oct 2022
Closed-book Question Generation via Contrastive Learning
Closed-book Question Generation via Contrastive Learning
Xiangjue Dong
Jiaying Lu
Jianling Wang
James Caverlee
37
8
0
13 Oct 2022
Language Models are Realistic Tabular Data Generators
Language Models are Realistic Tabular Data Generators
V. Borisov
Kathrin Seßler
Tobias Leemann
Martin Pawelczyk
Gjergji Kasneci
LMTD
27
223
0
12 Oct 2022
Previous
12345
Next