ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08910
  4. Cited By
How Much Knowledge Can You Pack Into the Parameters of a Language Model?

How Much Knowledge Can You Pack Into the Parameters of a Language Model?

10 February 2020
Adam Roberts
Colin Raffel
Noam M. Shazeer
    KELM
ArXivPDFHTML

Papers citing "How Much Knowledge Can You Pack Into the Parameters of a Language Model?"

50 / 217 papers shown
Title
DeltaEdit: Enhancing Sequential Editing in Large Language Models by Controlling Superimposed Noise
DeltaEdit: Enhancing Sequential Editing in Large Language Models by Controlling Superimposed Noise
Ding Cao
Yuchen Cai
Rongxi Guo
Xiaoxiao He
Guiquan Liu
KELM
43
0
0
12 May 2025
EnronQA: Towards Personalized RAG over Private Documents
EnronQA: Towards Personalized RAG over Private Documents
Michael J. Ryan
Danmei Xu
Chris Nivera
Daniel Campos
SILM
67
0
0
01 May 2025
ConSens: Assessing context grounding in open-book question answering
ConSens: Assessing context grounding in open-book question answering
Ivan Vankov
Matyo Ivanov
Adriana Correia
Victor Botev
ELM
69
0
0
30 Apr 2025
Saliency-driven Dynamic Token Pruning for Large Language Models
Saliency-driven Dynamic Token Pruning for Large Language Models
Yao Tao
Yehui Tang
Yun Wang
Mingjian Zhu
Hailin Hu
Yunhe Wang
36
0
0
06 Apr 2025
Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks
Umar Ali Khan
Ekram Khan
Fiza Khan
A. A. Moinuddin
48
0
0
02 Mar 2025
Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement
Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement
Siyuan Zhang
Y. Zhang
Yinpeng Dong
Hang Su
HILM
KELM
218
0
0
26 Feb 2025
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
Hongzhan Lin
Yang Deng
Yuxuan Gu
Wenxuan Zhang
Jing Ma
See-Kiong Ng
Tat-Seng Chua
LLMAG
KELM
HILM
68
0
0
25 Feb 2025
Revealing and Mitigating Over-Attention in Knowledge Editing
Revealing and Mitigating Over-Attention in Knowledge Editing
Pinzheng Wang
Zecheng Tang
Keyan Zhou
J. Li
Qiaoming Zhu
Hao Fei
KELM
120
2
0
21 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
72
1
0
17 Feb 2025
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents
Wanqi Yang
Yong Li
Meng Fang
L. Chen
64
1
0
09 Feb 2025
Episodic memory in AI agents poses risks that should be studied and mitigated
Episodic memory in AI agents poses risks that should be studied and mitigated
Chad DeChant
67
2
0
20 Jan 2025
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
Moran Yanuka
Assaf Ben-Kish
Yonatan Bitton
Idan Szpektor
Raja Giryes
VLM
47
2
0
13 Nov 2024
Controllable Context Sensitivity and the Knob Behind It
Controllable Context Sensitivity and the Knob Behind It
Julian Minder
Kevin Du
Niklas Stoehr
Giovanni Monea
Chris Wendler
Robert West
Ryan Cotterell
KELM
58
3
0
11 Nov 2024
Gradient Localization Improves Lifelong Pretraining of Language Models
Gradient Localization Improves Lifelong Pretraining of Language Models
Jared Fernandez
Yonatan Bisk
Emma Strubell
KELM
39
1
0
07 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
77
2
0
01 Nov 2024
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Jiatao Li
Xinyu Hu
Xunjian Yin
Xiaojun Wan
RALM
53
0
0
17 Oct 2024
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces
Ahmed Oumar El-Shangiti
Tatsuya Hiraoka
Hilal AlQuabeh
Benjamin Heinzerling
Kentaro Inui
44
1
0
17 Oct 2024
Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks
Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks
Rudra Murthy
Prince Kumar
Praveen Venkateswaran
Danish Contractor
KELM
ALM
ELM
39
1
0
16 Oct 2024
Deciphering the Interplay of Parametric and Non-parametric Memory in
  Retrieval-augmented Language Models
Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models
M. Farahani
Richard Johansson
RALM
36
2
0
07 Oct 2024
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Junfeng Fang
Houcheng Jiang
Kun Wang
Yunshan Ma
Shi Jie
Xiangnan He
Tat-Seng Chua
Tat-seng Chua
KELM
41
34
0
03 Oct 2024
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time
David Herel
Vojtech Bartek
Jiri Jirak
Tomáš Mikolov
50
3
0
20 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using
  LLMs
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
36
7
0
15 Sep 2024
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Han Wang
Archiki Prasad
Elias Stengel-Eskin
Joey Tianyi Zhou
85
5
0
11 Sep 2024
Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter
Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter
Junhao Chen
Bowen Wang
Zhouqiang Jiang
Yuta Nakashima
48
1
0
20 Aug 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text
  Generation: A State-of-the-Art Investigation
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
42
8
0
19 Jul 2024
Beyond Individual Facts: Investigating Categorical Knowledge Locality of
  Taxonomy and Meronomy Concepts in GPT Models
Beyond Individual Facts: Investigating Categorical Knowledge Locality of Taxonomy and Meronomy Concepts in GPT Models
Christopher Burger
Yifan Hu
Thai Le
KELM
47
0
0
22 Jun 2024
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in
  LLMs
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen
Jiatong Han
Muhammed Razzak
Lisa Schut
Shreshth A. Malik
Yarin Gal
HILM
60
35
0
22 Jun 2024
How Do Large Language Models Acquire Factual Knowledge During
  Pretraining?
How Do Large Language Models Acquire Factual Knowledge During Pretraining?
Hoyeon Chang
Jinho Park
Seonghyeon Ye
Sohee Yang
Youngkyung Seo
Du-Seong Chang
Minjoon Seo
KELM
37
33
0
17 Jun 2024
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
Shengkang Wang
Hongzhan Lin
Ziyang Luo
Zhen Ye
Guang Chen
Jing Ma
68
3
0
17 Jun 2024
DIEKAE: Difference Injection for Efficient Knowledge Augmentation and
  Editing of Large Language Models
DIEKAE: Difference Injection for Efficient Knowledge Augmentation and Editing of Large Language Models
Alessio Galatolo
Meriem Beloucif
Katie Winkle
41
0
0
15 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELM
ALM
LM&MA
105
31
0
09 Jun 2024
Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents
Recall Them All: Retrieval-Augmented Language Models for Long Object List Extraction from Long Documents
Sneha Singhania
Simon Razniewski
Gerhard Weikum
RALM
34
1
0
04 May 2024
Multi-hop Question Answering over Knowledge Graphs using Large Language
  Models
Multi-hop Question Answering over Knowledge Graphs using Large Language Models
Abir Chakraborty
KELM
RALM
46
3
0
30 Apr 2024
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory
Ali Modarressi
Abdullatif Köksal
Ayyoob Imani
Mohsen Fayyaz
Hinrich Schütze
KELM
112
9
0
17 Apr 2024
Towards Better Generalization in Open-Domain Question Answering by
  Mitigating Context Memorization
Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization
Zixuan Zhang
R. Reddy
Kevin Small
Tong Zhang
Heng Ji
40
1
0
02 Apr 2024
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video
  Editing
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing
Bryan Wang
Yuliang Li
Zhaoyang Lv
Haijun Xia
Yan Xu
Raj Sodhi
35
42
0
15 Feb 2024
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Parth Sarthi
Salman Abdullah
Aditi Tuli
Shubh Khanna
Anna Goldie
Christopher D. Manning
RALM
19
123
0
31 Jan 2024
Desiderata for the Context Use of Question Answering Systems
Desiderata for the Context Use of Question Answering Systems
Sagi Shaier
Lawrence E Hunter
K. Wense
28
4
0
31 Jan 2024
Question Translation Training for Better Multilingual Reasoning
Question Translation Training for Better Multilingual Reasoning
Wenhao Zhu
Shujian Huang
Fei Yuan
Shuaijie She
Jiajun Chen
Alexandra Birch
LRM
23
29
0
15 Jan 2024
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge
  Graph Completion?
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
Yusuke Sakai
Hidetaka Kamigaito
Katsuhiko Hayashi
Taro Watanabe
26
1
0
15 Nov 2023
An Emulator for Fine-Tuning Large Language Models using Small Language
  Models
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Eric Mitchell
Rafael Rafailov
Archit Sharma
Chelsea Finn
Christopher D. Manning
ALM
41
52
0
19 Oct 2023
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
Sagi Shaier
Lawrence E Hunter
K. Wense
36
3
0
16 Oct 2023
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
Jirui Qi
Raquel Fernández
Arianna Bisazza
KELM
HILM
27
60
0
16 Oct 2023
Large Language Models can Learn Rules
Large Language Models can Learn Rules
Zhaocheng Zhu
Yuan Xue
Xinyun Chen
Denny Zhou
Jian Tang
Dale Schuurmans
Hanjun Dai
LRM
ReLM
41
63
0
10 Oct 2023
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
Ori Yoran
Tomer Wolfson
Ori Ram
Jonathan Berant
RALM
LRM
19
185
0
02 Oct 2023
Interpretable Long-Form Legal Question Answering with
  Retrieval-Augmented Large Language Models
Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models
Antoine Louis
Gijs van Dijck
Gerasimos Spanakis
ELM
AILaw
30
35
0
29 Sep 2023
Ragas: Automated Evaluation of Retrieval Augmented Generation
Ragas: Automated Evaluation of Retrieval Augmented Generation
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
91
177
0
26 Sep 2023
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM
  Animator
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator
Hanzhuo Huang
Yufan Feng
Cheng Shi
Lan Xu
Jingyi Yu
Sibei Yang
DiffM
VGen
23
64
0
25 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large
  Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
A. Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
48
522
0
03 Sep 2023
Evaluating the Ripple Effects of Knowledge Editing in Language Models
Evaluating the Ripple Effects of Knowledge Editing in Language Models
Roi Cohen
Eden Biran
Ori Yoran
Amir Globerson
Mor Geva
KELM
42
157
0
24 Jul 2023
12345
Next