Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.09144
Cited By
v1
v2 (latest)
Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models
16 May 2023
Boxi Cao
Qiaoyu Tang
Hongyu Lin
Shanshan Jiang
Bin Dong
Xianpei Han
Jiawei Chen
Tianshu Wang
Le Sun
CLL
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Retentive or Forgetful? Diving into the Knowledge Memorizing Mechanism of Language Models"
18 / 18 papers shown
Title
How Much Can We Forget about Data Contamination?
Sebastian Bordt
Suraj Srinivas
Valentyn Boreiko
U. V. Luxburg
119
2
0
04 Oct 2024
The Life Cycle of Knowledge in Big Language Models: A Survey
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
KELM
84
28
0
14 Mar 2023
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Kushal Tirumala
Aram H. Markosyan
Luke Zettlemoyer
Armen Aghajanyan
TDI
110
197
0
22 May 2022
Quantifying Memorization Across Neural Language Models
Nicholas Carlini
Daphne Ippolito
Matthew Jagielski
Katherine Lee
Florian Tramèr
Chiyuan Zhang
PILM
124
630
0
15 Feb 2022
Towards Continual Knowledge Learning of Language Models
Joel Jang
Seonghyeon Ye
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Stanley Jungkyu Choi
Minjoon Seo
CLL
KELM
299
161
0
07 Oct 2021
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases
Boxi Cao
Hongyu Lin
Xianpei Han
Le Sun
Lingyong Yan
M. Liao
Tong Xue
Jin Xu
46
135
0
17 Jun 2021
Measuring and Improving Consistency in Pretrained Language Models
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Abhilasha Ravichander
Eduard H. Hovy
Hinrich Schütze
Yoav Goldberg
HILM
331
368
0
01 Feb 2021
Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models
Nora Kassner
Philipp Dufter
Hinrich Schütze
73
141
0
01 Feb 2021
Extracting Training Data from Large Language Models
Nicholas Carlini
Florian Tramèr
Eric Wallace
Matthew Jagielski
Ariel Herbert-Voss
...
Tom B. Brown
Basel Alomair
Ulfar Erlingsson
Alina Oprea
Colin Raffel
MLAU
SILM
509
1,953
0
14 Dec 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
Zhengbao Jiang
Antonios Anastasopoulos
Jun Araki
Haibo Ding
Graham Neubig
HILM
KELM
76
144
0
13 Oct 2020
How Can We Know What Language Models Know?
Zhengbao Jiang
Frank F. Xu
Jun Araki
Graham Neubig
KELM
146
1,412
0
28 Nov 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
579
2,677
0
03 Sep 2019
Does Learning Require Memorization? A Short Tale about a Long Tail
Vitaly Feldman
TDI
139
500
0
12 Jun 2019
What do you learn from context? Probing for sentence structure in contextualized word representations
Ian Tenney
Patrick Xia
Berlin Chen
Alex Jinpeng Wang
Adam Poliak
...
Najoung Kim
Benjamin Van Durme
Samuel R. Bowman
Dipanjan Das
Ellie Pavlick
187
865
0
15 May 2019
A Survey on Multi-Task Learning
Yu Zhang
Qiang Yang
AIMat
605
2,243
0
25 Jul 2017
An Overview of Multi-Task Learning in Deep Neural Networks
Sebastian Ruder
CVBM
159
2,831
0
15 Jun 2017
Learning without Forgetting
Zhizhong Li
Derek Hoiem
CLL
OOD
SSL
308
4,432
0
29 Jun 2016
An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks
Ian Goodfellow
M. Berk Mirza
Xia Da
Aaron Courville
Yoshua Bengio
156
1,455
0
21 Dec 2013
1