Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1911.00172
Cited By
v1
v2 (latest)
Generalization through Memorization: Nearest Neighbor Language Models
1 November 2019
Urvashi Khandelwal
Omer Levy
Dan Jurafsky
Luke Zettlemoyer
M. Lewis
RALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Generalization through Memorization: Nearest Neighbor Language Models"
50 / 597 papers shown
Title
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng Xu
Mohammad Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
122
35
0
15 Aug 2023
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Siqiao Xue
Fan Zhou
Y. Xu
Ming Jin
Qingsong Wen
...
Jun Zhou
Shuo Xie
D. Xiu
James Y. Zhang
Hongyuan Mei
RALM
AIFin
84
15
0
10 Aug 2023
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
Sewon Min
Suchin Gururangan
Eric Wallace
Hannaneh Hajishirzi
Noah A. Smith
Luke Zettlemoyer
AILaw
114
68
0
08 Aug 2023
TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023
Yu. V. Gorishniy
Ivan Rubachev
Nikolay Kartashev
Daniil Shlenskii
Akim Kotelnikov
Artem Babenko
OOD
LMTD
89
15
0
26 Jul 2023
Benchmarking and Analyzing Generative Data for Visual Recognition
Yue Liu
Haotian Liu
Liangyu Chen
Yong Jae Lee
Cuiping Li
Ziwei Liu
EGVM
VLM
53
4
0
25 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models
Liang Wang
Nan Yang
Furu Wei
RALM
91
43
0
14 Jul 2023
Generating Benchmarks for Factuality Evaluation of Language Models
Dor Muhlgay
Ori Ram
Inbal Magar
Yoav Levine
Nir Ratner
Yonatan Belinkov
Omri Abend
Kevin Leyton-Brown
Amnon Shashua
Y. Shoham
HILM
76
98
0
13 Jul 2023
Copy Is All You Need
Tian Lan
Deng Cai
Yan Wang
Heyan Huang
Xian-Ling Mao
89
30
0
13 Jul 2023
Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Yuzhuang Xu
Shuo Wang
Peng Li
Xuebo Liu
Xiaolong Wang
Weidong Liu
Yang Liu
104
1
0
12 Jul 2023
ReLoRA: High-Rank Training Through Low-Rank Updates
Vladislav Lialin
Namrata Shivagunde
Sherin Muckatira
Anna Rumshisky
BDL
107
117
0
11 Jul 2023
Linear Alignment of Vision-language Models for Image Captioning
Fabian Paischer
M. Hofmarcher
Sepp Hochreiter
Thomas Adler
CLIP
VLM
181
0
0
10 Jul 2023
Focused Transformer: Contrastive Training for Context Scaling
Szymon Tworkowski
Konrad Staniszewski
Mikolaj Pacek
Yuhuai Wu
Henryk Michalewski
Piotr Milo's
83
141
0
06 Jul 2023
Multimodal Prompt Retrieval for Generative Visual Question Answering
Timothy Ossowski
Junjie Hu
43
1
0
30 Jun 2023
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
Kaiyu Yang
Aidan M. Swope
Alex Gu
Rahul Chalamala
Peiyang Song
Shixing Yu
Saad Godil
R. Prenger
Anima Anandkumar
RALM
160
247
0
27 Jun 2023
Long-range Language Modeling with Self-retrieval
Ohad Rubin
Jonathan Berant
RALM
KELM
101
20
0
23 Jun 2023
RepoFusion: Training Code Models to Understand Your Repository
Disha Shrivastava
Denis Kocetkov
H. D. Vries
Dzmitry Bahdanau
Torsten Scholak
135
29
0
19 Jun 2023
Guiding Language Models of Code with Global Context using Monitors
Lakshya A Agrawal
Aditya Kanade
Navin Goyal
Shuvendu K. Lahiri
S. Rajamani
148
27
0
19 Jun 2023
GLIMMER: generalized late-interaction memory reranker
Michiel de Jong
Yury Zemlyanskiy
Nicholas FitzGerald
Sumit Sanghai
William W. Cohen
Joshua Ainslie
RALM
102
5
0
17 Jun 2023
Neural Priming for Sample-Efficient Adaptation
Matthew Wallingford
Vivek Ramanujan
Alex Fang
Aditya Kusupati
Roozbeh Mottaghi
Aniruddha Kembhavi
Ludwig Schmidt
Ali Farhadi
VLM
194
15
0
16 Jun 2023
Encyclopedic VQA: Visual questions about detailed properties of fine-grained categories
Thomas Mensink
J. Uijlings
Lluis Castrejon
A. Goel
Felipe Cadar
Howard Zhou
Fei Sha
A. Araújo
V. Ferrari
94
44
0
15 Jun 2023
Retrieval-Enhanced Contrastive Vision-Text Models
Ahmet Iscen
Mathilde Caron
Alireza Fathi
Cordelia Schmid
CLIP
VLM
111
28
0
12 Jun 2023
Retrosynthesis Prediction with Local Template Retrieval
Shufang Xie
Rui Yan
Junliang Guo
Yingce Xia
Lijun Wu
Tao Qin
103
12
0
07 Jun 2023
Information Flow Control in Machine Learning through Modular Model Architecture
Trishita Tiwari
Suchin Gururangan
Chuan Guo
Weizhe Hua
Sanjay Kariyappa
Udit Gupta
Wenjie Xiong
Kiwan Maeng
Hsien-Hsin S. Lee
G. E. Suh
75
6
0
05 Jun 2023
Retrieval-Enhanced Visual Prompt Learning for Few-shot Classification
Jintao Rong
Hao Chen
Tianrun Chen
Linlin Ou
Xinyi Yu
Yifan Liu
VLM
VPVLM
79
6
0
04 Jun 2023
The Information Pathways Hypothesis: Transformers are Dynamic Self-Ensembles
Md Shamim Hussain
Mohammed J Zaki
D. Subramanian
176
3
0
02 Jun 2023
Exposing Attention Glitches with Flip-Flop Language Modeling
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
LRM
97
53
0
01 Jun 2023
LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting
R. Ramos
Bruno Martins
Desmond Elliott
VLM
75
16
0
31 May 2023
Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy
Pengfei Yu
Heng Ji
KELM
80
10
0
29 May 2023
Test-Time Training on Nearest Neighbors for Large Language Models
Moritz Hardt
Yu Sun
VLM
RALM
151
30
0
29 May 2023
Prompt-Guided Retrieval Augmentation for Non-Knowledge-Intensive Tasks
Zhicheng Guo
Sijie Cheng
Yile Wang
Peng Li
Yang Liu
RALM
61
21
0
28 May 2023
Landmark Attention: Random-Access Infinite Context Length for Transformers
Amirkeivan Mohtashami
Martin Jaggi
LLMAG
153
164
0
25 May 2023
Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models
Ehsan Doostmohammadi
Tobias Norlund
Marco Kuhlmann
Richard Johansson
RALM
91
8
0
25 May 2023
Enhancing Grammatical Error Correction Systems with Explanations
Yuejiao Fei
Leyang Cui
Sen Yang
Wai Lam
Zhenzhong Lan
Shuming Shi
86
15
0
25 May 2023
Aligning Language Models to User Opinions
EunJeong Hwang
Bodhisattwa Prasad Majumder
Niket Tandon
84
75
0
24 May 2023
Privacy Implications of Retrieval-Based Language Models
Yangsibo Huang
Samyak Gupta
Zexuan Zhong
Keqin Li
Danqi Chen
RALM
61
30
0
24 May 2023
Adapting Language Models to Compress Contexts
Alexis Chevalier
Alexander Wettig
Anirudh Ajith
Danqi Chen
LLMAG
92
192
0
24 May 2023
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding
Weijia Shi
Xiaochuang Han
M. Lewis
Yulia Tsvetkov
Luke Zettlemoyer
Scott Yih
HILM
84
215
0
24 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA
HILM
177
358
0
24 May 2023
KNN-LM Does Not Improve Open-ended Text Generation
Shufan Wang
Yixiao Song
Andrew Drozdov
Aparna Garimella
Varun Manjunatha
Mohit Iyyer
RALM
101
8
0
24 May 2023
Accessing Higher Dimensions for Unsupervised Word Translation
Sida I. Wang
76
0
0
23 May 2023
To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph Completion
Rui Li
Xu Chen
Chaozhuo Li
Yanming Shen
Jianan Zhao
...
Weihao Han
Hao Sun
Weiwei Deng
Qi Zhang
Xing Xie
70
7
0
23 May 2023
Non-parametric, Nearest-neighbor-assisted Fine-tuning for Neural Machine Translation
Jiayi Wang
Ke Min Wang
Yuqi Zhang
Yu Zhao
Pontus Stenetorp
52
0
0
23 May 2023
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
326
181
0
22 May 2023
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen
May Hamri
Mor Geva
Amir Globerson
HILM
126
144
0
22 May 2023
Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer
R. Gao
Zhirui Zhang
Yichao Du
Lemao Liu
Rui Wang
129
2
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
143
6
0
21 May 2023
Description-Based Text Similarity
Shauli Ravfogel
Valentina Pyatkin
Amir D. N. Cohen
Avshalom Manevich
Yoav Goldberg
81
5
0
21 May 2023
Complex Claim Verification with Evidence Retrieved in the Wild
Jifan Chen
Grace Kim
Aniruddh Sriram
Greg Durrett
Eunsol Choi
HILM
114
82
0
19 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
156
399
0
19 May 2023
Decouple knowledge from parameters for plug-and-play language modeling
Xin Cheng
Yankai Lin
Preslav Nakov
Dongyan Zhao
Rui Yan
KELM
95
2
0
19 May 2023
Previous
1
2
3
...
6
7
8
...
10
11
12
Next