ResearchTrend.AI
Transformer Memory as a Differentiable Search Index

14 February 2022
Yi Tay
Vinh Q. Tran
Mostafa Dehghani
Jianmo Ni
Dara Bahri
Harsh Mehta
Zhen Qin
Kai Hui
Zhe Zhao
Jai Gupta
Tal Schuster
William W. Cohen
Donald Metzler

Papers citing "Transformer Memory as a Differentiable Search Index"

45 / 45 papers shown
GIF: Generative Inspiration for Face Recognition at Scale
Saeed Ebrahimi
Sahar Rahimi
Ali Dabouei
Srinjoy Das
Jeremy M. Dawson
Nasser M. Nasrabadi
CVBM
490
0
0
05 May 2025
Unified Generative Search and Recommendation
Teng Shi
Jun Xu
Xiao Zhang
Xiaoxue Zang
Kai Zheng
Yang Song
Enyun Yu
109
1
0
08 Apr 2025
Universal Item Tokenization for Transferable Generative Recommendation
Bowen Zheng
Hongyu Lu
Yu Chen
Wayne Xin Zhao
Ji-Rong Wen
104
0
0
06 Apr 2025
Pre-training Generative Recommender with Multi-Identifier Item Tokenization
Bowen Zheng
Enze Liu
Zhongfu Chen
Zhongrui Ma
Yue Wang
Wayne Xin Zhao
Ji-Rong Wen
138
0
0
06 Apr 2025
Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval
Enrico Palumbo
Gustavo Penha
Andreas Damianou
José Luis Redondo García
Timothy Christopher Heath
Alice Wang
Hugues Bouchard
M. Lalmas
116
0
0
31 Mar 2025
Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search
Yedan Shen
Kaixin Wu
Yuechen Ding
Jingyuan Wen
Hong Liu
Mingjie Zhong
Zhouhan Lin
Jia Xu
Linjian Mo
RALM
116
0
0
27 Mar 2025
Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
Chahine-Nicolas Zede
Laurent Carrafa
Valérie Gouet-Brunet
3DPC
120
0
0
28 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM, 3DV
146
2
0
21 Feb 2025
On Storage Neural Network Augmented Approximate Nearest Neighbor Search
Taiga Ikeda
Daisuke Miyashita
J. Deguchi
65
0
0
23 Jan 2025
Generative Retrieval for Book search
Yubao Tang
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Shihao Liu
Shuaiqiang Wang
Dawei Yin
Xueqi Cheng
RALM
141
0
0
19 Jan 2025
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Weiwei Sun
Lingyong Yan
Xinyu Ma
Shuaiqiang Wang
Pengjie Ren
Zhumin Chen
Dawei Yin
Zhaochun Ren
RALM, ALM, ELM, LRM, LM&MA
181
308
0
31 Dec 2024
LLM-Assisted Relevance Assessments: When Should We Ask LLMs for Help?
Rikiya Takehi
E. Voorhees
Tetsuya Sakai
I. Soboroff
193
3
0
11 Nov 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
167
56
0
23 Apr 2024
LaMDA: Language Models for Dialog Applications
R. Thoppilan
Daniel De Freitas
Jamie Hall
Noam M. Shazeer
Apoorv Kulshreshtha
...
Blaise Aguera-Arcas
Claire Cui
M. Croak
Ed H. Chi
Quoc Le
ALM
137
1,595
0
20 Jan 2022
WebGPT: Browser-assisted question-answering with human feedback
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Long Ouyang
...
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM, RALM
187
1,275
0
17 Dec 2021
GenIE: Generative Information Extraction
Martin Josifoski
Nicola De Cao
Maxime Peyrard
Fabio Petroni
Robert West
109
66
0
15 Dec 2021
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts
Nan Du
Yanping Huang
Andrew M. Dai
Simon Tong
Dmitry Lepikhin
...
Kun Zhang
Quoc V. Le
Yonghui Wu
Zhifeng Chen
Claire Cui
ALM, MoE
216
813
0
13 Dec 2021
Improving language models by retrieving from trillions of tokens
Sebastian Borgeaud
A. Mensch
Jordan Hoffmann
Trevor Cai
Eliza Rutherford
...
Simon Osindero
Karen Simonyan
Jack W. Rae
Erich Elsen
Laurent Sifre
KELM, RALM
244
1,086
0
08 Dec 2021
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
V. Aribandi
Yi Tay
Tal Schuster
J. Rao
H. Zheng
...
Jianmo Ni
Jai Gupta
Kai Hui
Sebastian Ruder
Donald Metzler
MoE
97
216
0
22 Nov 2021
Exploring the Limits of Large Scale Pre-training
Samira Abnar
Mostafa Dehghani
Behnam Neyshabur
Hanie Sedghi
AI4CE
94
119
0
05 Oct 2021
Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Yi Tay
Mostafa Dehghani
J. Rao
W. Fedus
Samira Abnar
Hyung Won Chung
Sharan Narang
Dani Yogatama
Ashish Vaswani
Donald Metzler
250
114
0
22 Sep 2021
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Jianmo Ni
Gustavo Hernández Ábrego
Noah Constant
Ji Ma
Keith B. Hall
Daniel Cer
Yinfei Yang
212
563
0
19 Aug 2021
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw, SSL
261
3,396
0
18 Apr 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
W. Fedus
Barret Zoph
Noam M. Shazeer
MoE
88
2,187
0
11 Jan 2021
Transformer Feed-Forward Layers Are Key-Value Memories
Mor Geva
R. Schuster
Jonathan Berant
Omer Levy
KELM
158
828
0
29 Dec 2020
Learned Indexes for a Google-scale Disk-based Database
Hussam Abu-Libdeh
Deniz Altinbuken
Alex Beutel
Ed H. Chi
Lyric Doshi
Tim Kraska
Xiaozhou Li
Andy Ly
Christopher Olston
51
41
0
23 Dec 2020
Autoregressive Entity Retrieval
Nicola De Cao
Gautier Izacard
Sebastian Riedel
Fabio Petroni
174
445
0
02 Oct 2020
Generation-Augmented Retrieval for Open-domain Question Answering
Yuning Mao
Pengcheng He
Xiaodong Liu
Yelong Shen
Jianfeng Gao
Jiawei Han
Weizhu Chen
RALM
92
248
0
17 Sep 2020
Hopfield Networks is All You Need
Hubert Ramsauer
Bernhard Schäfl
Johannes Lehner
Philipp Seidl
Michael Widrich
...
David P. Kreil
Michael K Kopp
Günter Klambauer
Johannes Brandstetter
Sepp Hochreiter
111
433
0
16 Jul 2020
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
M. Krikun
Noam M. Shazeer
Zhifeng Chen
MoE
103
1,165
0
30 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
799
42,055
0
28 May 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
143
739
0
02 May 2020
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
195
3,762
0
10 Apr 2020
How Much Knowledge Can You Pack Into the Parameters of a Language Model?
Adam Roberts
Colin Raffel
Noam M. Shazeer
KELM
110
891
0
10 Feb 2020
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu
Kenton Lee
Zora Tung
Panupong Pasupat
Ming-Wei Chang
RALM
132
2,102
0
10 Feb 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
608
4,822
0
23 Jan 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
442
20,181
0
23 Oct 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktäschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM, AI4MH
571
2,670
0
03 Sep 2019
Document Expansion by Query Prediction
Rodrigo Nogueira
Wei Yang
Jimmy J. Lin
Kyunghyun Cho
114
415
0
17 Apr 2019
End-to-End Retrieval in Continuous Space
D. Gillick
Alessandro Presta
Gaurav Singh Tomar
94
103
0
19 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg
1.8K
94,891
0
11 Oct 2018
The Case for Learned Index Structures
Tim Kraska
Alex Beutel
Ed H. Chi
J. Dean
N. Polyzotis
76
1,042
0
04 Dec 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
704
131,652
0
12 Jun 2017
Neural Ranking Models with Weak Supervision
Mostafa Dehghani
Hamed Zamani
Aliaksei Severyn
J. Kamps
W. Bruce Croft
70
420
0
28 Apr 2017
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
437
20,568
0
10 Sep 2014