2111.02080
An Explanation of In-context Learning as Implicit Bayesian Inference
3 November 2021
Sang Michael Xie
Aditi Raghunathan
Percy Liang
Tengyu Ma
ReLM
BDL
VPVLM
LRM
Papers citing
"An Explanation of In-context Learning as Implicit Bayesian Inference"
50 / 562 papers shown
Title
Leveraging Large Language Models for Exploiting ASR Uncertainty
Pranay Dighe
Yi Su
Shangshang Zheng
Yunshu Liu
Vineet Garg
Xiaochuan Niu
Ahmed H. Tewfik
72
13
0
09 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
148
100
0
04 Sep 2023
Inductive-bias Learning: Generating Code Models with Large Language Model
Toma Tanaka
Naofumi Emoto
Tsukasa Yumibayashi
AI4CE
61
0
0
19 Aug 2023
DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue
Lang Cao
LM&MA
LLMAG
69
4
0
15 Aug 2023
CausalLM is not optimal for in-context learning
Nan Ding
Tomer Levinboim
Jialin Wu
Sebastian Goodman
Radu Soricut
72
26
0
14 Aug 2023
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text
Nandana Mihindukulasooriya
Sanju Tiwari
Carlos F. Enguix
K. Lata
89
62
0
04 Aug 2023
When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities
Jin Chen
Zheng Liu
Xunpeng Huang
Chenwang Wu
Qi Liu
...
Yuxuan Lei
Xiaolong Chen
Xingmei Wang
Defu Lian
Enhong Chen
ALM
92
129
0
31 Jul 2023
Uncertainty in Natural Language Generation: From Theory to Applications
Joris Baan
Nico Daheim
Evgenia Ilia
Dennis Ulmer
Haau-Sing Li
Raquel Fernández
Barbara Plank
Rico Sennrich
Chrysoula Zerva
Wilker Aziz
UQLM
155
45
0
28 Jul 2023
Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning
Xindi Wang
Yufei Wang
Can Xu
Xiubo Geng
Bowen Zhang
Chongyang Tao
Frank Rudzicz
Robert E. Mercer
Daxin Jiang
87
11
0
28 Jul 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jannik Kossen
Y. Gal
Tom Rainforth
132
36
0
23 Jul 2023
What can a Single Attention Layer Learn? A Study Through the Random Features Lens
Hengyu Fu
Tianyu Guo
Yu Bai
Song Mei
MLT
102
26
0
21 Jul 2023
Overthinking the Truth: Understanding how Language Models Process False Demonstrations
Danny Halawi
Jean-Stanislas Denain
Jacob Steinhardt
92
59
0
18 Jul 2023
On the (In)Effectiveness of Large Language Models for Chinese Text Correction
Hai-Tao Zheng
Haojing Huang
Shirong Ma
Yong Jiang
Yongqian Li
F. Zhou
Haitao Zheng
Qingyu Zhou
107
47
0
18 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models
Liang Wang
Nan Yang
Furu Wei
RALM
91
43
0
14 Jul 2023
Large Language Models
Michael R Douglas
LLMAG
LM&MA
174
645
0
11 Jul 2023
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps
Fuxiao Liu
Paiheng Xu
Zongxi Li
Yue Feng
Hyemi Song
116
35
0
11 Jul 2023
Large Language Models as General Pattern Machines
Suvir Mirchandani
F. Xia
Peter R. Florence
Brian Ichter
Danny Driess
Montse Gonzalez Arenas
Kanishka Rao
Dorsa Sadigh
Andy Zeng
LLMAG
133
201
0
10 Jul 2023
Bidirectional Attention as a Mixture of Continuous Word Experts
Kevin Christian Wibisono
Yixin Wang
MoE
28
0
0
08 Jul 2023
One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention
Arvind V. Mahankali
Tatsunori B. Hashimoto
Tengyu Ma
MLT
83
102
0
07 Jul 2023
Amplifying Limitations, Harms and Risks of Large Language Models
Michael O'Neill
M. Connor
49
9
0
06 Jul 2023
Scaling In-Context Demonstrations with Structured Attention
Tianle Cai
Kaixuan Huang
Jason D. Lee
Mengdi Wang
LRM
80
8
0
05 Jul 2023
External Reasoning: Towards Multi-Large-Language-Models Interchangeable Assistance with Human Feedback
Akide Liu
KELM
LRM
37
1
0
05 Jul 2023
Trainable Transformer in Transformer
A. Panigrahi
Sadhika Malladi
Mengzhou Xia
Sanjeev Arora
VLM
118
13
0
03 Jul 2023
Still No Lie Detector for Language Models: Probing Empirical and Conceptual Roadblocks
B. Levinstein
Daniel A. Herrmann
102
61
0
30 Jun 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning
Aaron Mueller
Kanika Narang
Lambert Mathias
Qifan Wang
Hamed Firooz
RALM
77
3
0
30 Jun 2023
DisasterResponseGPT: Large Language Models for Accelerated Plan of Action Development in Disaster Response Scenarios
Vinicius G. Goecks
Nicholas R. Waytowich
77
31
0
29 Jun 2023
Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start Recommendations
Xuansheng Wu
Huachi Zhou
Yucheng Shi
Wenlin Yao
Xiao Shi Huang
Ninghao Liu
LRM
104
13
0
29 Jun 2023
Understanding In-Context Learning via Supportive Pretraining Data
Xiaochuang Han
Daniel Simig
Todor Mihaylov
Yulia Tsvetkov
Asli Celikyilmaz
Tianlu Wang
AIMat
113
38
0
26 Jun 2023
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Allan Raventós
Mansheej Paul
F. Chen
Surya Ganguli
127
87
0
26 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
118
86
0
26 Jun 2023
Beyond Scale: The Diversity Coefficient as a Data Quality Metric for Variability in Natural Language Data
Alycia Lee
Brando Miranda
Sudharsan Sundar
Allison Casasola
Rylan Schaeffer
Elyas Obbad
Sanmi Koyejo
131
17
0
24 Jun 2023
Harnessing the Power of Adversarial Prompting and Large Language Models for Robust Hypothesis Generation in Astronomy
I. Ciucă
Y. Ting 丁
Sandor Kruk
K. Iyer
86
11
0
20 Jun 2023
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
Xuan-Phi Nguyen
Sharifah Mahani Aljunied
Shafiq Joty
Lidong Bing
118
38
0
20 Jun 2023
Trained Transformers Learn Linear Models In-Context
Ruiqi Zhang
Spencer Frei
Peter L. Bartlett
97
207
0
16 Jun 2023
Pushing the Limits of ChatGPT on NLP Tasks
Xiaofei Sun
Linfeng Dong
Xiaoya Li
Zhen Wan
Shuhe Wang
...
Jiwei Li
Fei Cheng
Lingjuan Lyu
Leilei Gan
Guoyin Wang
AI4MH
LRM
117
32
0
16 Jun 2023
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Siva K. Swaminathan
Antoine Dedieu
Rajkumar Vasudeva Raju
Murray Shanahan
Miguel Lazaro-Gredilla
Dileep George
97
14
0
16 Jun 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Xiao Liu
Hanyu Lai
Hao Yu
Yifan Xu
Aohan Zeng
Zhengxiao Du
Peng Zhang
Yuxiao Dong
Jie Tang
78
105
0
13 Jun 2023
TART: A plug-and-play Transformer module for task-agnostic reasoning
Kush S. Bhatia
A. Narayan
Chris De Sa
Christopher Ré
LRM
ReLM
VLM
63
15
0
13 Jun 2023
In-Context Learning through the Bayesian Prism
Madhuri Panwar
Kabir Ahuja
Navin Goyal
BDL
89
48
0
08 Jun 2023
Multi-modal Latent Diffusion
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
DiffM
98
13
0
07 Jun 2023
Birth of a Transformer: A Memory Viewpoint
A. Bietti
Vivien A. Cabannes
Diane Bouchacourt
Hervé Jégou
Léon Bottou
112
96
0
01 Jun 2023
On Masked Pre-training and the Marginal Likelihood
Pablo Moreno-Muñoz
Pol G. Recasens
Søren Hauberg
SSL
55
6
0
01 Jun 2023
Transformers learn to implement preconditioned gradient descent for in-context learning
Kwangjun Ahn
Xiang Cheng
Hadi Daneshmand
S. Sra
ODL
95
176
0
01 Jun 2023
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization
Yufeng Zhang
Fengzhuo Zhang
Zhuoran Yang
Zhaoran Wang
BDL
104
74
0
30 May 2023
Contextual Vision Transformers for Robust Representation Learning
Yu Bao
Theofanis Karaletsos
ViT
47
14
0
30 May 2023
Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning
Yingcong Li
Kartik K. Sreenivasan
Angeliki Giannou
Dimitris Papailiopoulos
Samet Oymak
LRM
113
18
0
30 May 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback
Shengchao Liu
Jiong Wang
Yijin Yang
Chengpeng Wang
Ling Liu
Hongyu Guo
Chaowei Xiao
LM&MA
KELM
AI4MH
107
38
0
29 May 2023
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models
Yuhui Zhang
Michihiro Yasunaga
Zhengping Zhou
Jeff Z. HaoChen
James Zou
Percy Liang
Serena Yeung
95
9
0
27 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas Griffiths
N. Jha
LRM
MLLM
103
2
0
26 May 2023
A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks
Jacob D. Abernethy
Alekh Agarwal
T. V. Marinov
Manfred K. Warmuth
85
21
0
26 May 2023