Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.03846
Cited By
Larger language models do in-context learning differently
7 March 2023
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
Yifeng Lu
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Larger language models do in-context learning differently"
50 / 280 papers shown
Title
Fine-tune Language Models to Approximate Unbiased In-context Learning
Timothy Chu
Zhao-quan Song
Chiwun Yang
27
15
0
05 Oct 2023
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions
S. Bhattamishra
Arkil Patel
Phil Blunsom
Varun Kanade
21
41
0
04 Oct 2023
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
Tianci Xue
Ziqi Wang
Yixia Li
Yun-Nung Chen
Guanhua Chen
26
2
0
02 Oct 2023
Self-Supervised Open-Ended Classification with Small Visual Language Models
Mohammad Mahdi Derakhshani
Ivona Najdenkoska
Cees G. M. Snoek
M. Worring
Yuki M. Asano
VLM
22
0
0
30 Sep 2023
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu
Wenlin Yao
Jianshu Chen
Xiaoman Pan
Xiaoyang Wang
Ninghao Liu
Dong Yu
LRM
20
27
0
30 Sep 2023
Understanding In-Context Learning from Repetitions
Jianhao Yan
Jin Xu
Chiyu Song
Chenming Wu
Yafu Li
Yue Zhang
27
20
0
30 Sep 2023
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
Elizabeth Seger
Noemi Dreksler
Richard Moulange
Emily Dardaman
Jonas Schuett
...
Emma Bluemke
Michael Aird
Patrick Levermore
Julian Hazell
Abhishek Gupta
20
40
0
29 Sep 2023
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering
Han Zhou
Xingchen Wan
Lev Proleev
Diana Mincu
Jilin Chen
Katherine A. Heller
Subhrajit Roy
UQLM
31
53
0
29 Sep 2023
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
Xin Cheng
Xun Wang
Tao Ge
Si-Qing Chen
Heng Chang
Dongyan Zhao
Rui Yan
69
2
0
29 Sep 2023
Attention Sorting Combats Recency Bias In Long Context Language Models
A. Peysakhovich
Adam Lerer
LRM
RALM
36
42
0
28 Sep 2023
Prompting and Fine-Tuning Open-Sourced Large Language Models for Stance Classification
Iain J. Cruickshank
Lynnette Hui Xian Ng
24
9
0
24 Sep 2023
In-Context Learning for Text Classification with Many Labels
Aristides Milios
Siva Reddy
Dzmitry Bahdanau
20
34
0
19 Sep 2023
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Suhas Kotha
Jacob Mitchell Springer
Aditi Raghunathan
CLL
42
57
0
18 Sep 2023
Prompt a Robot to Walk with Large Language Models
Yen-Jen Wang
Bike Zhang
Jianyu Chen
K. Sreenath
LM&Ro
LLMAG
32
49
0
18 Sep 2023
Ambiguity-Aware In-Context Learning with Large Language Models
Lingyu Gao
Aditi Chaudhary
Krishna Srinivasan
Kazuma Hashimoto
K. Raman
Michael Bendersky
21
7
0
14 Sep 2023
Breaking through the learning plateaus of in-context learning in Transformer
Jingwen Fu
Tao Yang
Yuwang Wang
Yan Lu
Nanning Zheng
30
1
0
12 Sep 2023
Large Language Models as Optimizers
Chengrun Yang
Xuezhi Wang
Yifeng Lu
Hanxiao Liu
Quoc V. Le
Denny Zhou
Xinyun Chen
ODL
43
376
0
07 Sep 2023
Gender-specific Machine Translation with Large Language Models
Eduardo Sánchez
Pierre Yves Andrews
Pontus Stenetorp
Mikel Artetxe
Marta R. Costa-jussá
32
2
0
06 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
49
93
0
04 Sep 2023
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jundong Li
LRM
29
411
0
02 Sep 2023
Context Aware Query Rewriting for Text Rankers using LLM
Abhijit Anand
Venktesh V
Vinay Setty
Avishek Anand
35
17
0
31 Aug 2023
Inductive-bias Learning: Generating Code Models with Large Language Model
Toma Tanaka
Naofumi Emoto
Tsukasa Yumibayashi
AI4CE
19
0
0
19 Aug 2023
CausalLM is not optimal for in-context learning
Nan Ding
Tomer Levinboim
Jialin Wu
Sebastian Goodman
Radu Soricut
24
23
0
14 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
76
232
0
12 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
43
41
0
01 Aug 2023
FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis
Ziao Wang
Yuhang Li
Junda Wu
Jaehyeon Soon
Xiaofeng Zhang
MLLM
17
15
0
31 Jul 2023
An Effective Data Creation Pipeline to Generate High-quality Financial Instruction Data for Large Language Model
Ziao Wang
Jianning Wang
Junda Wu
Xiaofeng Zhang
ALM
28
0
0
31 Jul 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jannik Kossen
Y. Gal
Tom Rainforth
37
27
0
23 Jul 2023
Instruction-following Evaluation through Verbalizer Manipulation
Shiyang Li
Jun Yan
Hai Wang
Zheng Tang
Xiang Ren
Vijay Srinivasan
Hongxia Jin
36
25
0
20 Jul 2023
Overthinking the Truth: Understanding how Language Models Process False Demonstrations
Danny Halawi
Jean-Stanislas Denain
Jacob Steinhardt
28
53
0
18 Jul 2023
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps
Fuxiao Liu
Paiheng Xu
Zongxi Li
Yue Feng
Hyemi Song
19
31
0
11 Jul 2023
One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention
Arvind V. Mahankali
Tatsunori B. Hashimoto
Tengyu Ma
MLT
18
80
0
07 Jul 2023
InstructEval: Systematic Evaluation of Instruction Selection Methods
Anirudh Ajith
Chris Pan
Mengzhou Xia
A. Deshpande
Karthik Narasimhan
ELM
25
16
0
01 Jul 2023
Personality Traits in Large Language Models
Gregory Serapio-García
Mustafa Safdari
Clément Crepy
Luning Sun
Stephen Fitz
P. Romero
Marwa Abdulhai
Aleksandra Faust
Maja J. Matarić
LM&MA
LLMAG
58
119
0
01 Jul 2023
GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models
P. Rajpoot
Ankur P. Parikh
24
14
0
30 Jun 2023
Understanding In-Context Learning via Supportive Pretraining Data
Xiaochuang Han
Daniel Simig
Todor Mihaylov
Yulia Tsvetkov
Asli Celikyilmaz
Tianlu Wang
AIMat
35
33
0
26 Jun 2023
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Allan Raventós
Mansheej Paul
F. Chen
Surya Ganguli
27
70
0
26 Jun 2023
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
Xuan-Phi Nguyen
Sharifah Mahani Aljunied
Shafiq R. Joty
Lidong Bing
18
32
0
20 Jun 2023
TART: A plug-and-play Transformer module for task-agnostic reasoning
Kush S. Bhatia
A. Narayan
Chris De Sa
Christopher Ré
LRM
ReLM
VLM
28
14
0
13 Jun 2023
In-Context Learning through the Bayesian Prism
Madhuri Panwar
Kabir Ahuja
Navin Goyal
BDL
34
38
0
08 Jun 2023
Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning
Yingcong Li
Kartik K. Sreenivasan
Angeliki Giannou
Dimitris Papailiopoulos
Samet Oymak
LRM
16
16
0
30 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
62
187
0
29 May 2023
Taming AI Bots: Controllability of Neural States in Large Language Models
Stefano Soatto
Paulo Tabuada
Pratik Chaudhari
Tianwei Liu
LLMAG
LM&Ro
18
13
0
29 May 2023
Mitigating Label Biases for In-context Learning
Yu Fei
Yifan Hou
Zeming Chen
Antoine Bosselut
35
69
0
28 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas L. Griffiths
N. Jha
LRM
MLLM
32
1
0
26 May 2023
A Closer Look at In-Context Learning under Distribution Shifts
Kartik Ahuja
David Lopez-Paz
40
14
0
26 May 2023
READ: Recurrent Adaptation of Large Transformers
Sida I. Wang
John Nguyen
Ke Li
Carole-Jean Wu
22
11
0
24 May 2023
Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations
Wei-Lin Chen
Cheng-Kuang Wu
Yun-Nung Chen
Hsin-Hsi Chen
21
27
0
24 May 2023
Adversarial Demonstration Attacks on Large Language Models
Jiong Wang
Zi-yang Liu
Keun Hee Park
Zhuojun Jiang
Zhaoheng Zheng
Zhuofeng Wu
Muhao Chen
Chaowei Xiao
SILM
22
52
0
24 May 2023
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan Ö. Arik
Tomas Pfister
LRM
38
9
0
24 May 2023
Previous
1
2
3
4
5
6
Next