ResearchTrend.AI
Larger language models do in-context learning differently


7 March 2023
Jerry W. Wei
Jason W. Wei
Yi Tay
Dustin Tran
Albert Webson
Yifeng Lu
Xinyun Chen
Hanxiao Liu
Da Huang
Denny Zhou
Tengyu Ma
    ReLM
    LRM

Papers citing "Larger language models do in-context learning differently"

50 / 280 papers shown
Fine-tune Language Models to Approximate Unbiased In-context Learning
Timothy Chu
Zhao-quan Song
Chiwun Yang
27
15
0
05 Oct 2023
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions
S. Bhattamishra
Arkil Patel
Phil Blunsom
Varun Kanade
21
41
0
04 Oct 2023
PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
Tianci Xue
Ziqi Wang
Yixia Li
Yun-Nung Chen
Guanhua Chen
26
2
0
02 Oct 2023
Self-Supervised Open-Ended Classification with Small Visual Language Models
Mohammad Mahdi Derakhshani
Ivona Najdenkoska
Cees G. M. Snoek
M. Worring
Yuki M. Asano
VLM
22
0
0
30 Sep 2023
From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning
Xuansheng Wu
Wenlin Yao
Jianshu Chen
Xiaoman Pan
Xiaoyang Wang
Ninghao Liu
Dong Yu
LRM
20
27
0
30 Sep 2023
Understanding In-Context Learning from Repetitions
Jianhao Yan
Jin Xu
Chiyu Song
Chenming Wu
Yafu Li
Yue Zhang
27
20
0
30 Sep 2023
Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives
Elizabeth Seger
Noemi Dreksler
Richard Moulange
Emily Dardaman
Jonas Schuett
...
Emma Bluemke
Michael Aird
Patrick Levermore
Julian Hazell
Abhishek Gupta
20
40
0
29 Sep 2023
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering
Han Zhou
Xingchen Wan
Lev Proleev
Diana Mincu
Jilin Chen
Katherine A. Heller
Subhrajit Roy
UQLM
31
53
0
29 Sep 2023
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
Xin Cheng
Xun Wang
Tao Ge
Si-Qing Chen
Heng Chang
Dongyan Zhao
Rui Yan
69
2
0
29 Sep 2023
Attention Sorting Combats Recency Bias In Long Context Language Models
A. Peysakhovich
Adam Lerer
LRM
RALM
36
42
0
28 Sep 2023
Prompting and Fine-Tuning Open-Sourced Large Language Models for Stance Classification
Iain J. Cruickshank
Lynnette Hui Xian Ng
24
9
0
24 Sep 2023
In-Context Learning for Text Classification with Many Labels
Aristides Milios
Siva Reddy
Dzmitry Bahdanau
20
34
0
19 Sep 2023
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Suhas Kotha
Jacob Mitchell Springer
Aditi Raghunathan
CLL
42
57
0
18 Sep 2023
Prompt a Robot to Walk with Large Language Models
Yen-Jen Wang
Bike Zhang
Jianyu Chen
K. Sreenath
LM&Ro
LLMAG
32
49
0
18 Sep 2023
Ambiguity-Aware In-Context Learning with Large Language Models
Lingyu Gao
Aditi Chaudhary
Krishna Srinivasan
Kazuma Hashimoto
K. Raman
Michael Bendersky
21
7
0
14 Sep 2023
Breaking through the learning plateaus of in-context learning in Transformer
Jingwen Fu
Tao Yang
Yuwang Wang
Yan Lu
Nanning Zheng
30
1
0
12 Sep 2023
Large Language Models as Optimizers
Chengrun Yang
Xuezhi Wang
Yifeng Lu
Hanxiao Liu
Quoc V. Le
Denny Zhou
Xinyun Chen
ODL
43
376
0
07 Sep 2023
Gender-specific Machine Translation with Large Language Models
Eduardo Sánchez
Pierre Yves Andrews
Pontus Stenetorp
Mikel Artetxe
Marta R. Costa-jussá
32
2
0
06 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu
Irina Bigoulaeva
Rachneet Sachdeva
Harish Tayyar Madabushi
Iryna Gurevych
LRM
ELM
ReLM
49
93
0
04 Sep 2023
Explainability for Large Language Models: A Survey
Haiyan Zhao
Hanjie Chen
Fan Yang
Ninghao Liu
Huiqi Deng
Hengyi Cai
Shuaiqiang Wang
Dawei Yin
Jundong Li
LRM
29
411
0
02 Sep 2023
Context Aware Query Rewriting for Text Rankers using LLM
Abhijit Anand
Venktesh V
Vinay Setty
Avishek Anand
35
17
0
31 Aug 2023
Inductive-bias Learning: Generating Code Models with Large Language Model
Toma Tanaka
Naofumi Emoto
Tsukasa Yumibayashi
AI4CE
19
0
0
19 Aug 2023
CausalLM is not optimal for in-context learning
Nan Ding
Tomer Levinboim
Jialin Wu
Sebastian Goodman
Radu Soricut
24
23
0
14 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
76
232
0
12 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
43
41
0
01 Aug 2023
FinVis-GPT: A Multimodal Large Language Model for Financial Chart Analysis
Ziao Wang
Yuhang Li
Junda Wu
Jaehyeon Soon
Xiaofeng Zhang
MLLM
17
15
0
31 Jul 2023
An Effective Data Creation Pipeline to Generate High-quality Financial Instruction Data for Large Language Model
Ziao Wang
Jianning Wang
Junda Wu
Xiaofeng Zhang
ALM
28
0
0
31 Jul 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jannik Kossen
Y. Gal
Tom Rainforth
37
27
0
23 Jul 2023
Instruction-following Evaluation through Verbalizer Manipulation
Shiyang Li
Jun Yan
Hai Wang
Zheng Tang
Xiang Ren
Vijay Srinivasan
Hongxia Jin
36
25
0
20 Jul 2023
Overthinking the Truth: Understanding how Language Models Process False Demonstrations
Danny Halawi
Jean-Stanislas Denain
Jacob Steinhardt
28
53
0
18 Jul 2023
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps
Fuxiao Liu
Paiheng Xu
Zongxi Li
Yue Feng
Hyemi Song
19
31
0
11 Jul 2023
One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention
Arvind V. Mahankali
Tatsunori B. Hashimoto
Tengyu Ma
MLT
18
80
0
07 Jul 2023
InstructEval: Systematic Evaluation of Instruction Selection Methods
Anirudh Ajith
Chris Pan
Mengzhou Xia
A. Deshpande
Karthik Narasimhan
ELM
25
16
0
01 Jul 2023
Personality Traits in Large Language Models
Gregory Serapio-García
Mustafa Safdari
Clément Crepy
Luning Sun
Stephen Fitz
P. Romero
Marwa Abdulhai
Aleksandra Faust
Maja J. Matarić
LM&MA
LLMAG
58
119
0
01 Jul 2023
GPT-FinRE: In-context Learning for Financial Relation Extraction using Large Language Models
P. Rajpoot
Ankur P. Parikh
24
14
0
30 Jun 2023
Understanding In-Context Learning via Supportive Pretraining Data
Xiaochuang Han
Daniel Simig
Todor Mihaylov
Yulia Tsvetkov
Asli Celikyilmaz
Tianlu Wang
AIMat
35
33
0
26 Jun 2023
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Allan Raventós
Mansheej Paul
F. Chen
Surya Ganguli
27
70
0
26 Jun 2023
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
Xuan-Phi Nguyen
Sharifah Mahani Aljunied
Shafiq R. Joty
Lidong Bing
18
32
0
20 Jun 2023
TART: A plug-and-play Transformer module for task-agnostic reasoning
Kush S. Bhatia
A. Narayan
Chris De Sa
Christopher Ré
LRM
ReLM
VLM
28
14
0
13 Jun 2023
In-Context Learning through the Bayesian Prism
Madhuri Panwar
Kabir Ahuja
Navin Goyal
BDL
34
38
0
08 Jun 2023
Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning
Yingcong Li
Kartik K. Sreenivasan
Angeliki Giannou
Dimitris Papailiopoulos
Samet Oymak
LRM
16
16
0
30 May 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
62
187
0
29 May 2023
Taming AI Bots: Controllability of Neural States in Large Language Models
Stefano Soatto
Paulo Tabuada
Pratik Chaudhari
Tianwei Liu
LLMAG
LM&Ro
18
13
0
29 May 2023
Mitigating Label Biases for In-context Learning
Yu Fei
Yifan Hou
Zeming Chen
Antoine Bosselut
35
69
0
28 May 2023
Im-Promptu: In-Context Composition from Image Prompts
Bhishma Dedhia
Michael Chang
Jake C. Snell
Thomas L. Griffiths
N. Jha
LRM
MLLM
32
1
0
26 May 2023
A Closer Look at In-Context Learning under Distribution Shifts
Kartik Ahuja
David Lopez-Paz
40
14
0
26 May 2023
READ: Recurrent Adaptation of Large Transformers
Sida I. Wang
John Nguyen
Ke Li
Carole-Jean Wu
22
11
0
24 May 2023
Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations
Wei-Lin Chen
Cheng-Kuang Wu
Yun-Nung Chen
Hsin-Hsi Chen
21
27
0
24 May 2023
Adversarial Demonstration Attacks on Large Language Models
Jiong Wang
Zi-yang Liu
Keun Hee Park
Zhuojun Jiang
Zhaoheng Zheng
Zhuofeng Wu
Muhao Chen
Chaowei Xiao
SILM
22
52
0
24 May 2023
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan Ö. Arik
Tomas Pfister
LRM
38
9
0
24 May 2023