v1v2v3v4v5v6 (latest)

An Explanation of In-context Learning as Implicit Bayesian Inference

3 November 2021

Papers citing "An Explanation of In-context Learning as Implicit Bayesian Inference"

50 / 562 papers shown

Title
Leveraging Large Language Models for Exploiting ASR Uncertainty Pranay Dighe Yi Su Shangshang Zheng Yunshu Liu Vineet Garg Xiaochuan Niu Ahmed H. Tewfik 72 13 0 09 Sep 2023
Are Emergent Abilities in Large Language Models just In-Context Learning? Sheng Lu Irina Bigoulaeva Rachneet Sachdeva Harish Tayyar Madabushi Iryna Gurevych LRM ELM ReLM 148 100 0 04 Sep 2023
Inductive-bias Learning: Generating Code Models with Large Language Model Toma Tanaka Naofumi Emoto Tsukasa Yumibayashi AI4CE 61 0 0 19 Aug 2023
DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue Lang Cao LM&MA LLMAG 69 4 0 15 Aug 2023
CausalLM is not optimal for in-context learning Nan Ding Tomer Levinboim Jialin Wu Sebastian Goodman Radu Soricut 72 26 0 14 Aug 2023
Text2KGBench: A Benchmark for Ontology-Driven Knowledge Graph Generation from Text Nandana Mihindukulasooriya Sanju Tiwari Carlos F. Enguix K. Lata 89 62 0 04 Aug 2023
When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities Jin Chen Zheng Liu Xunpeng Huang Chenwang Wu Qi Liu ... Yuxuan Lei Xiaolong Chen Xingmei Wang Defu Lian Enhong Chen ALM 92 129 0 31 Jul 2023
Uncertainty in Natural Language Generation: From Theory to Applications Joris Baan Nico Daheim Evgenia Ilia Dennis Ulmer Haau-Sing Li Raquel Fernández Barbara Plank Rico Sennrich Chrysoula Zerva Wilker Aziz UQLM 155 45 0 28 Jul 2023
Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning Xindi Wang Yufei Wang Can Xu Xiubo Geng Bowen Zhang Chongyang Tao Frank Rudzicz Robert E. Mercer Daxin Jiang 87 11 0 28 Jul 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning Jannik Kossen Y. Gal Tom Rainforth 132 36 0 23 Jul 2023
What can a Single Attention Layer Learn? A Study Through the Random Features Lens Hengyu Fu Tianyu Guo Yu Bai Song Mei MLT 102 26 0 21 Jul 2023
Overthinking the Truth: Understanding how Language Models Process False Demonstrations Danny Halawi Jean-Stanislas Denain Jacob Steinhardt 92 59 0 18 Jul 2023
On the (In)Effectiveness of Large Language Models for Chinese Text Correction Hai-Tao Zheng Haojing Huang Shirong Ma Yong Jiang Yongqian Li F. Zhou Haitao Zheng Qingyu Zhou 107 47 0 18 Jul 2023
Learning to Retrieve In-Context Examples for Large Language Models Liang Wang Nan Yang Furu Wei RALM 91 43 0 14 Jul 2023
Large Language Models Michael R Douglas LLMAG LM&MA 174 645 0 11 Jul 2023
Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps Fuxiao Liu Paiheng Xu Zongxi Li Yue Feng Hyemi Song 116 35 0 11 Jul 2023
Large Language Models as General Pattern Machines Suvir Mirchandani F. Xia Peter R. Florence Brian Ichter Danny Driess Montse Gonzalez Arenas Kanishka Rao Dorsa Sadigh Andy Zeng LLMAG 133 201 0 10 Jul 2023
Bidirectional Attention as a Mixture of Continuous Word Experts Kevin Christian Wibisono Yixin Wang MoE 28 0 0 08 Jul 2023
One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention Arvind V. Mahankali Tatsunori B. Hashimoto Tengyu Ma MLT 83 102 0 07 Jul 2023
Amplifying Limitations, Harms and Risks of Large Language Models Michael OÑeill M. Connor 49 9 0 06 Jul 2023
Scaling In-Context Demonstrations with Structured Attention Tianle Cai Kaixuan Huang Jason D. Lee Mengdi Wang LRM 80 8 0 05 Jul 2023
External Reasoning: Towards Multi-Large-Language-Models Interchangeable Assistance with Human Feedback Akide Liu KELM LRM 37 1 0 05 Jul 2023
Trainable Transformer in Transformer A. Panigrahi Sadhika Malladi Mengzhou Xia Sanjeev Arora VLM 118 13 0 03 Jul 2023
Still No Lie Detector for Language Models: Probing Empirical and Conceptual Roadblocks B. Levinstein Daniel A. Herrmann 102 61 0 30 Jun 2023
Meta-training with Demonstration Retrieval for Efficient Few-shot Learning Aaron Mueller Kanika Narang Lambert Mathias Qifan Wang Hamed Firooz RALM 77 3 0 30 Jun 2023
DisasterResponseGPT: Large Language Models for Accelerated Plan of Action Development in Disaster Response Scenarios Vinicius G. Goecks Nicholas R. Waytowich 77 31 0 29 Jun 2023
Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start Recommendations Xuansheng Wu Huachi Zhou Yucheng Shi Wenlin Yao Xiao Shi Huang Ninghao Liu LRM 104 13 0 29 Jun 2023
Understanding In-Context Learning via Supportive Pretraining Data Xiaochuang Han Daniel Simig Todor Mihaylov Yulia Tsvetkov Asli Celikyilmaz Tianlu Wang AIMat 113 38 0 26 Jun 2023
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression Allan Raventós Mansheej Paul F. Chen Surya Ganguli 127 87 0 26 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning Jonathan Lee Annie Xie Aldo Pacchiano Yash Chandak Chelsea Finn Ofir Nachum Emma Brunskill OffRL 118 86 0 26 Jun 2023
Beyond Scale: The Diversity Coefficient as a Data Quality Metric for Variability in Natural Language Data Alycia Lee Brando Miranda Sudharsan Sundar Allison Casasola Rylan Schaeffer Elyas Obbad Sanmi Koyejo 131 17 0 24 Jun 2023
Harnessing the Power of Adversarial Prompting and Large Language Models for Robust Hypothesis Generation in Astronomy I. Ciucă Y. Ting 丁 Sandor Kruk K. Iyer 86 11 0 20 Jun 2023
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts Xuan-Phi Nguyen Sharifah Mahani Aljunied Shafiq Joty Lidong Bing 118 38 0 20 Jun 2023
Trained Transformers Learn Linear Models In-Context Ruiqi Zhang Spencer Frei Peter L. Bartlett 97 207 0 16 Jun 2023
Pushing the Limits of ChatGPT on NLP Tasks Xiaofei Sun Linfeng Dong Xiaoya Li Zhen Wan Shuhe Wang ... Jiwei Li Fei Cheng Lingjuan Lyu Leilei Gan Guoyin Wang AI4MH LRM 117 32 0 16 Jun 2023
Schema-learning and rebinding as mechanisms of in-context learning and emergence Siva K. Swaminathan Antoine Dedieu Rajkumar Vasudeva Raju Murray Shanahan Miguel Lazaro-Gredilla Dileep George 97 14 0 16 Jun 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences Xiao Liu Hanyu Lai Hao Yu Yifan Xu Aohan Zeng Zhengxiao Du Peng Zhang Yuxiao Dong Jie Tang 78 105 0 13 Jun 2023
TART: A plug-and-play Transformer module for task-agnostic reasoning Kush S. Bhatia A. Narayan Chris De Sa Christopher Ré LRM ReLM VLM 63 15 0 13 Jun 2023
In-Context Learning through the Bayesian Prism Madhuri Panwar Kabir Ahuja Navin Goyal BDL 89 48 0 08 Jun 2023
Multi-modal Latent Diffusion Mustapha Bounoua Giulio Franzese Pietro Michiardi DiffM 98 13 0 07 Jun 2023
Birth of a Transformer: A Memory Viewpoint A. Bietti Vivien A. Cabannes Diane Bouchacourt Hervé Jégou Léon Bottou 112 96 0 01 Jun 2023
On Masked Pre-training and the Marginal Likelihood Pablo Moreno-Muñoz Pol G. Recasens Søren Hauberg SSL 55 6 0 01 Jun 2023
Transformers learn to implement preconditioned gradient descent for in-context learning Kwangjun Ahn Xiang Cheng Hadi Daneshmand S. Sra ODL 95 176 0 01 Jun 2023
What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization Yufeng Zhang Fengzhuo Zhang Zhuoran Yang Zhaoran Wang BDL 104 74 0 30 May 2023
Contextual Vision Transformers for Robust Representation Learning Yu Bao Theofanis Karaletsos ViT 47 14 0 30 May 2023
Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning Yingcong Li Kartik K. Sreenivasan Angeliki Giannou Dimitris Papailiopoulos Samet Oymak LRM 113 18 0 30 May 2023
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback Shengchao Liu Jiong Wang Yijin Yang Chengpeng Wang Ling Liu Hongyu Guo Chaowei Xiao LM&MA KELM AI4MH 107 38 0 29 May 2023
Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models Yuhui Zhang Michihiro Yasunaga Zhengping Zhou Jeff Z. HaoChen James Zou Percy Liang Serena Yeung 95 9 0 27 May 2023
Im-Promptu: In-Context Composition from Image Prompts Bhishma Dedhia Michael Chang Jake C. Snell Thomas Griffiths N. Jha LRM MLLM 103 2 0 26 May 2023
A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks Jacob D. Abernethy Alekh Agarwal T. V. Marinov Manfred K. Warmuth 85 21 0 26 May 2023