Papers citing "Language Models are Few-Shot Learners"

50 / 12,362 papers shown

Title
Linearly Mapping from Image to Text Space Jack Merullo Louis Castricato Carsten Eickhoff Ellie Pavlick VLM 248 117 0 30 Sep 2022
PART: Pre-trained Authorship Representation Transformer Javier Huertas-Tato Álvaro Huertas-García Alejandro Martín 135 9 0 30 Sep 2022
Improving Molecular Pretraining with Complementary Featurizations Yanqiao Zhu Dingshuo Chen Yuanqi Du Yingze Wang Qiang Liu Shu Wu AI4CE 68 7 0 29 Sep 2022
Unpacking Large Language Models with Conceptual Consistency Pritish Sahu Michael Cogswell Yunye Gong Ajay Divakaran LRM 116 17 0 29 Sep 2022
Toward Trustworthy Neural Program Synthesis Darren Key Wen-Ding Li Kevin Ellis NAI 171 6 0 29 Sep 2022
Scaling Laws for a Multi-Agent Reinforcement Learning Model Oren Neumann C. Gros 92 27 0 29 Sep 2022
Compositional Semantic Parsing with Large Language Models Andrew Drozdov Nathanael Scharli Ekin Akyuurek Nathan Scales Xinying Song Xinyun Chen Olivier Bousquet Denny Zhou ReLM LRM 269 93 0 29 Sep 2022
Co-Writing Screenplays and Theatre Scripts with Language Models: An Evaluation by Industry Professionals Piotr Wojciech Mirowski Kory W. Mathewson Jaylen Pittman Richard Evans HAI 115 267 0 29 Sep 2022
Does Zero-Shot Reinforcement Learning Exist? Ahmed Touati Jérémy Rapin Yann Ollivier OffRL 116 46 0 29 Sep 2022
Repairing Bugs in Python Assignments Using Large Language Models Jialu Zhang J. Cambronero Sumit Gulwani Vu Le R. Piskac Gustavo Soares Gust Verbruggen KELM 77 56 0 29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data Uriel Singer Adam Polyak Thomas Hayes Xiaoyue Yin Jie An ... Oron Ashual Oran Gafni Devi Parikh Sonal Gupta Yaniv Taigman DiffM VGen 97 1,439 0 29 Sep 2022
A Multiagent Framework for the Asynchronous and Collaborative Extension of Multitask ML Systems Andrea Gesmundo 104 2 0 29 Sep 2022
Is Complexity Required for Neural Network Pruning? A Case Study on Global Magnitude Pruning Manas Gupta Efe Camci Vishandi Rudy Keneta Abhishek Vaidyanathan Ritwik Kanodia Chuan-Sheng Foo Wu Min Lin Jie 71 14 0 29 Sep 2022
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning Pan Lu Liang Qiu Kai-Wei Chang Ying Nian Wu Song-Chun Zhu Tanmay Rajpurohit Peter Clark Ashwin Kalyan ReLM LRM 207 300 0 29 Sep 2022
Dataset Distillation Using Parameter Pruning Guang Li Ren Togo Takahiro Ogawa Miki Haseyama DD 142 22 0 29 Sep 2022
Bidirectional Language Models Are Also Few-shot Learners Ajay Patel Bryan Li Mohammad Sadegh Rasooli Noah Constant Colin Raffel Chris Callison-Burch LRM 140 47 0 29 Sep 2022
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling Yusong Wu Josh Gardner Ethan Manilow Ian Simon Curtis Hawthorne Jesse Engel 91 10 0 28 Sep 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora Kundan Krishna Saurabh Garg Jeffrey P. Bigham Zachary Chase Lipton 108 33 0 28 Sep 2022
Clinical Language Understanding Evaluation (CLUE) Travis R. Goodwin Dina Demner-Fushman ELM LM&MA 28 1 0 28 Sep 2022
Who is GPT-3? An Exploration of Personality, Values and Demographics Marilù Miotto Nicola Rossberg Bennett Kleinberg ELM PILM 83 116 0 28 Sep 2022
Causal Proxy Models for Concept-Based Model Explanations Zhengxuan Wu Karel DÓosterlinck Atticus Geiger Amir Zur Christopher Potts MILM 132 37 0 28 Sep 2022
Prompt-driven efficient Open-set Semi-supervised Learning Haoran Li Chun-Mei Feng Tao Zhou Yong Xu Xiaojun Chang 126 4 0 28 Sep 2022
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks Zhiyang Chen Yousong Zhu Zhaowen Li Fan Yang Wei Li ... Chaoyang Zhao Liwei Wu Rui Zhao Jinqiao Wang Ming Tang VLM VOS 124 16 0 28 Sep 2022
YATO: Yet Another deep learning based Text analysis Open toolkit Zeqiang Wang Yile Wang Jiageng Wu Zhiyang Teng Jie Yang 100 3 0 28 Sep 2022
ButterflyFlow: Building Invertible Layers with Butterfly Matrices Chenlin Meng Linqi Zhou Kristy Choi Tri Dao Stefano Ermon TPM 163 12 0 28 Sep 2022
PROD: Progressive Distillation for Dense Retrieval Zhenghao Lin Yeyun Gong Xiao Liu Hang Zhang Chen Lin ... Jian Jiao Jing Lu Daxin Jiang Rangan Majumder Nan Duan 129 27 0 27 Sep 2022
EditEval: An Instruction-Based Benchmark for Text Improvements Jane Dwivedi-Yu Timo Schick Zhengbao Jiang Maria Lomeli Patrick Lewis Gautier Izacard Edouard Grave Sebastian Riedel Fabio Petroni 104 28 0 27 Sep 2022
Deep Generative Multimedia Children's Literature Matthew Lyle Olson 35 0 0 27 Sep 2022
Improving Radiology Report Generation Systems by Removing Hallucinated References to Non-existent Priors Vignav Ramesh Nathan Chi Pranav Rajpurkar MedIm 93 50 0 27 Sep 2022
Local Grammar-Based Coding Revisited L. Debowski 74 0 0 27 Sep 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective Lunjun Zhang Bradly C. Stadie 52 1 0 26 Sep 2022
Lex2Sent: A bagging approach to unsupervised sentiment analysis Kai-Robin Lange Jonas Rieger Carsten Jentsch SSL 32 2 0 26 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints William S. Peebles Ilija Radosavovic Tim Brooks Alexei A. Efros Jitendra Malik UQCV 156 69 0 26 Sep 2022
Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers Nurullah Sevim Ege Ozan Özyedek Furkan Şahinuç Aykut Koç 85 12 0 26 Sep 2022
Do ever larger octopi still amplify reporting biases? Evidence from judgments of typical colour Fangyu Liu Julian Martin Eisenschlos Jeremy R. Cole Nigel Collier 90 4 0 26 Sep 2022
Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts Joel Jang Seonghyeon Ye Minjoon Seo ELM LRM 162 64 0 26 Sep 2022
Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs Ðorðe Miladinovic Kumar Shridhar Kushal Kumar Jain Max B. Paulus J. M. Buhmann Mrinmaya Sachan Carl Allen DRL 97 5 0 26 Sep 2022
Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding Erica K. Shimomoto Edison Marrese-Taylor Hiroya Takamura Ichiro Kobayashi Hideki Nakayama Yusuke Miyao 88 7 0 26 Sep 2022
Entailment Semantics Can Be Extracted from an Ideal Language Model William Merrill Alex Warstadt Tal Linzen 179 14 0 26 Sep 2022
Knowledge Representation for Conceptual, Motivational, and Affective Processes in Natural Language Communication Seng-Beng Ho Zhaoxia Wang B. Quek Min Zhang 36 3 0 26 Sep 2022
News Summarization and Evaluation in the Era of GPT-3 Tanya Goyal Junyi Jessy Li Greg Durrett ELM 124 411 0 26 Sep 2022
Re-contextualizing Fairness in NLP: The Case of India Shaily Bhatt Sunipa Dev Partha P. Talukdar Shachi Dave Vinodkumar Prabhakaran 101 61 0 25 Sep 2022
WinoDict: Probing language models for in-context word acquisition Julian Martin Eisenschlos Jeremy R. Cole Fangyu Liu William W. Cohen KELM 58 13 0 25 Sep 2022
Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity Gabriel Simmons 181 67 0 24 Sep 2022
Interventional Causal Representation Learning Kartik Ahuja Divyat Mahajan Yixin Wang Yoshua Bengio CML 156 95 0 24 Sep 2022
Whodunit? Learning to Contrast for Authorship Attribution Bo Ai Yuchen Wang Yugin Tan Samson Tan SSL 164 19 0 23 Sep 2022
Multiple-Choice Question Generation: Towards an Automated Assessment Framework Vatsal Raina Mark Gales AI4Ed ELM 75 34 0 23 Sep 2022
Promptagator: Few-shot Dense Retrieval From 8 Examples Zhuyun Dai Vincent Zhao Ji Ma Yi Luan Jianmo Ni Jing Lu A. Bakalov Kelvin Guu Keith B. Hall Ming-Wei Chang RALM 104 242 0 23 Sep 2022
Visual representations in the human brain are aligned with large language models Adrien Doerig Tim C Kietzmann Emily J. Allen Yihan Wu Thomas Naselaris Kendrick Norris Kay I. Charest 95 24 0 23 Sep 2022
Best Prompts for Text-to-Image Models and How to Find Them Nikita Pavlichenko Dmitry Ustalov DiffM 85 63 0 23 Sep 2022