Papers citing "Language Models are Few-Shot Learners"

50 / 12,355 papers shown

Efficient Quantized Sparse Matrix Operations on Tensor Cores Shigang Li Kazuki Osawa Torsten Hoefler 160 32 0 14 Sep 2022
Out of One, Many: Using Language Models to Simulate Human Samples Lisa P. Argyle Ethan C. Busby Nancy Fulda Joshua R Gubler Christopher Rytting David Wingate SyDa 105 615 0 14 Sep 2022
PaLI: A Jointly-Scaled Multilingual Language-Image Model Xi Chen Tianlin Li Soravit Changpinyo A. Piergiovanni Piotr Padlewski ... Andreas Steiner A. Angelova Xiaohua Zhai N. Houlsby Radu Soricut MLLM VLM 205 741 0 14 Sep 2022
vec2text with Round-Trip Translations Geoffrey Cideron Sertan Girgin Anton Raichuk Olivier Pietquin Olivier Bachem Léonard Hussenot 91 3 0 14 Sep 2022
Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models Suhyune Son Chanjun Park Jungseob Lee Midan Shim Chanhee Lee Yoonna Jang Jaehyung Seo Heu-Jeoung Lim 71 0 0 14 Sep 2022
Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance Anna Gottardi Osman Ipek Giuseppe Castellucci Shui Hu Lavina Vaz ... Oleg Rokhlenko Kate Bland Eugene Agichtein R. Ghanadan Y. Maarek 84 23 0 13 Sep 2022
Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest Jack Hessel Ana Marasović Jena D. Hwang Lillian Lee Jeff Da Rowan Zellers Robert Mankoff Yejin Choi VLM 112 91 0 13 Sep 2022
Improving Language Model Prompting in Support of Semi-autonomous Task Learning James R. Kirk R. Wray Peter Lindes John E. Laird LRM 64 11 0 13 Sep 2022
Revisiting Neural Scaling Laws in Language and Vision Ibrahim Alabdulmohsin Behnam Neyshabur Xiaohua Zhai 235 111 0 13 Sep 2022
Vision Transformers for Action Recognition: A Survey Anwaar Ulhaq Naveed Akhtar Ganna Pogrebna Ajmal Mian ViT 82 45 0 13 Sep 2022
Rule-adhering synthetic data -- the lingua franca of learning Michaela D. Platzer Ivona Krchova 119 2 0 12 Sep 2022
An Embedding-Based Grocery Search Model at Instacart Yuqing Xie Taesik Na X. Xiao Saurav Manchanda Young Rao Zhihong Xu Guanghua Shu Esther Vasiete Tejaswi Tenneti Haixun Wang DML RALM 56 6 0 12 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation Mohit Shridhar Lucas Manuelli Dieter Fox LM&Ro 286 501 0 12 Sep 2022
FP8 Formats for Deep Learning Paulius Micikevicius Dusan Stosic N. Burgess Marius Cornea Pradeep Dubey ... Naveen Mellempudi S. Oberman Mohammad Shoeybi Michael Siu Hao Wu BDL VLM MQ 156 141 0 12 Sep 2022
Factual and Informative Review Generation for Explainable Recommendation Zhouhang Xie Sameer Singh Julian McAuley Bodhisattwa Prasad Majumder 104 26 0 12 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe Hongyang Li Chonghao Sima Jifeng Dai Wenhai Wang Lewei Lu ... Xiaosong Jia Siqian Liu Jianping Shi Dahua Lin Yu Qiao 172 150 0 12 Sep 2022
GenLoco: Generalized Locomotion Controllers for Quadrupedal Robots Gilbert Feng Hongbo Zhang Zhongyu Li Xue Bin Peng Bhuvan Basireddy ... Zhitao Song Lizhi Yang Yunhui Liu Koushil Sreenath Sergey Levine 156 67 0 12 Sep 2022
Open-Domain Dialog Evaluation using Follow-Ups Likelihood Maxime De Bruyn Ehsan Lotfi Jeska Buhmann Walter Daelemans 91 9 0 12 Sep 2022
Knowledge Base Question Answering: A Semantic Parsing Perspective Yu Gu Vardaan Pahuja Gong Cheng Yu-Chuan Su 120 29 0 12 Sep 2022
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition Thanh-Dat Truong C. Duong Ngan Le Marios Savvides Khoa Luu CVBM 103 9 0 11 Sep 2022
On The Computational Complexity of Self-Attention Feyza Duman Keles Pruthuvi Maheshakya Wijewardena Chinmay Hegde 142 130 0 11 Sep 2022
Structured Q-learning For Antibody Design Alexander I. Cowen-Rivers P. Gorinski Aivar Sootla Asif R. Khan Liu Furui Jun Wang Jan Peters H. Ammar OffRL OnRL 89 3 0 10 Sep 2022
Improved Masked Image Generation with Token-Critic José Lezama Huiwen Chang Lu Jiang Irfan Essa DiffM 248 48 0 09 Sep 2022
Automatic Readability Assessment of German Sentences with Transformer Ensembles Patrick Gustav Blaneck Tobias Bornheim Niklas Grieger Stephan Bialonski 74 10 0 09 Sep 2022
Fast Neural Kernel Embeddings for General Activations Insu Han A. Zandieh Jaehoon Lee Roman Novak Lechao Xiao Amin Karbasi 120 19 0 09 Sep 2022
Vision for Bosnia and Herzegovina in Artificial Intelligence Age: Global Trends, Potential Opportunities, Selected Use-cases and Realistic Goals Zlatan Ajanović E. Alickovic Aida Brankovic Sead Delalic Eldar Kurtic S. Malikić Adnan Mehonic Hamza Merzic Kenan Sehic Bahrudin Trbalic 88 0 0 08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases Rohan Taori Tatsunori B. Hashimoto 124 48 0 08 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation Yile Wang Linyi Yang Zhiyang Teng M. Zhou Yue Zhang GNN 81 1 0 08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications Amit Alfassy Assaf Arbelle Oshri Halimi Sivan Harary Roei Herzig ... Christoph Auer Kate Saenko Peter W. J. Staar Rogerio Feris Leonid Karlinsky 90 20 0 08 Sep 2022
What does a platypus look like? Generating customized prompts for zero-shot image classification Sarah M Pratt Ian Covert Rosanne Liu Ali Farhadi VLM 189 224 0 07 Sep 2022
AutoPruner: Transformer-Based Call Graph Pruning Thanh Le-Cong Hong Jin Kang Truong-Giang Nguyen S. A. Haryono David Lo X. Le H. Thang 66 20 0 07 Sep 2022
On the Effectiveness of Compact Biomedical Transformers Omid Rohanian Mohammadmahdi Nouriborji Samaneh Kouchaki David Clifton MedIm 87 31 0 07 Sep 2022
AudioLM: a Language Modeling Approach to Audio Generation Zalan Borsos Raphaël Marinier Damien Vincent Eugene Kharitonov Olivier Pietquin ... Dominik Roblek O. Teboul David Grangier Marco Tagliasacchi Neil Zeghidour AuLLM 163 616 0 07 Sep 2022
SynSciPass: detecting appropriate uses of scientific text generation Domenic Rosati DeLMO 126 19 0 07 Sep 2022
Studying Bias in GANs through the Lens of Race V. Maluleke Neerja Thakkar Tim Brooks Ethan Weber Trevor Darrell Alexei A. Efros Angjoo Kanazawa Devin Guillory 107 36 0 06 Sep 2022
Explaining Machine Learning Models in Natural Conversations: Towards a Conversational XAI Agent Van Bach Nguyen Jorg Schlotterer C. Seifert AILaw 38 12 0 06 Sep 2022
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models Jiangsu Du Ziming Liu Jiarui Fang Shenggui Li Yongbin Li Yutong Lu Yang You MoE 52 4 0 06 Sep 2022
Few-Shot Document-Level Event Argument Extraction Xianjun Yang Yujie Lu Linda R. Petzold 69 16 0 06 Sep 2022
Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples Hezekiah J. Branch Jonathan Rodriguez Cefalu Jeremy McHugh Leyla Hujer Aditya Bahl Daniel del Castillo Iglesias Ron Heichman Ramesh Darwishi ELM SILM AAML 70 56 0 05 Sep 2022
Selective Annotation Makes Language Models Better Few-Shot Learners Hongjin Su Jungo Kasai Chen Henry Wu Weijia Shi Tianlu Wang ... Rui Zhang Mari Ostendorf Luke Zettlemoyer Noah A. Smith Tao Yu 118 262 0 05 Sep 2022
A Review of Sparse Expert Models in Deep Learning W. Fedus J. Dean Barret Zoph MoE 129 154 0 04 Sep 2022
The Effectiveness of Bidirectional Generative Patent Language Models Jieh-Sheng Lee 48 1 0 04 Sep 2022
Do Large Language Models know what humans know? Sean Trott Cameron J. Jones Tyler A. Chang J. Michaelov Benjamin Bergen 74 97 0 04 Sep 2022
How to Prompt? Opportunities and Challenges of Zero- and Few-Shot Learning for Human-AI Interaction in Creative Applications of Generative Models Hai Dang Lukas Mecke Florian Lehmann Sven Goller Daniel Buschek 74 102 0 03 Sep 2022
HammingMesh: A Network Topology for Large-Scale Deep Learning Torsten Hoefler Tommaso Bonato Daniele De Sensi Salvatore Di Girolamo Shigang Li Marco Heddes Jon Belk Deepak Goel Miguel Castro Steve Scott 3DH GNN AI4CE 79 23 0 03 Sep 2022
TransPolymer: a Transformer-based language model for polymer property predictions Changwen Xu Yuyang Wang A. Farimani 105 93 0 03 Sep 2022
Petals: Collaborative Inference and Fine-tuning of Large Models Alexander Borzunov Dmitry Baranchuk Tim Dettmers Max Ryabinin Younes Belkada Artem Chumachenko Pavel Samygin Colin Raffel VLM 116 67 0 02 Sep 2022
INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations Jialin Yu Alexandra I. Cristea Anoushka Harit Zhongtian Sun O. Aduragba Lei Shi Noura Al Moubayed 74 10 0 02 Sep 2022
Multi-Modal Experience Inspired AI Creation Qian Cao Xu Chen Ruihua Song Hao Jiang Guangyan Yang Bo Zhao 68 3 0 02 Sep 2022
IMG2IMU: Translating Knowledge from Large-Scale Images to IMU Sensing Applications Hyungjun Yoon Hyeong-Tae Cha Hoang C. Nguyen Taesik Gong Sungyeop Lee VLM SSL 106 1 0 02 Sep 2022