Papers citing "Language Models are Few-Shot Learners"

50 / 12,243 papers shown

Title
Neural Text Classification by Jointly Learning to Cluster and Align Yekun Chai Haidong Zhang Shuo Jin 44 2 0 24 Nov 2020
Argument from Old Man's View: Assessing Social Bias in Argumentation Maximilian Spliethover Henning Wachsmuth 54 20 0 24 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark Dayiheng Liu Yu Yan Yeyun Gong Weizhen Qi Hang Zhang ... Jiancheng Lv Ruofei Zhang Winnie Wu Ming Zhou Nan Duan ELM 109 66 0 24 Nov 2020
Language guided machine action Feng Qi LM&Ro 8 0 0 23 Nov 2020
Investigating Emotion-Color Association in Deep Neural Networks Shivi Gupta Shashi Kant Gupta 11 2 0 22 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images R. Child BDL VLM 190 353 0 20 Nov 2020
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks Fanchao Qi Yangyi Chen Mukai Li Yuan Yao Zhiyuan Liu Maosong Sun AAML 109 283 0 20 Nov 2020
ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning Chengming Zhang Geng Yuan Wei Niu Jiannan Tian Sian Jin ... Zhe Jiang Yanzhi Wang Bin Ren Shuaiwen Leon Song Dingwen Tao 3DV 61 1 0 20 Nov 2020
Deep Neural Networks using a Single Neuron: Folded-in-Time Architecture using Feedback-Modulated Delay Loops Florian Stelzer André Röhm Raul Vicente Ingo Fischer University of Tartu AI4CE 73 48 0 19 Nov 2020
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning Zhenda Xie Yutong Lin Zheng Zhang Yue Cao Stephen Lin Han Hu SSL 120 415 0 19 Nov 2020
A Definition and a Test for Human-Level Artificial Intelligence Deokgun Park Md Ashaduzzaman Rubel Mondol Aishwarya Pothula Mazharul Islam VLM 14 4 0 18 Nov 2020
Whale: Efficient Giant Model Training over Heterogeneous GPUs Xianyan Jia Le Jiang Ang Wang Wencong Xiao Ziji Shi ... Lan-yue Chen Yong Li Zhen Zheng Xiaoyong Liu Wei Lin 78 56 0 18 Nov 2020
Do Fine-tuned Commonsense Language Models Really Generalize? Mayank Kejriwal Ke Shen ELM LRM 57 10 0 18 Nov 2020
A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression Sian Jin Guanpeng Li Shuaiwen Leon Song Dingwen Tao AI4CE 65 12 0 18 Nov 2020
A Review of Generalized Zero-Shot Learning Methods Farhad Pourpanah Moloud Abdar Yuxuan Luo Xinlei Zhou Ran Wang C. P. Lim Xizhao Wang Q. M. Jonathan Wu VLM 147 358 0 17 Nov 2020
MGIC: Multigrid-in-Channels Neural Network Architectures Moshe Eliasof Jonathan Ephrath Lars Ruthotto Eran Treister 94 8 0 17 Nov 2020
Learning from Task Descriptions Orion Weller Nicholas Lourie Matt Gardner Matthew E. Peters 113 91 0 16 Nov 2020
A partition-based similarity for classification distributions Hayden S. Helm Ronak D. Mehta Brandon Duderstadt Weiwei Yang Christopher M. White Ali Geisa Joshua T. Vogelstein Carey E. Priebe 34 6 0 12 Nov 2020
Fairness and Robustness in Invariant Learning: A Case Study in Toxicity Classification Robert Adragna Elliot Creager David Madras R. Zemel OOD FaML 80 43 0 12 Nov 2020
Hurricane Forecasting: A Novel Multimodal Machine Learning Framework L. Boussioux C. Zeng Théo Guénais Dimitris Bertsimas 53 40 0 11 Nov 2020
When Do You Need Billions of Words of Pretraining Data? Yian Zhang Alex Warstadt Haau-Sing Li Samuel R. Bowman 62 141 0 10 Nov 2020
Multi-document Summarization via Deep Learning Techniques: A Survey Congbo Ma W. Zhang Mingyu Guo Hu Wang Quan Z. Sheng 125 129 0 10 Nov 2020
An Analysis of Dataset Overlap on Winograd-Style Tasks Ali Emami Adam Trischler Kaheer Suleman Jackie C.K. Cheung 76 22 0 09 Nov 2020
Improving Neural Network Training in Low Dimensional Random Bases Frithjof Gressmann Zach Eaton-Rosen Carlo Luschi 78 28 0 09 Nov 2020
Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads Zhengyan Zhang Fanchao Qi Zhiyuan Liu Qun Liu Maosong Sun VLM 86 31 0 07 Nov 2020
Exploring the limits of Concurrency in ML Training on Google TPUs Sameer Kumar James Bradbury C. Young Yu Emma Wang Anselm Levskaya ... Tao Wang Tayo Oguntebi Yazhou Zu Yuanzhong Xu Andy Swing BDL AIMat MoE LRM 64 27 0 07 Nov 2020
Machine Generation and Detection of Arabic Manipulated and Fake News El Moatez Billah Nagoudi AbdelRahim Elmadany Muhammad Abdul-Mageed Tariq Alhindi H. Cavusoglu DeLMO 89 52 0 05 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation Chunting Zhou Graham Neubig Jiatao Gu Mona T. Diab P. Guzmán Luke Zettlemoyer Marjan Ghazvininejad HILM 133 200 0 05 Nov 2020
Rearrangement: A Challenge for Embodied AI Dhruv Batra Angel X. Chang Sonia Chernova Andrew J. Davison Jia Deng ... Jitendra Malik Igor Mordatch Roozbeh Mottaghi Manolis Savva Hao Su LM&Ro 114 220 0 03 Nov 2020
Automatic Detection of Machine Generated Text: A Critical Survey Ganesh Jawahar Muhammad Abdul-Mageed L. Lakshmanan DeLMO 81 239 0 02 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation Yaoyiran Li Edoardo Ponti Ivan Vulić Anna Korhonen 106 19 0 02 Nov 2020
Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour Arissa Wongpanich Hieu H. Pham J. Demmel Mingxing Tan Quoc V. Le Yang You Sameer Kumar 78 8 0 30 Oct 2020
A New Neural Search and Insights Platform for Navigating and Organizing AI Research Marzieh Fadaee Olga Gureenkova Fernando Rejon Barrera Carsten Schnober W. Weerkamp Jakub Zavrel 43 7 0 30 Oct 2020
Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach Ahmadreza Mosallanezhad Kai Shu Huan Liu 48 10 0 30 Oct 2020
AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts Taylor Shin Yasaman Razeghi Robert L Logan IV Eric Wallace Sameer Singh KELM 108 407 0 29 Oct 2020
Melody-Conditioned Lyrics Generation with SeqGANs Yihao Chen Alexander Lerch GAN MGen 82 29 0 28 Oct 2020
Scaling Laws for Autoregressive Generative Modeling T. Henighan Jared Kaplan Mor Katz Mark Chen Christopher Hesse ... Nick Ryder Daniel M. Ziegler John Schulman Dario Amodei Sam McCandlish 121 433 0 28 Oct 2020
Predicting Themes within Complex Unstructured Texts: A Case Study on Safeguarding Reports A. Edwards David Rogers Jose Camacho-Collados Hélène de Ribaupierre Alun D. Preece 72 1 0 27 Oct 2020
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks Jianfei Chen Yujie Gai Z. Yao Michael W. Mahoney Joseph E. Gonzalez MQ 68 59 0 27 Oct 2020
Out-of-core Training for Extremely Large-Scale Neural Networks With Adaptive Window-Based Scheduling Akio Hayakawa T. Narihira 26 4 0 27 Oct 2020
Dutch Humor Detection by Generating Negative Examples Thomas Winters Pieter Delobelle 115 11 0 26 Oct 2020
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification Timo Schick Helmut Schmid Hinrich Schütze VLM 92 208 0 26 Oct 2020
Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping Minjia Zhang Yuxiong He AI4CE 48 104 0 26 Oct 2020
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models Benjamin Muller Antonis Anastasopoulos Benoît Sagot Djamé Seddah LRM 209 170 0 24 Oct 2020
Text Editing by Command Felix Faltings Michel Galley Gerold Hintz Chris Brockett Chris Quirk Jianfeng Gao Bill Dolan KELM 207 38 0 24 Oct 2020
Rethinking embedding coupling in pre-trained language models Hyung Won Chung Thibault Févry Henry Tsai Melvin Johnson Sebastian Ruder 172 143 0 24 Oct 2020
Pre-training Text-to-Text Transformers for Concept-centric Common Sense Wangchunshu Zhou Dong-Ho Lee Ravi Kiran Selvam Seyeon Lee Bill Yuchen Lin Xiang Ren LRM VLM 55 72 0 24 Oct 2020
Improving Multilingual Models with Language-Clustered Vocabularies Hyung Won Chung Dan Garrette Kiat Chuan Tan Jason Riesa VLM 129 65 0 24 Oct 2020
Text Style Transfer: A Review and Experimental Evaluation Zhiqiang Hu Roy Ka-wei Lee Charu C. Aggarwal Aston Zhang AI4TS 126 27 0 24 Oct 2020
An Evaluation Protocol for Generative Conversational Systems Seolhwa Lee Heuiseok Lim João Sedoc ELM 80 10 0 24 Oct 2020