v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown

Title
QuestEval: Summarization Asks for Fact-based Evaluation Thomas Scialom Paul-Alexis Dray Patrick Gallinari Sylvain Lamprier Benjamin Piwowarski Jacopo Staiano Alex Jinpeng Wang HILM 78 276 0 23 Mar 2021
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection Jan Philip Wahle Terry Ruas Norman Meuschke Bela Gipp 119 34 0 23 Mar 2021
Detecting Hate Speech with GPT-3 Ke-Li Chiu Annie Collins Rohan Alexander AILaw 106 114 0 23 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA Jialin Wu Jiasen Lu Ashish Sabharwal Roozbeh Mottaghi 164 146 0 23 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge David Elliott Carlos E. Otero Steven Wyatt Evan Martino 81 16 0 22 Mar 2021
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets Julia Kreutzer Isaac Caswell Lisa Wang Ahsan Wahab D. Esch ... Duygu Ataman Orevaoghene Ahia Oghenefego Ahia Sweta Agrawal Mofetoluwa Adeyemi 72 280 0 22 Mar 2021
Improving and Simplifying Pattern Exploiting Training Derek Tam Rakesh R Menon Joey Tianyi Zhou Shashank Srivastava Colin Raffel 78 151 0 22 Mar 2021
AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization Tiezheng Yu Zihan Liu Pascale Fung CLL 107 81 0 21 Mar 2021
Local Interpretations for Explainable Natural Language Processing: A Survey Siwen Luo Hamish Ivison S. Han Josiah Poon MILM 120 51 0 20 Mar 2021
Attribute Alignment: Controlling Text Generation from Pre-trained Language Models Dian Yu Zhou Yu Kenji Sagae 82 39 0 20 Mar 2021
GPT Understands, Too Xiao Liu Yanan Zheng Zhengxiao Du Ming Ding Yujie Qian Zhilin Yang Jie Tang VLM 184 1,185 0 18 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank Infilling Zhengxiao Du Yujie Qian Xiao Liu Ming Ding J. Qiu Zhilin Yang Jie Tang BDL AI4CE 162 1,565 0 18 Mar 2021
Structure Inducing Pre-Training Matthew B. A. McDermott Brendan Yap Peter Szolovits Marinka Zitnik 95 21 0 18 Mar 2021
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning Mandela Patrick Yuki M. Asano Bernie Huang Ishan Misra Florian Metze Joao Henriques Andrea Vedaldi AI4TS 96 35 0 18 Mar 2021
Structural Adapters in Pretrained Language Models for AMR-to-text Generation Leonardo F. R. Ribeiro Yue Zhang Iryna Gurevych 100 72 0 16 Mar 2021
Robustly Optimized and Distilled Training for Natural Language Understanding Haytham ElFadeel Stanislav Peshterliev VLM OffRL 37 1 0 16 Mar 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval Siqi Sun Yen-Chun Chen Linjie Li Shuohang Wang Yuwei Fang Jingjing Liu VLM 89 84 0 16 Mar 2021
Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence Tal Schuster Adam Fisch Regina Barzilay 114 239 0 15 Mar 2021
How Many Data Points is a Prompt Worth? Teven Le Scao Alexander M. Rush VLM 205 303 0 15 Mar 2021
Membership Inference Attacks on Machine Learning: A Survey Hongsheng Hu Z. Salcic Lichao Sun Gillian Dobbie Philip S. Yu Xuyun Zhang MIACV 125 446 0 14 Mar 2021
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels Chenliang Li Ming Yan Haiyang Xu Fuli Luo Wei Wang Bin Bi Songfang Huang VLM 74 36 0 14 Mar 2021
Constrained Text Generation with Global Guidance -- Case Study on CommonGen Yixian Liu Liwen Zhang Wenjuan Han Yue Zhang Kewei Tu 87 10 0 12 Mar 2021
Inductive Relation Prediction by BERT H. Zha Zhiyu Zoey Chen Xifeng Yan 146 58 0 12 Mar 2021
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation J. Clark Dan Garrette Iulia Turc John Wieting 117 224 0 11 Mar 2021
Conversational Answer Generation and Factuality for Reading Comprehension Question-Answering Stanislav Peshterliev Barlas Oğuz Debojeet Chatterjee Hakan Inan Vikas Bhardwaj 39 4 0 11 Mar 2021
Full Page Handwriting Recognition via Image to Sequence Extraction Sumeet S. Singh Sergey Karayev 81 55 0 11 Mar 2021
Hurdles to Progress in Long-form Question Answering Kalpesh Krishna Aurko Roy Mohit Iyyer 72 200 0 10 Mar 2021
Pretrained Transformers as Universal Computation Engines Kevin Lu Aditya Grover Pieter Abbeel Igor Mordatch 81 221 0 09 Mar 2021
Self-supervised Regularization for Text Classification Meng Zhou Zechen Li P. Xie 60 16 0 09 Mar 2021
Domain Controlled Title Generation with Human Evaluation Abdul Waheed Muskan Goyal Nimisha Mittal D. Gupta 21 2 0 08 Mar 2021
Empathetic BERT2BERT Conversational Model: Learning Arabic Language Generation with Little Data Tarek Naous Wissam Antoun Reem A. Mahmoud Hazem M. Hajj 80 18 0 07 Mar 2021
Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees Jiangang Bai Yujing Wang Yiren Chen Yaming Yang Jing Bai Jiahao Yu Yunhai Tong 88 104 0 07 Mar 2021
Measuring Mathematical Problem Solving With the MATH Dataset Dan Hendrycks Collin Burns Saurav Kadavath Akul Arora Steven Basart Eric Tang Basel Alomair Jacob Steinhardt ReLM FaML 233 2,414 0 05 Mar 2021
Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth Yihe Dong Jean-Baptiste Cordonnier Andreas Loukas 163 388 0 05 Mar 2021
A Systematic Evaluation of Transfer Learning and Pseudo-labeling with BERT-based Ranking Models Iurii Mokrii Leonid Boytsov Pavel Braslavski 90 26 0 04 Mar 2021
Self-supervised Pretraining of Visual Features in the Wild Priya Goyal Mathilde Caron Benjamin Lefaudeux Min Xu Pengchao Wang ... Mannat Singh Vitaliy Liptchinsky Ishan Misra Armand Joulin Piotr Bojanowski VLM SSL 98 274 0 02 Mar 2021
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning Krishna Srinivasan K. Raman Jiecao Chen Michael Bendersky Marc Najork VLM 286 322 0 02 Mar 2021
Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language Avia Efrat Uri Shaham D. Kilman Omer Levy ELM 68 18 0 01 Mar 2021
OmniNet: Omnidirectional Representations from Transformers Yi Tay Mostafa Dehghani V. Aribandi Jai Gupta Philip Pham Zhen Qin Dara Bahri Da-Cheng Juan Donald Metzler 113 30 0 01 Mar 2021
Token-Modification Adversarial Attacks for Natural Language Processing: A Survey Tom Roth Yansong Gao A. Abuadbba Surya Nepal Wei Liu AAML 110 12 0 01 Mar 2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP Timo Schick Sahana Udupa Hinrich Schütze 321 388 0 28 Feb 2021
Generative Chemical Transformer: Neural Machine Learning of Molecular Geometric Structures from Chemical Language via Attention Hyunseung Kim Jonggeol Na Won Bo Lee 84 47 0 27 Feb 2021
Learning Transferable Visual Models From Natural Language Supervision Alec Radford Jong Wook Kim Chris Hallacy Aditya A. Ramesh Gabriel Goh ... Amanda Askell Pamela Mishkin Jack Clark Gretchen Krueger Ilya Sutskever CLIP VLM 1.1K 30,053 0 26 Feb 2021
LazyTensor: combining eager execution with domain-specific compilers Alex Suhan Davide Libenzi Ailing Zhang Parker Schuh Brennan Saeta Jie Young Sohn Denys Shabalin 47 18 0 26 Feb 2021
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning Nasi Jofche Kostadin Mishev Riste Stojanov Milos Jovanovik D. Trajanov 56 18 0 25 Feb 2021
Investigating the Limitations of Transformers with Simple Arithmetic Tasks Rodrigo Nogueira Zhiying Jiang Jimmy J. Li LRM 122 130 0 25 Feb 2021
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives Nils Rethmeier Isabelle Augenstein SSL VLM 159 94 0 25 Feb 2021
LazyFormer: Self Attention with Lazy Update Chengxuan Ying Guolin Ke Di He Tie-Yan Liu 79 16 0 25 Feb 2021
OneStop QAMaker: Extract Question-Answer Pairs from Text in a One-Stop Approach Shaobo Cui Xintong Bao Xinxing Zu Yangyang Guo Zhongzhou Zhao Ji Zhang Haiqing Chen RALM 52 15 0 24 Feb 2021
Do Transformer Modifications Transfer Across Implementations and Applications? Sharan Narang Hyung Won Chung Yi Tay W. Fedus Thibault Févry ... Wei Li Nan Ding Jake Marcus Adam Roberts Colin Raffel 100 128 0 23 Feb 2021