v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown

Title
Entity-Based Knowledge Conflicts in Question Answering Shayne Longpre Kartik Perisetla Anthony Chen Nikhil Ramesh Chris DuBois Sameer Singh HILM 340 264 0 10 Sep 2021
Controlled Neural Sentence-Level Reframing of News Articles Wei-Fan Chen Khalid Al Khatib Benno Stein Henning Wachsmuth 70 13 0 10 Sep 2021
Does Pretraining for Summarization Require Knowledge Transfer? Kundan Krishna Jeffrey P. Bigham Zachary Chase Lipton 73 39 0 10 Sep 2021
Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding Shane Storks Qiaozi Gao Yichi Zhang J. Chai ReLM LRM 108 23 0 10 Sep 2021
Beyond the Tip of the Iceberg: Assessing Coherence of Text Classifiers Shane Storks J. Chai 94 7 0 10 Sep 2021
Document-level Entity-based Extraction as Template Generation Kung-Hsiang Huang Sam Tang Nanyun Peng 52 54 0 10 Sep 2021
Zero-Shot Dialogue State Tracking via Cross-Task Transfer Zhaojiang Lin Bing-Quan Liu Andrea Madotto Seungwhan Moon Paul A. Crook ... Zhiguang Wang Zhou Yu Eunjoon Cho R. Subba Pascale Fung 87 74 0 10 Sep 2021
What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers Boseop Kim Hyoungseok Kim Sang-Woo Lee Gichang Lee Donghyun Kwak ... Jaewook Kang Inho Kang Jung-Woo Ha W. Park Nako Sung VLM 292 124 0 10 Sep 2021
CINS: Comprehensive Instruction for Few-shot Learning in Task-oriented Dialog Systems Fei Mi Yitong Li Yasheng Wang Xin Jiang Qun Liu 97 43 0 10 Sep 2021
TIAGE: A Benchmark for Topic-Shift Aware Dialog Modeling Huiyuan Xie Zhenghao Liu Chenyan Xiong Zhiyuan Liu Ann A. Copestake VLM 52 27 0 09 Sep 2021
TxT: Crossmodal End-to-End Learning with Transformers Jan-Martin O. Steitz Jonas Pfeiffer Iryna Gurevych Stefan Roth LRM 29 2 0 09 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning Yangyi Chen Jingtong Su Wei Wei AAML 52 33 0 09 Sep 2021
PPT: Pre-trained Prompt Tuning for Few-shot Learning Yuxian Gu Xu Han Zhiyuan Liu Minlie Huang VLM 159 420 0 09 Sep 2021
Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data Massimo Nicosia Zhongdi Qu Yasemin Altun 76 26 0 09 Sep 2021
MetaXT: Meta Cross-Task Transfer between Disparate Label Spaces Srinagesh Sharma Guoqing Zheng Ahmed Hassan Awadallah 50 1 0 09 Sep 2021
KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs Yinquan Lu H. Lu Guirong Fu Qun Liu KELM 46 34 0 09 Sep 2021
Weakly-Supervised Visual-Retriever-Reader for Knowledge-based Question Answering Man Luo Yankai Zeng Pratyay Banerjee Chitta Baral RALM 131 66 0 09 Sep 2021
What's Hidden in a One-layer Randomly Weighted Transformer? Sheng Shen Z. Yao Douwe Kiela Kurt Keutzer Michael W. Mahoney 50 4 0 08 Sep 2021
Retrieve, Caption, Generate: Visual Grounding for Enhancing Commonsense in Text Generation Models Steven Y. Feng Kevin Lu Zhuofu Tao Malihe Alikhani Teruko Mitamura Eduard H. Hovy Varun Gangal LRM 79 13 0 08 Sep 2021
Sparsity and Sentence Structure in Encoder-Decoder Attention of Summarization Systems Potsawee Manakul Mark Gales 64 5 0 08 Sep 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods Stephanie C. Lin Jacob Hilton Owain Evans HILM 151 1,953 0 08 Sep 2021
Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories David Wilmot Frank Keller RALM KELM 78 21 0 08 Sep 2021
Label Verbalization and Entailment for Effective Zero- and Few-Shot Relation Extraction Oscar Sainz Oier López de Lacalle Gorka Labaka Ander Barrena Eneko Agirre 54 126 0 08 Sep 2021
Discrete and Soft Prompting for Multilingual Models Mengjie Zhao Hinrich Schütze LRM 92 72 0 08 Sep 2021
NSP-BERT: A Prompt-based Few-Shot Learner Through an Original Pre-training Task--Next Sentence Prediction Yi Sun Yu Zheng Chao Hao Hangping Qiu VLM 107 37 0 08 Sep 2021
R2-D2: A Modular Baseline for Open-Domain Question Answering Martin Fajcik Martin Docekal Karel Ondrej Pavel Smrz 72 47 0 08 Sep 2021
Sequence Level Contrastive Learning for Text Summarization Shusheng Xu Xingxing Zhang Yi Wu Furu Wei 117 98 0 08 Sep 2021
On the Challenges of Evaluating Compositional Explanations in Multi-Hop Inference: Relevance, Completeness, and Expert Ratings Peter Alexander Jansen Kelly Smith Dan Moreno Huitzilin Ortiz CoGe ReLM LRM 72 9 0 07 Sep 2021
Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression Canwen Xu Wangchunshu Zhou Tao Ge Kelvin J. Xu Julian McAuley Furu Wei 73 42 0 07 Sep 2021
Aspect-Controllable Opinion Summarization Reinald Kim Amplayo Stefanos Angelidis Mirella Lapata 69 75 0 07 Sep 2021
How much pretraining data do language models need to learn syntax? Laura Pérez-Mayos Miguel Ballesteros Leo Wanner 55 32 0 07 Sep 2021
Generate & Rank: A Multi-task Framework for Math Word Problems Jianhao Shen Yichun Yin Lin Li Lifeng Shang Xin Jiang Ming Zhang Qun Liu AIMat 87 133 0 07 Sep 2021
Datasets: A Community Library for Natural Language Processing Quentin Lhoest Albert Villanova del Moral Yacine Jernite A. Thakur Patrick von Platen ... Thibault Goehringer Victor Mustar François Lagunas Alexander M. Rush Thomas Wolf 266 614 0 07 Sep 2021
Text-to-Table: A New Way of Information Extraction Xueqing Wu Jiacheng Zhang Hang Li LMTD 88 57 0 06 Sep 2021
General-Purpose Question-Answering with Macaw Oyvind Tafjord Peter Clark SyDa ELM MLLM 83 60 0 06 Sep 2021
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization Tiezheng Yu Wenliang Dai Zihan Liu Pascale Fung 105 74 0 06 Sep 2021
PermuteFormer: Efficient Relative Position Encoding for Long Sequences Peng-Jen Chen 93 21 0 06 Sep 2021
Modular Framework for Visuomotor Language Grounding Kolby Nottingham Litian Liang Daeyun Shin Charless C. Fowlkes Roy Fox Sameer Singh 81 12 0 05 Sep 2021
SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks Wanyu Du Yangfeng Ji AI4CE 47 7 0 05 Sep 2021
FewshotQA: A simple framework for few-shot learning of question answering tasks using pre-trained text-to-text models Rakesh Chada P. Natarajan 89 46 0 04 Sep 2021
CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge Yasumasa Onoe Michael J.Q. Zhang Eunsol Choi Greg Durrett HILM 87 87 0 03 Sep 2021
Finetuned Language Models Are Zero-Shot Learners Jason W. Wei Maarten Bosma Vincent Zhao Kelvin Guu Adams Wei Yu Brian Lester Nan Du Andrew M. Dai Quoc V. Le ALM UQCV 326 3,806 0 03 Sep 2021
Biomedical Data-to-Text Generation via Fine-Tuning Transformers Ruslan Yermakov Nicholas Drago Angelo Ziletti MedIm 58 13 0 03 Sep 2021
Do Prompt-Based Models Really Understand the Meaning of their Prompts? Albert Webson Ellie Pavlick LRM 136 374 0 02 Sep 2021
MultiEURLEX -- A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer Ilias Chalkidis Manos Fergadiotis Ion Androutsopoulos AILaw 104 111 0 02 Sep 2021
CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation Yue Wang Weishi Wang Shafiq Joty Guosheng Lin 370 1,610 0 02 Sep 2021
Survey of Low-Resource Machine Translation Barry Haddow Rachel Bawden Antonio Valerio Miceli Barone Jindvrich Helcl Alexandra Birch AIMat 118 163 0 01 Sep 2021
Boosting Search Engines with Interactive Agents Leonard Adolphs Benjamin Boerschinger Christian Buck Michelle Chen Huebscher Massimiliano Ciaramita ... Thomas Hofmann Yannic Kilcher Sascha Rothe Pier Giuseppe Sessa Lierni Sestorain Saralegui LLMAG 139 24 0 01 Sep 2021
It's not Rocket Science : Interpreting Figurative Language in Narratives Tuhin Chakrabarty Yejin Choi Vered Shwartz 97 58 0 31 Aug 2021
Effective Sequence-to-Sequence Dialogue State Tracking Jeffrey Zhao Mahdis Mahdieh Ye Zhang Yuan Cao Yonghui Wu 134 42 0 31 Aug 2021