v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown

Title
Scalable and Efficient MoE Training for Multitask Multilingual Models Young Jin Kim A. A. Awan Alexandre Muzio Andres Felipe Cruz Salinas Liyang Lu Amr Hendy Samyam Rajbhandari Yuxiong He Hany Awadalla MoE 148 85 0 22 Sep 2021
RETRONLU: Retrieval Augmented Task-Oriented Semantic Parsing Vivek Gupta Akshat Shrivastava Adithya Sagar Armen Aghajanyan Denis Savenkov RALM 87 23 0 21 Sep 2021
Relation-Guided Pre-Training for Open-Domain Question Answering Ziniu Hu Yizhou Sun Kai-Wei Chang RALM OnRL 73 6 0 21 Sep 2021
Knowledge Distillation with Noisy Labels for Natural Language Understanding Shivendra Bhardwaj Abbas Ghaddar Ahmad Rashid Khalil Bibi Cheng-huan Li A. Ghodsi Philippe Langlais Mehdi Rezagholizadeh 53 1 0 21 Sep 2021
ConvFiT: Conversational Fine-Tuning of Pretrained Language Models Ivan Vulić Pei-hao Su Sam Coope D. Gerz Paweł Budzianowski I. Casanueva Nikola Mrkvsić Tsung-Hsien Wen 100 37 0 21 Sep 2021
A Plug-and-Play Method for Controlled Text Generation Damian Pascual Béni Egressy Clara Meister Ryan Cotterell Roger Wattenhofer 130 94 0 20 Sep 2021
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese Nguyen Luong Tran Duong Minh Le Dat Quoc Nguyen 70 55 0 20 Sep 2021
PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation Siqi Bao H. He Fan Wang Hua Wu Haifeng Wang ... Xinxian Huang Xin Tian Xinchao Xu Yingzhan Lin Zhengyu Niu VLM ALM 81 63 0 20 Sep 2021
Towards Zero-Label Language Learning Zirui Wang Adams Wei Yu Orhan Firat Yuan Cao SyDa 246 105 0 19 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview Shijie Chen Yu Zhang Qiang Yang AIMat 145 113 0 19 Sep 2021
Text Detoxification using Large Pre-trained Neural Models David Dale Anton Voronov Daryna Dementieva V. Logacheva Olga Kozlova Nikita Semenov Alexander Panchenko 124 74 0 18 Sep 2021
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering Xi Ye Semih Yavuz Kazuma Hashimoto Yingbo Zhou Caiming Xiong 222 148 0 17 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling David R. So Wojciech Mañke Hanxiao Liu Zihang Dai Noam M. Shazeer Quoc V. Le VLM 277 156 0 17 Sep 2021
Hierarchy-Aware T5 with Path-Adaptive Mask Mechanism for Hierarchical Text Classification Wei Huang Chen Liu Yihua Zhao Xinyun Yang Zhaoming Pan Zhimin Zhang Guiquan Liu 41 2 0 17 Sep 2021
Exploring Multitask Learning for Low-Resource AbstractiveSummarization Ahmed Magooda Mohamed S. Elaraby Diane Litman 73 11 0 17 Sep 2021
Task-adaptive Pre-training of Language Models with Word Embedding Regularization Kosuke Nishida Kyosuke Nishida Sen Yoshida VLM 94 8 0 17 Sep 2021
Language Models as a Knowledge Source for Cognitive Agents R. Wray James R. Kirk John E. Laird 57 15 0 17 Sep 2021
Pre-trained Gaussian processes for Bayesian optimization Zehao Wang George E. Dahl Kevin Swersky Chansoo Lee Zachary Nado Justin Gilmer Jasper Snoek Zoubin Ghahramani 151 46 0 16 Sep 2021
Phrase Retrieval Learns Passage Retrieval, Too Jinhyuk Lee Alexander Wettig Danqi Chen RALM DML 82 48 0 16 Sep 2021
Does External Knowledge Help Explainable Natural Language Inference? Automatic Evaluation vs. Human Ratings Hendrik Schuff Hsiu-yu Yang Heike Adel Ngoc Thang Vu ELM ReLM LRM 62 13 0 16 Sep 2021
Scaling Laws for Neural Machine Translation Behrooz Ghorbani Orhan Firat Markus Freitag Ankur Bapna M. Krikun Xavier Garcia Ciprian Chelba Colin Cherry 90 103 0 16 Sep 2021
Language Models are Few-shot Multilingual Learners Genta Indra Winata Andrea Madotto Zhaojiang Lin Rosanne Liu J. Yosinski Pascale Fung ELM LRM 115 138 0 16 Sep 2021
On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation Dan Iter David Grangier 92 10 0 15 Sep 2021
Dialogue State Tracking with a Language Model using Schema-Driven Prompting Chia-Hsuan Lee Hao Cheng Mari Ostendorf 102 132 0 15 Sep 2021
Challenges in Detoxifying Language Models Johannes Welbl Amelia Glaese J. Uesato Sumanth Dathathri John F. J. Mellor Lisa Anne Hendricks Kirsty Anderson Pushmeet Kohli Ben Coppin Po-Sen Huang LM&MA 313 196 0 15 Sep 2021
Topic Transferable Table Question Answering Saneem A. Chemmengath Vishwajeet Kumar Samarth Bharadwaj Jaydeep Sen Mustafa Canim Soumen Chakrabarti A. Gliozzo Karthik Sankaranarayanan OOD 96 11 0 15 Sep 2021
Prefix-to-SQL: Text-to-SQL Generation from Incomplete User Questions Naihao Deng Shuaichen Chang Peng Shi Tao Yu Rui Zhang LMTD 64 4 0 15 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering Ander Salaberria Gorka Azkune Oier López de Lacalle Aitor Soroa Etxabe Eneko Agirre 92 61 0 15 Sep 2021
Improving Text Auto-Completion with Next Phrase Prediction Dong-Ho Lee Zhiqiang Hu Roy Ka-wei Lee LRM 50 4 0 15 Sep 2021
Attention Is Indeed All You Need: Semantically Attention-Guided Decoding for Data-to-Text NLG Juraj Juraska M. Walker 56 17 0 15 Sep 2021
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension Naoya Inoue H. Trivedi Steven K. Sinha Niranjan Balasubramanian Kentaro Inui 78 16 0 14 Sep 2021
KFCNet: Knowledge Filtering and Contrastive Learning Network for Generative Commonsense Reasoning Haonan Li Yeyun Gong Jian Jiao Ruofei Zhang Timothy Baldwin Nan Duan OffRL 93 6 0 14 Sep 2021
Exploring Prompt-based Few-shot Learning for Grounded Dialog Generation Chujie Zheng Minlie Huang 104 44 0 14 Sep 2021
Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding Shiyang Li Semih Yavuz Wenhu Chen Xifeng Yan 69 12 0 14 Sep 2021
STraTA: Self-Training with Task Augmentation for Better Few-shot Learning Tu Vu Minh-Thang Luong Quoc V. Le Grady Simon Mohit Iyyer 176 61 0 13 Sep 2021
SituatedQA: Incorporating Extra-Linguistic Contexts into QA Michael J.Q. Zhang Eunsol Choi RALM 87 154 0 13 Sep 2021
Packed Levitated Marker for Entity and Relation Extraction Deming Ye Yankai Lin Peng Li Maosong Sun 212 112 0 13 Sep 2021
Question Answering over Electronic Devices: A New Benchmark Dataset and a Multi-Task Learning based QA Framework Abhilash Nandy Soumya Sharma Shubham Maddhashiya K. Sachdeva Pawan Goyal Niloy Ganguly 68 19 0 13 Sep 2021
Abstract, Rationale, Stance: A Joint Model for Scientific Claim Verification Zhiwei Zhang Jiyi Li Fumiyo Fukumoto Yanming Ye 84 28 0 13 Sep 2021
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation Yunfan Shao Zhichao Geng Yitao Liu Junqi Dai Hang Yan Fei Yang Li Zhe Hujun Bao Xipeng Qiu MedIm 148 151 0 13 Sep 2021
Contrastive Learning for Context-aware Neural Machine TranslationUsing Coreference Information Yong-keun Hwang Hyungu Yun Kyomin Jung 64 11 0 13 Sep 2021
How to Select One Among All? An Extensive Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding Tianda Li Ahmad Rashid A. Jafari Pranav Sharma A. Ghodsi Mehdi Rezagholizadeh AAML 122 5 0 13 Sep 2021
SHAPE: Shifted Absolute Position Embedding for Transformers Shun Kiyono Sosuke Kobayashi Jun Suzuki Kentaro Inui 292 47 0 13 Sep 2021
Good-Enough Example Extrapolation Jason W. Wei 60 6 0 12 Sep 2021
End-to-End Conversational Search for Online Shopping with Utterance Transfer Liqiang Xiao Jun Ma Xin Luna Dong Pascual Martínez-Gómez Nasser Zalmout Wei Chen Tong Zhao Hao He Yaohui Jin 46 12 0 12 Sep 2021
"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding Faeze Brahman Meng Huang Oyvind Tafjord Chao Zhao Mrinmaya Sachan Snigdha Chaturvedi 77 57 0 12 Sep 2021
Multilingual Translation via Grafting Pre-trained Language Models Zewei Sun Mingxuan Wang Lei Li AI4CE 240 22 0 11 Sep 2021
Semantic Categorization of Social Knowledge for Commonsense Question Answering Gengyu Wang Xiaochen Hou Diyi Yang Kathleen McKeown Jing Huang VLM 49 3 0 11 Sep 2021
StreamHover: Livestream Transcript Summarization and Annotation Sangwoo Cho Franck Dernoncourt Timothy Jeewun Ganter Trung Bui Nedim Lipka Walter Chang Hailin Jin Jonathan Brandt H. Foroosh Fei Liu 3DGS AI4TS 75 29 0 11 Sep 2021
PICARD: Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models Torsten Scholak Nathan Schucher Dzmitry Bahdanau 236 396 0 10 Sep 2021