ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,870 papers shown
Title
Learning to Transfer Prompts for Text Generation
Learning to Transfer Prompts for Text Generation
Junyi Li
Tianyi Tang
J. Nie
Ji-Rong Wen
Wayne Xin Zhao
81
40
0
03 May 2022
Meta Learning for Natural Language Processing: A Survey
Meta Learning for Natural Language Processing: A Survey
Hung-yi Lee
Shang-Wen Li
Ngoc Thang Vu
98
45
0
03 May 2022
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot
  with Multi-Source Learning
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning
Oscar Sainz
Itziar Gonzalez-Dios
Oier López de Lacalle
Bonan Min
Eneko Agirre
79
50
0
03 May 2022
A Survey of Deep Learning Models for Structural Code Understanding
A Survey of Deep Learning Models for Structural Code Understanding
Ruoting Wu
Yuxin Zhang
Qibiao Peng
Liang Chen
Zibin Zheng
83
7
0
03 May 2022
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo
  Languages
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages
Felix Wu
Kwangyoun Kim
Shinji Watanabe
Kyu Jeong Han
Ryan T. McDonald
Kilian Q. Weinberger
Yoav Artzi
SyDa
105
39
0
02 May 2022
OPT: Open Pre-trained Transformer Language Models
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLMOSLMAI4CE
394
3,707
0
02 May 2022
State-of-the-art in Open-domain Conversational AI: A Survey
State-of-the-art in Open-domain Conversational AI: A Survey
Tosin Adewumi
F. Liwicki
Marcus Liwicki
113
15
0
02 May 2022
Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering
Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering
A. Piergiovanni
Wei Li
Weicheng Kuo
M. Saffar
Fred Bertsch
A. Angelova
77
16
0
02 May 2022
Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming
  Disfluency Detection
Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Angelica Chen
Vicky Zayats
D. D. Walker
Dirk Padfield
65
14
0
02 May 2022
POLITICS: Pretraining with Same-story Article Comparison for Ideology
  Prediction and Stance Detection
POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection
Yujian Liu
Xinliang Frederick Zhang
David Wegsman
Nick Beauchamp
Lu Wang
69
73
0
02 May 2022
MRKL Systems: A modular, neuro-symbolic architecture that combines large
  language models, external knowledge sources and discrete reasoning
MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning
Ehud D. Karpas
Omri Abend
Yonatan Belinkov
Barak Lenz
Opher Lieber
...
Erez Schwartz
Gal Shachaf
Shai Shalev-Shwartz
Amnon Shashua
Moshe Tenenholtz
LLMAG
74
70
0
01 May 2022
Don't Blame the Annotator: Bias Already Starts in the Annotation
  Instructions
Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions
Mihir Parmar
Swaroop Mishra
Mor Geva
Chitta Baral
114
55
0
01 May 2022
Leveraging Emotion-specific Features to Improve Transformer Performance
  for Emotion Classification
Leveraging Emotion-specific Features to Improve Transformer Performance for Emotion Classification
Shaily Desai
Atharva Kshirsagar
Aditi Sidnerlikar
Nikhil Khodake
M. Marathe
47
4
0
30 Apr 2022
Clues Before Answers: Generation-Enhanced Multiple-Choice QA
Clues Before Answers: Generation-Enhanced Multiple-Choice QA
Zixian Huang
Ao Wu
Jiaying Zhou
Yu Gu
Yue Zhao
Gong Cheng
41
27
0
30 Apr 2022
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence
  Encoders
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
Ivan Vulić
Goran Glavaš
Fangyu Liu
Nigel Collier
Edoardo Ponti
Anna Korhonen
96
9
0
30 Apr 2022
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language
  Processing
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing
Chengyu Wang
Minghui Qiu
Chen Shi
Taolin Zhang
Tingting Liu
Lei Li
Jiadong Wang
Ming Wang
Jun Huang
W. Lin
76
21
0
30 Apr 2022
Solution of DeBERTaV3 on CommonsenseQA
Solution of DeBERTaV3 on CommonsenseQA
Letian Peng
Zuchao Li
Hai Zhao
30
0
0
30 Apr 2022
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem
  Solvers
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar
Rishabh Maheshwary
Vikram Pudi
AIMat
91
14
0
30 Apr 2022
Prompt Consistency for Zero-Shot Task Generalization
Prompt Consistency for Zero-Shot Task Generalization
Chunting Zhou
Junxian He
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
VLM
108
79
0
29 Apr 2022
Polyglot Prompt: Multilingual Multitask PrompTraining
Polyglot Prompt: Multilingual Multitask PrompTraining
Jinlan Fu
See-Kiong Ng
Pengfei Liu
68
8
0
29 Apr 2022
TemporalWiki: A Lifelong Benchmark for Training and Evaluating
  Ever-Evolving Language Models
TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
Joel Jang
Seonghyeon Ye
Changho Lee
Sohee Yang
Joongbo Shin
Janghoon Han
Gyeonghun Kim
Minjoon Seo
CLLKELM
144
98
0
29 Apr 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLMVLM
431
3,617
0
29 Apr 2022
Training Language Models with Language Feedback
Training Language Models with Language Feedback
Jérémy Scheurer
Jon Ander Campos
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
122
51
0
29 Apr 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model
  Pretraining
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
Yuting Gao
Jinfeng Liu
Zihan Xu
Jinchao Zhang
Ke Li
Rongrong Ji
Chunhua Shen
VLMCLIP
131
104
0
29 Apr 2022
Instilling Type Knowledge in Language Models via Multi-Task QA
Instilling Type Knowledge in Language Models via Multi-Task QA
Shuyang Li
Mukund Sridhar
Chandan Prakash
Jin Cao
Wael Hamza
Julian McAuley
KELM
79
7
0
28 Apr 2022
Faithful to the Document or to the World? Mitigating Hallucinations via
  Entity-linked Knowledge in Abstractive Summarization
Faithful to the Document or to the World? Mitigating Hallucinations via Entity-linked Knowledge in Abstractive Summarization
Yue Dong
John Wieting
Pat Verga
HILM
82
26
0
28 Apr 2022
CAVES: A Dataset to facilitate Explainable Classification and
  Summarization of Concerns towards COVID Vaccines
CAVES: A Dataset to facilitate Explainable Classification and Summarization of Concerns towards COVID Vaccines
Soham Poddar
Azlaan Mustafa Samad
Rajdeep Mukherjee
Niloy Ganguly
Saptarshi Ghosh
82
28
0
28 Apr 2022
On the Effect of Pretraining Corpora on In-context Learning by a
  Large-scale Language Model
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
Seongjin Shin
Sang-Woo Lee
Hwijeen Ahn
Sungdong Kim
Hyoungseok Kim
...
Kyunghyun Cho
Gichang Lee
W. Park
Jung-Woo Ha
Nako Sung
LRM
117
97
0
28 Apr 2022
Attention Mechanism with Energy-Friendly Operations
Attention Mechanism with Energy-Friendly Operations
Boyi Deng
Baosong Yang
Dayiheng Liu
Rong Xiao
Derek F. Wong
Haibo Zhang
Boxing Chen
Lidia S. Chao
MU
380
2
0
28 Apr 2022
Systematic Literature Review: Anti-Phishing Defences and Their
  Application to Before-the-click Phishing Email Detection
Systematic Literature Review: Anti-Phishing Defences and Their Application to Before-the-click Phishing Email Detection
T. Wood
Vitor Basto-Fernandes
E. Boiten
I. Yevseyeva
AAML
52
3
0
27 Apr 2022
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for
  Dialog Response Generation
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation
Wei Chen
Yeyun Gong
Song Wang
Bolun Yao
Weizhen Qi
...
Bartuer Zhou
Yi Mao
Weizhu Chen
Biao Cheng
Nan Duan
VLM
82
48
0
27 Apr 2022
An End-to-End Dialogue Summarization System for Sales Calls
An End-to-End Dialogue Summarization System for Sales Calls
Abedelkadir Asi
Song Wang
Roy Eisenstadt
Dean Geckt
Yarin Kuper
Yi Mao
Royi Ronen
95
16
0
27 Apr 2022
Modern Baselines for SPARQL Semantic Parsing
Modern Baselines for SPARQL Semantic Parsing
Debayan Banerjee
Pranav Ajit Nair
Jivat Neet Kaur
Ricardo Usbeck
Chris Biemann
85
32
0
27 Apr 2022
Plug-and-Play Adaptation for Continuously-updated QA
Plug-and-Play Adaptation for Continuously-updated QA
Kyungjae Lee
Wookje Han
Seung-won Hwang
Hwaran Lee
Joonsuk Park
Sang-Woo Lee
KELM
94
16
0
27 Apr 2022
On the Limitations of Dataset Balancing: The Lost Battle Against
  Spurious Correlations
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
102
26
0
27 Apr 2022
SkillNet-NLG: General-Purpose Natural Language Generation with a
  Sparsely Activated Approach
SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach
Junwei Liao
Duyu Tang
Fan Zhang
Shuming Shi
MoE
57
5
0
26 Apr 2022
GypSum: Learning Hybrid Representations for Code Summarization
GypSum: Learning Hybrid Representations for Code Summarization
Yu Wang
Yu Dong
Xuesong Lu
Aoying Zhou
57
27
0
26 Apr 2022
Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce
  Data Annotation Required in Visual Commonsense Tasks
Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce Data Annotation Required in Visual Commonsense Tasks
Navid Rezaei
Marek Reformat
VLM
48
2
0
25 Apr 2022
Translation between Molecules and Natural Language
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
136
172
0
25 Apr 2022
Conversational Question Answering on Heterogeneous Sources
Conversational Question Answering on Heterogeneous Sources
Philipp Christmann
Rishiraj Saha Roy
Gerhard Weikum
87
44
0
25 Apr 2022
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking
  Inference
ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference
Kai Hui
Honglei Zhuang
Tao Chen
Zhen Qin
Jing Lu
...
Ji Ma
Jai Gupta
Cicero Nogueira dos Santos
Yi Tay
Donald Metzler
94
16
0
25 Apr 2022
Exploring the Role of Task Transferability in Large-Scale Multi-Task
  Learning
Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning
Vishakh Padmakumar
Leonard Lausen
Miguel Ballesteros
Sheng Zha
He He
George Karypis
104
20
0
23 Apr 2022
LitMind Dictionary: An Open-Source Online Dictionary
LitMind Dictionary: An Open-Source Online Dictionary
Cunliang Kong
Xuezhi Fang
Liner Yang
Yuxiang Chen
Erhong Yang
28
0
0
23 Apr 2022
Locally Aggregated Feature Attribution on Natural Language Model
  Understanding
Locally Aggregated Feature Attribution on Natural Language Model Understanding
Shenmin Zhang
Jin Wang
Haitao Jiang
Rui Song
FAtt
69
3
0
22 Apr 2022
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue
Nouha Dziri
Ehsan Kamalloo
Sivan Milton
Osmar Zaiane
Mo Yu
Edoardo Ponti
Siva Reddy
HILM
153
91
0
22 Apr 2022
Autoregressive Search Engines: Generating Substrings as Document
  Identifiers
Autoregressive Search Engines: Generating Substrings as Document Identifiers
Michele Bevilacqua
G. Ottaviano
Patrick Lewis
Wen-tau Yih
Sebastian Riedel
Fabio Petroni
KELMRALM
147
166
0
22 Apr 2022
KALA: Knowledge-Augmented Language Model Adaptation
KALA: Knowledge-Augmented Language Model Adaptation
Minki Kang
Jinheon Baek
Sung Ju Hwang
VLMKELM
100
36
0
22 Apr 2022
Decorate the Examples: A Simple Method of Prompt Design for Biomedical
  Relation Extraction
Decorate the Examples: A Simple Method of Prompt Design for Biomedical Relation Extraction
Hui-Syuan Yeh
Thomas Lavergne
Pierre Zweigenbaum
51
12
0
21 Apr 2022
Standing on the Shoulders of Giant Frozen Language Models
Standing on the Shoulders of Giant Frozen Language Models
Yoav Levine
Itay Dalmedigos
Ori Ram
Yoel Zeldes
Daniel Jannai
...
Barak Lenz
Shai Shalev-Shwartz
Amnon Shashua
Kevin Leyton-Brown
Y. Shoham
VLM
99
49
0
21 Apr 2022
Spurious Correlations in Reference-Free Evaluation of Text Generation
Spurious Correlations in Reference-Free Evaluation of Text Generation
Esin Durmus
Faisal Ladhak
Tatsunori Hashimoto
62
32
0
21 Apr 2022
Previous
123...161162163...196197198
Next