Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,903 papers shown
Title
SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages
Alireza Mohammadshahi
Vassilina Nikoulina
Alexandre Berard
Caroline Brun
James Henderson
Laurent Besacier
VLM
MoE
LRM
91
21
0
20 Oct 2022
Boosting Natural Language Generation from Instructions with Meta-Learning
Budhaditya Deb
Guoqing Zheng
Ahmed Hassan Awadallah
72
16
0
20 Oct 2022
Tag-Set-Sequence Learning for Generating Question-Answer Pairs
Cheng Zhang
Jie Wang
52
2
0
20 Oct 2022
Dense Paraphrasing for Textual Enrichment
Jingxuan Tu
Kyeongmin Rim
E. Holderness
James Pustejovsky
66
6
0
20 Oct 2022
Scaling Instruction-Finetuned Language Models
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
...
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
ReLM
LRM
311
3,178
0
20 Oct 2022
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay
Jason W. Wei
Hyung Won Chung
Vinh Q. Tran
David R. So
...
Donald Metzler
Slav Petrov
N. Houlsby
Quoc V. Le
Mostafa Dehghani
LRM
109
71
0
20 Oct 2022
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Xiangyang Liu
Tianxiang Sun
Xuanjing Huang
Xipeng Qiu
VLM
103
29
0
20 Oct 2022
Disentangling Reasoning Capabilities from Language Models with Compositional Reasoning Transformers
Wanjun Zhong
Tingting Ma
Jiahai Wang
Jian Yin
Tiejun Zhao
Chin-Yew Lin
Nan Duan
LRM
CoGe
79
2
0
20 Oct 2022
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation
Yu Zhao
Jianguo Wei
Zhichao Lin
Yueheng Sun
Meishan Zhang
Hao Fei
79
16
0
20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
152
20
0
19 Oct 2022
NGEP: A Graph-based Event Planning Framework for Story Generation
Chen Tang
Zhihao Zhang
Tyler Loakman
Chenghua Lin
Frank Guerin
83
16
0
19 Oct 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
121
45
0
19 Oct 2022
CPL: Counterfactual Prompt Learning for Vision and Language Models
Xuehai He
Diji Yang
Weixi Feng
Tsu-Jui Fu
Arjun Reddy Akula
Varun Jampani
P. Narayana
Sugato Basu
William Yang Wang
Xinze Wang
VPVLM
VLM
100
15
0
19 Oct 2022
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning
Hongqiu Wu
Ruixue Ding
Haizhen Zhao
Boli Chen
Pengjun Xie
Fei Huang
Min Zhang
MoMe
102
8
0
19 Oct 2022
Improving Aspect Sentiment Quad Prediction via Template-Order Data Augmentation
Mengting Hu
Yike Wu
H. Gao
Yinhao Bai
Shiwan Zhao
91
52
0
19 Oct 2022
Continued Pretraining for Better Zero- and Few-Shot Promptability
Zhaofeng Wu
IV RobertL.Logan
Pete Walsh
Akshita Bhagia
Dirk Groeneveld
Sameer Singh
Iz Beltagy
VLM
108
12
0
19 Oct 2022
Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models
Luke Vilnis
Yury Zemlyanskiy
Patrick C. Murray
Alexandre Passos
Sumit Sanghai
105
10
0
18 Oct 2022
Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Jialin Wu
Raymond J. Mooney
RALM
138
11
0
18 Oct 2022
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler
Jiaxin Zhang
Yashar Moshfeghi
AIMat
68
18
0
18 Oct 2022
The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks
Nikil Selvam
Sunipa Dev
Daniel Khashabi
Tushar Khot
Kai-Wei Chang
ALM
76
26
0
18 Oct 2022
Controllable Fake Document Infilling for Cyber Deception
Yibo Hu
Yu Lin
Eric Parolin
Latif Khan
Kevin W. Hamlen
66
8
0
18 Oct 2022
Taxonomy of Abstractive Dialogue Summarization: Scenarios, Approaches and Future Directions
Qi Jia
Yizhu Liu
Siyu Ren
Kenny Q. Zhu
80
8
0
18 Oct 2022
Transfer learning with affine model transformation
Shunya Minami
Kenji Fukumizu
Yoshihiro Hayashi
Ryo Yoshida
69
1
0
18 Oct 2022
Summary Workbench: Unifying Application and Evaluation of Text Summarization Models
S. Syed
Dominik Schwabe
Martin Potthast
47
0
0
18 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
153
11
0
18 Oct 2022
Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models
Zhiyuan Zhang
Lingjuan Lyu
Xingjun Ma
Chenguang Wang
Xu Sun
AAML
64
43
0
18 Oct 2022
Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing
Ming Li
Ruihong Huang
59
2
0
18 Oct 2022
Deepfake Text Detection: Limitations and Opportunities
Jiameng Pu
Zain Sarwar
Sifat Muhammad Abdullah
A. Rehman
Yoonjin Kim
P. Bhattacharya
M. Javed
Bimal Viswanath
AAML
70
57
0
17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
138
1,107
0
17 Oct 2022
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
...
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
ALM
ELM
LRM
ReLM
292
1,144
0
17 Oct 2022
Table-To-Text generation and pre-training with TabT5
Ewa Andrejczuk
Julian Martin Eisenschlos
Francesco Piccinno
Syrine Krichene
Yasemin Altun
LMTD
66
31
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
96
334
0
17 Oct 2022
PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks
Weiwen Xu
Xin Li
Yang Deng
W. Lam
Lidong Bing
86
10
0
17 Oct 2022
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
Yang Deng
Wenqiang Lei
Wenxuan Zhang
W. Lam
Tat-Seng Chua
101
56
0
17 Oct 2022
Towards Summary Candidates Fusion
Mathieu Ravaut
Shafiq Joty
Nancy F. Chen
92
14
0
17 Oct 2022
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero Training
A. M. H. Tiong
Junnan Li
Boyang Albert Li
Silvio Savarese
Guosheng Lin
MLLM
133
109
0
17 Oct 2022
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains
Minjun Zhu
Yixuan Weng
Shizhu He
Kang Liu
Jun Zhao
LRM
66
6
0
17 Oct 2022
Keep Me Updated! Memory Management in Long-term Conversations
Sanghwan Bae
Donghyun Kwak
Soyoung Kang
Min Young Lee
Sungdong Kim
Yuin Jeong
Hyeri Kim
Sang-Woo Lee
W. Park
Nako Sung
116
52
0
17 Oct 2022
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM
KELM
135
260
0
17 Oct 2022
Teacher Forcing Recovers Reward Functions for Text Generation
Yongchang Hao
Yuxin Liu
Lili Mou
OffRL
91
12
0
17 Oct 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
Hong Liu
Yucheng Cai
Zhijian Ou
Yi Huang
Junlan Feng
ELM
75
4
0
17 Oct 2022
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly
Yi R. Fung
Tuhin Chakraborty
Hao Guo
Owen Rambow
Smaranda Muresan
Heng Ji
86
43
0
16 Oct 2022
Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples
Chengyuan Liu
Leilei Gan
Kun Kuang
Leilei Gan
68
3
0
16 Oct 2022
AskYourDB: An end-to-end system for querying and visualizing relational databases using natural language
Manu Joseph
Harsh Raj
Anubhav Yadav
Aaryaman Sharma
55
5
0
16 Oct 2022
Self-Repetition in Abstractive Neural Summarizers
Nikita Salkar
T. Trikalinos
Byron C. Wallace
A. Nenkova
81
12
0
14 Oct 2022
Neural Attentive Circuits
Nasim Rahaman
M. Weiß
Francesco Locatello
C. Pal
Yoshua Bengio
Bernhard Schölkopf
Erran L. Li
Nicolas Ballas
124
7
0
14 Oct 2022
Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
110
4
0
14 Oct 2022
Extracting Cultural Commonsense Knowledge at Scale
Shrestha Ghosh
Simon Razniewski
A. Varde
Gerhard Weikum
106
66
0
14 Oct 2022
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
Sachin Kumar
Vidhisha Balachandran
Lucille Njoo
Antonios Anastasopoulos
Yulia Tsvetkov
ELM
187
91
0
14 Oct 2022
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling
Jinchao Zhang
Shuyang Jiang
Jiangtao Feng
Lin Zheng
Dianbo Sui
3DV
197
9
0
14 Oct 2022
Previous
1
2
3
...
148
149
150
...
197
198
199
Next