Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,973 papers shown
EDM3: Event Detection as Multi-task Text Generation
Ujjwala Anantheswaran
Himanshu Gupta
Mihir Parmar
Kuntal Kumar Pal
Chitta Baral
89
5
0
25 May 2023
Enhancing Grammatical Error Correction Systems with Explanations
Yuejiao Fei
Leyang Cui
Sen Yang
Wai Lam
Zhenzhong Lan
Shuming Shi
88
15
0
25 May 2023
Scaling Data-Constrained Language Models
Niklas Muennighoff
Alexander M. Rush
Boaz Barak
Teven Le Scao
Aleksandra Piktus
Nouamane Tazi
S. Pyysalo
Thomas Wolf
Colin Raffel
ALM
198
226
0
25 May 2023
READ: Recurrent Adaptation of Large Transformers
Sida I. Wang
John Nguyen
Ke Li
Carole-Jean Wu
55
11
0
24 May 2023
Learning Answer Generation using Supervision from Automatic Question Answering Evaluators
Matteo Gabburo
Siddhant Garg
Rik Koncel-Kedziorski
Alessandro Moschitti
80
6
0
24 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
130
51
0
24 May 2023
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
Zirui Liu
Guanchu Wang
Shaochen Zhong
Zhaozhuo Xu
Daochen Zha
...
Zhimeng Jiang
Kaixiong Zhou
Vipin Chaudhary
Shuai Xu
Helen Zhou
106
15
0
24 May 2023
The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing
Debayan Banerjee
Pranav Ajit Nair
Ricardo Usbeck
Chris Biemann
75
1
0
24 May 2023
Referral Augmentation for Zero-Shot Information Retrieval
Michael Tang
Shunyu Yao
John Yang
Karthik Narasimhan
90
3
0
24 May 2023
Dynamic Masking Rate Schedules for MLM Pretraining
Zachary Ankner
Naomi Saphra
Davis W. Blalock
Jonathan Frankle
Matthew L. Leavitt
101
8
0
24 May 2023
Lawyer LLaMA Technical Report
Quzhe Huang
Mingxu Tao
Chen Zhang
Zhenwei An
Cong Jiang
Zhibin Chen
Zirui Wu
Yansong Feng
ELM, ALM, AILaw
131
55
0
24 May 2023
A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction
Erica Cai
Brendan O'Connor
72
3
0
24 May 2023
ImageNetVC: Zero- and Few-Shot Visual Commonsense Evaluation on 1000 ImageNet Categories
Heming Xia
Qingxiu Dong
Lei Li
Jingjing Xu
Tianyu Liu
Ziwei Qin
Zhifang Sui
MLLM, VLM
61
3
0
24 May 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
Gen Luo
Yiyi Zhou
Tianhe Ren
Shen Chen
Xiaoshuai Sun
Rongrong Ji
VLM, MLLM
124
98
0
24 May 2023
An Efficient Multilingual Language Model Compression through Vocabulary Trimming
Asahi Ushio
Yi Zhou
Jose Camacho-Collados
141
8
0
24 May 2023
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
Marek Kadlcík
Michal Štefánik
Ondřej Sotolář
Vlastimil Martinek
LRM
72
15
0
24 May 2023
Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization
Aman Priyanshu
Supriti Vijay
Ayush Kumar
Rakshit Naidu
Fatemehsadat Mireshghallah
SILM
136
25
0
24 May 2023
Sentiment Analysis in the Era of Large Language Models: A Reality Check
Wenxuan Zhang
Yue Deng
Bing-Quan Liu
Sinno Jialin Pan
Lidong Bing
AI4MH
103
312
0
24 May 2023
LLMDet: A Third Party Large Language Models Generated Text Detection Tool
Kangxi Wu
Liang Pang
Huawei Shen
Xueqi Cheng
Tat-Seng Chua
DeLMO
93
42
0
24 May 2023
MuLER: Detailed and Scalable Reference-based Evaluation
Taelin Karidi
Leshem Choshen
Gal Patel
Omri Abend
81
0
0
24 May 2023
Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking Scenarios
Yilun Zhao
Haowei Zhang
Shengyun Si
Linyong Nan
Xiangru Tang
Arman Cohan
LMTD
108
12
0
24 May 2023
Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality
Tanay Dixit
Fei Wang
Muhao Chen
HILM
63
10
0
24 May 2023
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Minje Choi
Jiaxin Pei
Sagar Kumar
Chang Shu
David Jurgens
ALM, LLMAG
143
72
0
24 May 2023
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan O. Arik
Tomas Pfister
LRM
110
12
0
24 May 2023
PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions
Anthony Chen
Panupong Pasupat
Sameer Singh
Hongrae Lee
Kelvin Guu
125
48
0
24 May 2023
Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering
Wang Zhu
Jesse Thomason
Robin Jia
BDL, LRM
77
8
0
24 May 2023
Extracting Psychological Indicators Using Question Answering
Luka Pavlović
21
0
0
24 May 2023
Pre-RMSNorm and Pre-CRMSNorm Transformers: Equivalent and Efficient Pre-LN Transformers
Zixuan Jiang
Jiaqi Gu
Hanqing Zhu
David Z. Pan
AI4CE
104
18
0
24 May 2023
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts
Yunshui Li
Binyuan Hui
Zhichao Yin
Min Yang
Fei Huang
Yongbin Li
MoE
98
21
0
24 May 2023
Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification
Chengyu Dong
Zihan Wang
Jingbo Shang
97
4
0
24 May 2023
Faithful Low-Resource Data-to-Text Generation through Cycle Training
Zhuoer Wang
Marcus D. Collins
Nikhita Vedula
Simone Filice
S. Malmasi
Oleg Rokhlenko
99
10
0
24 May 2023
Anthropomorphization of AI: Opportunities and Risks
Ameet Deshpande
Tanmay Rajpurohit
Karthik Narasimhan
Ashwin Kalyan
83
24
0
24 May 2023
Allies: Prompting Large Language Model with Beam Search
Hao Sun
Xiao Liu
Yeyun Gong
Yan Zhang
Daxin Jiang
Linjun Yang
Nan Duan
RALM
105
6
0
24 May 2023
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Ahmed Masry
P. Kavehzadeh
Do Xuan Long
Enamul Hoque
Shafiq Joty
LRM
97
113
0
24 May 2023
A New Era in Software Security: Towards Self-Healing Software via Large Language Models and Formal Verification
Norbert Tihanyi
Ridhi Jain
Yiannis Charalambous
M. Ferrag
Youcheng Sun
Lucas C. Cordeiro
92
59
0
24 May 2023
Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models
Lyne Tchapmi
Mingyu Derek Ma
Fei Wang
Chaowei Xiao
Muhao Chen
SILM
149
85
0
24 May 2023
TACR: A Table-alignment-based Cell-selection and Reasoning Model for Hybrid Question-Answering
Jian Wu
Yicheng Xu
Yan Gao
Jian-Guang Lou
Börje F. Karlsson
Manabu Okumura
LMTD
64
3
0
24 May 2023
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Woojeong Jin
Subhabrata Mukherjee
Yu Cheng
Yelong Shen
Weizhu Chen
Ahmed Hassan Awadallah
Damien Jose
Xiang Ren
ObjD, VLM
121
8
0
24 May 2023
InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction
Ishani Mondal
Michelle Yuan
N. Anandhavelu
Aparna Garimella
Francis Ferraro
Andrew Blair-Stanek
Benjamin Van Durme
Jordan L. Boyd-Graber
73
1
0
24 May 2023
Enabling Large Language Models to Generate Text with Citations
Tianyu Gao
Howard Yen
Jiatong Yu
Danqi Chen
LM&MA, HILM
177
358
0
24 May 2023
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models
Miaoran Li
Baolin Peng
Michel Galley
Jianfeng Gao
Zhu Zhang
LRM, HILM, KELM
95
30
0
24 May 2023
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy
Sarah Wiegreffe
Matthew Finlayson
Oyvind Tafjord
Peter Clark
Ashish Sabharwal
91
7
0
24 May 2023
Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding
Ruohao Guo
Wei Xu
Alan Ritter
87
4
0
24 May 2023
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Ziwei He
Meng Yang
Minwei Feng
Jingcheng Yin
Xiang Wang
Jingwen Leng
Zhouhan Lin
ViT
104
14
0
24 May 2023
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems
Jakub Macina
Nico Daheim
Sankalan Pal Chowdhury
Tanmay Sinha
Manu Kapur
Iryna Gurevych
Mrinmaya Sachan
LRM
131
68
0
23 May 2023
The State of the Art in Creating Visualization Corpora for Automated Chart Analysis
Chong Chen
Zhicheng Liu
95
14
0
23 May 2023
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment
Sky CH-Wang
Arkadiy Saakyan
Aochong Li
Zhou Yu
Smaranda Muresan
110
17
0
23 May 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models
Tarek Naous
Michael Joseph Ryan
Alan Ritter
Wei Xu
129
96
0
23 May 2023
On Robustness of Finetuned Transformer-based NLP Models
Pavan Kalyan Reddy Neerudu
Subba Reddy Oota
Mounika Marreddy
Venkateswara Rao Kagita
Manish Gupta
85
9
0
23 May 2023
Schema-Driven Information Extraction from Heterogeneous Tables
Fan Bai
Junmo Kang
Gabriel Stanovsky
Dayne Freitag
Alan Ritter
LMTD
91
14
0
23 May 2023