ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,948 papers shown
Title
An Analysis of Abstractive Text Summarization Using Pre-trained Models
An Analysis of Abstractive Text Summarization Using Pre-trained Models
Tohida Rehman
S. Das
Debarshi Kumar Sanyal
S. Chattopadhyay
114
11
0
25 Feb 2023
Named Entity Recognition Based Automatic Generation of Research
  Highlights
Named Entity Recognition Based Automatic Generation of Research Highlights
Tohida Rehman
Debarshi Kumar Sanyal
Prasenjit Majumder
S. Chattopadhyay
83
9
0
25 Feb 2023
AugGPT: Leveraging ChatGPT for Text Data Augmentation
AugGPT: Leveraging ChatGPT for Text Data Augmentation
Haixing Dai
Zheng Liu
Wenxiong Liao
Xiaoke Huang
Yihan Cao
...
Lichao Sun
Quanzheng Li
Dinggang Shen
Tianming Liu
Xiang Li
141
161
0
25 Feb 2023
STA: Self-controlled Text Augmentation for Improving Text
  Classifications
STA: Self-controlled Text Augmentation for Improving Text Classifications
Congcong Wang
Gonzalo Fiz Pontiveros
Steven Derby
Tri Kurniawan Wijaya
74
4
0
24 Feb 2023
CARE: Collaborative AI-Assisted Reading Environment
CARE: Collaborative AI-Assisted Reading Environment
Dennis Zyska
Nils Dycke
Jan Buchmann
Ilia Kuznetsov
Iryna Gurevych
77
6
0
24 Feb 2023
MUX-PLMs: Data Multiplexing for High-throughput Language Models
MUX-PLMs: Data Multiplexing for High-throughput Language Models
Vishvak Murahari
Ameet Deshpande
Carlos E. Jimenez
Izhak Shafran
Mingqiu Wang
Yuan Cao
Karthik Narasimhan
MoE
63
5
0
24 Feb 2023
Aligning Text-to-Image Models using Human Feedback
Aligning Text-to-Image Models using Human Feedback
Kimin Lee
Hao Liu
Moonkyung Ryu
Olivia Watkins
Yuqing Du
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
S. Gu
EGVM
169
286
0
23 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
92
5
0
23 Feb 2023
ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep
  Learning Paradigms
ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms
Minzhou Pan
Yi Zeng
Lingjuan Lyu
Xinyu Lin
R. Jia
AAML
104
38
0
22 Feb 2023
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution
  Perspective
On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective
Jindong Wang
Xixu Hu
Wenxin Hou
Hao Chen
Runkai Zheng
...
Weirong Ye
Xiubo Geng
Binxing Jiao
Yue Zhang
Xingxu Xie
AI4MH
180
241
0
22 Feb 2023
$k$NN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
kkkNN-Adapter: Efficient Domain Adaptation for Black-Box Language Models
Yangsibo Huang
Daogao Liu
Zexuan Zhong
Weijia Shi
Y. Lee
RALMALM
80
16
0
21 Feb 2023
Deep Transformers without Shortcuts: Modifying Self-attention for
  Faithful Signal Propagation
Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation
Bobby He
James Martens
Guodong Zhang
Aleksandar Botev
Andy Brock
Samuel L. Smith
Yee Whye Teh
92
30
0
20 Feb 2023
Poisoning Web-Scale Training Datasets is Practical
Poisoning Web-Scale Training Datasets is Practical
Nicholas Carlini
Matthew Jagielski
Christopher A. Choquette-Choo
Daniel Paleka
Will Pearce
Hyrum S. Anderson
Andreas Terzis
Kurt Thomas
Florian Tramèr
SILM
131
204
0
20 Feb 2023
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained
  Transformers
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers
Chen Liang
Haoming Jiang
Zheng Li
Xianfeng Tang
Bin Yin
Tuo Zhao
VLM
136
25
0
19 Feb 2023
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and
  Fine-tuned BERT
Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
AI4MH
145
245
0
19 Feb 2023
Transformadores: Fundamentos teoricos y Aplicaciones
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
175
0
0
18 Feb 2023
KILM: Knowledge Injection into Encoder-Decoder Language Models
KILM: Knowledge Injection into Encoder-Decoder Language Models
Yan Xu
Mahdi Namazifar
Devamanyu Hazarika
Aishwarya Padmakumar
Yang Liu
Dilek Z. Hakkani-Tür
KELM
79
27
0
17 Feb 2023
Cluster-Guided Label Generation in Extreme Multi-Label Classification
Cluster-Guided Label Generation in Extreme Multi-Label Classification
Taehee Jung
Joo-Kyung Kim
Sungjin Lee
Dongyeop Kang
VLM
82
6
0
17 Feb 2023
Complex QA and language models hybrid architectures, Survey
Complex QA and language models hybrid architectures, Survey
Xavier Daull
P. Bellot
Emmanuel Bruno
Vincent Martin
Elisabeth Murisasco
ELM
221
16
0
17 Feb 2023
Natural Response Generation for Chinese Reading Comprehension
Natural Response Generation for Chinese Reading Comprehension
Nuo Chen
Hongguang Li
Yinan Bao
Baoyuan Wang
Jia Li
40
1
0
17 Feb 2023
PAC Prediction Sets for Large Language Models of Code
PAC Prediction Sets for Large Language Models of Code
Adam Khakhar
Stephen Mell
Osbert Bastani
124
6
0
17 Feb 2023
LEVER: Learning to Verify Language-to-Code Generation with Execution
LEVER: Learning to Verify Language-to-Code Generation with Execution
Ansong Ni
Srini Iyer
Dragomir R. Radev
Ves Stoyanov
Wen-tau Yih
Sida I. Wang
Xi Lin
145
227
0
16 Feb 2023
Aligning Language Models with Preferences through f-divergence
  Minimization
Aligning Language Models with Preferences through f-divergence Minimization
Dongyoung Go
Tomasz Korbak
Germán Kruszewski
Jos Rozen
Nahyeon Ryu
Marc Dymetman
111
76
0
16 Feb 2023
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on
  Production AI Platform
Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform
Shiwei Zhang
Lansong Diao
Siyu Wang
Zongyan Cao
Yiliang Gu
Chang Si
Ziji Shi
Zhen Zheng
Chuan Wu
W. Lin
AI4CE
61
4
0
16 Feb 2023
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
MINOTAUR: Multi-task Video Grounding From Multimodal Queries
Raghav Goyal
E. Mavroudi
Xitong Yang
Sainbayar Sukhbaatar
Leonid Sigal
Matt Feiszli
Lorenzo Torresani
Du Tran
95
7
0
16 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep
  Learning Model Training
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
84
8
0
16 Feb 2023
ANSEL Photobot: A Robot Event Photographer with Semantic Intelligence
ANSEL Photobot: A Robot Event Photographer with Semantic Intelligence
D. Rivkin
Gregory Dudek
Nikhil Kakodkar
David Meger
Oliver Limoyo
Xue Liu
F. Hogan
LM&Ro
69
6
0
15 Feb 2023
Measuring the Instability of Fine-Tuning
Measuring the Instability of Fine-Tuning
Yupei Du
D. Nguyen
76
4
0
15 Feb 2023
The Capacity for Moral Self-Correction in Large Language Models
The Capacity for Moral Self-Correction in Large Language Models
Deep Ganguli
Amanda Askell
Nicholas Schiefer
Thomas I. Liao
Kamil.e Lukovsiut.e
...
Tom B. Brown
C. Olah
Jack Clark
Sam Bowman
Jared Kaplan
LRMReLM
92
171
0
15 Feb 2023
PolyFormer: Referring Image Segmentation as Sequential Polygon
  Generation
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation
Jiang Liu
Hui Ding
Zhaowei Cai
Yuting Zhang
R. Satzoda
Vijay Mahadevan
R. Manmatha
ObjD
128
133
0
14 Feb 2023
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained
  Language Models
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Alexander Fraser
Jesse Dodge
MoMe
95
72
0
14 Feb 2023
Generation of Highlights from Research Papers Using Pointer-Generator
  Networks and SciBERT Embeddings
Generation of Highlights from Research Papers Using Pointer-Generator Networks and SciBERT Embeddings
Tohida Rehman
Debarshi Kumar Sanyal
S. Chattopadhyay
Plaban Kumar Bhowmick
P. Das
79
11
0
14 Feb 2023
Few-shot learning approaches for classifying low resource domain
  specific software requirements
Few-shot learning approaches for classifying low resource domain specific software requirements
Anmol Nayak
Hariprasad Timmapathini
Vidhya Murali
A. Gohad
20
1
0
14 Feb 2023
The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis
  and Algorithm for Robust Natural Language Generation
The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation
Kushal Arora
Timothy J. O'Donnell
Doina Precup
Jason Weston
Jackie C.K.Cheung
66
2
0
14 Feb 2023
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark
D. Ribeiro
Shen Wang
Xiaofei Ma
He Zhu
Rui Dong
...
William Yang Wang
Zhiheng Huang
George Karypis
Bing Xiang
Dan Roth
LRMReLM
86
23
0
13 Feb 2023
Symbolic Discovery of Optimization Algorithms
Symbolic Discovery of Optimization Algorithms
Xiangning Chen
Chen Liang
Da Huang
Esteban Real
Kaiyuan Wang
...
Xuanyi Dong
Thang Luong
Cho-Jui Hsieh
Yifeng Lu
Quoc V. Le
181
383
0
13 Feb 2023
AbLit: A Resource for Analyzing and Generating Abridged Versions of
  English Literature
AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature
Melissa Roemmele
Kyle Shaffer
Katrina Olsen
Yiyi Wang
Steve DeNeefe
49
1
0
13 Feb 2023
LipLearner: Customizable Silent Speech Interactions on Mobile Devices
LipLearner: Customizable Silent Speech Interactions on Mobile Devices
Zixiong Su
Shitao Fang
Jun Rekimoto
91
26
0
12 Feb 2023
TextDefense: Adversarial Text Detection based on Word Importance Entropy
TextDefense: Adversarial Text Detection based on Word Importance Entropy
Lujia Shen
Xuhong Zhang
S. Ji
Yuwen Pu
Chunpeng Ge
Xing Yang
Yanghe Feng
AAML
64
8
0
12 Feb 2023
Transformer models: an introduction and catalog
Transformer models: an introduction and catalog
X. Amatriain
Ananth Sankar
Jie Bing
Praveen Kumar Bodigutla
Timothy J. Hazen
Michaeel Kazi
146
53
0
12 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction
  Followers
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
103
47
0
10 Feb 2023
In-Context Learning with Many Demonstration Examples
In-Context Learning with Many Demonstration Examples
Mukai Li
Shansan Gong
Jiangtao Feng
Yiheng Xu
Jinchao Zhang
Zhiyong Wu
Lingpeng Kong
113
38
0
09 Feb 2023
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Waad Alhoshan
Alessio Ferrari
Liping Zhao
VLM
113
41
0
09 Feb 2023
A Text-guided Protein Design Framework
A Text-guided Protein Design Framework
Shengchao Liu
Yanjing Li
Zhuoxinran Li
A. Gitter
Yutao Zhu
...
Arvind Ramanathan
Chaowei Xiao
Jian Tang
Hongyu Guo
Anima Anandkumar
138
70
0
09 Feb 2023
GPTScore: Evaluate as You Desire
GPTScore: Evaluate as You Desire
Jinlan Fu
See-Kiong Ng
Zhengbao Jiang
Pengfei Liu
LM&MAALMELM
196
292
0
08 Feb 2023
Automating Code-Related Tasks Through Transformers: The Impact of
  Pre-training
Automating Code-Related Tasks Through Transformers: The Impact of Pre-training
Rosalia Tufano
L. Pascarella
Gabriele Bavota
79
21
0
08 Feb 2023
Syntax and Domain Aware Model for Unsupervised Program Translation
Syntax and Domain Aware Model for Unsupervised Program Translation
Fang Liu
Jia Li
Li Zhang
71
18
0
08 Feb 2023
EvoText: Enhancing Natural Language Generation Models via
  Self-Escalation Learning for Up-to-Date Knowledge and Improved Performance
EvoText: Enhancing Natural Language Generation Models via Self-Escalation Learning for Up-to-Date Knowledge and Improved Performance
Zheng Yuan
HU Xue
Chuxu Zhang
Yongming Liu
VLM
68
0
0
08 Feb 2023
Long Text and Multi-Table Summarization: Dataset and Method
Long Text and Multi-Table Summarization: Dataset and Method
Shuaiqi Liu
Jiannong Cao
Ruosong Yang
Zhiyuan Wen
RALM
92
21
0
08 Feb 2023
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories
Suyu Ge
Chenyan Xiong
Corby Rosset
Arnold Overwijk
Jiawei Han
Paul N. Bennett
VLM
65
6
0
07 Feb 2023
Previous
123...136137138...197198199
Next