ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,955 papers shown
Title
OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning
Youhe Jiang
Fangcheng Fu
Xupeng Miao
Xiaonan Nie
Tengjiao Wang
73
11
0
17 May 2023
The Jaseci Programming Paradigm and Runtime Stack: Building Scale-out
  Production Applications Easy and Fast
The Jaseci Programming Paradigm and Runtime Stack: Building Scale-out Production Applications Easy and Fast
Jason Mars
Yiping Kang
Roland Daynauth
Baichuan Li
Ashish Mahendra
Krisztian Flautner
Lingjia Tang
GNN
36
3
0
17 May 2023
StructGPT: A General Framework for Large Language Model to Reason over
  Structured Data
StructGPT: A General Framework for Large Language Model to Reason over Structured Data
Jinhao Jiang
Kun Zhou
Zican Dong
Keming Ye
Wayne Xin Zhao
Ji-Rong Wen
LRMLMTDRALM
174
301
0
16 May 2023
Sequence-to-Sequence Pre-training with Unified Modality Masking for
  Visual Document Understanding
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding
ShuWei Feng
Tianyang Zhan
Zhanming Jie
Trung Quoc Luong
Xiaoran Jin
59
1
0
16 May 2023
xPQA: Cross-Lingual Product Question Answering across 12 Languages
xPQA: Cross-Lingual Product Question Answering across 12 Languages
Xiaoyu Shen
Akari Asai
Bill Byrne
Adria de Gispert
98
8
0
16 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with
  Foundation Models
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
118
34
0
15 May 2023
Schema-adaptable Knowledge Graph Construction
Schema-adaptable Knowledge Graph Construction
Hongbin Ye
Honghao Gui
Xin Xu
Xi Chen
Huajun Chen
Ningyu Zhang
116
4
0
15 May 2023
Recyclable Tuning for Continual Pre-training
Recyclable Tuning for Continual Pre-training
Yujia Qin
Cheng Qian
Xu Han
Yankai Lin
Huadong Wang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
CLL
71
13
0
15 May 2023
Unsupervised Sentence Representation Learning with Frequency-induced
  Adversarial Tuning and Incomplete Sentence Filtering
Unsupervised Sentence Representation Learning with Frequency-induced Adversarial Tuning and Incomplete Sentence Filtering
Bing Wang
Ximing Li
Zhiyao Yang
Yuanyuan Guan
Jiayin Li
Sheng-sheng Wang
77
7
0
15 May 2023
Similarity-weighted Construction of Contextualized Commonsense Knowledge
  Graphs for Knowledge-intense Argumentation Tasks
Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks
Moritz Plenz
Juri Opitz
Philipp Heinisch
Philipp Cimiano
Anette Frank
95
9
0
15 May 2023
KEPR: Knowledge Enhancement and Plausibility Ranking for Generative
  Commonsense Question Answering
KEPR: Knowledge Enhancement and Plausibility Ranking for Generative Commonsense Question Answering
Zhifeng Li
Bowei Zou
Yifan Fan
Yu Hong
76
3
0
15 May 2023
Symbol tuning improves in-context learning in language models
Symbol tuning improves in-context learning in language models
Jerry W. Wei
Le Hou
Andrew Kyle Lampinen
Xiangning Chen
Da Huang
...
Xinyun Chen
Yifeng Lu
Denny Zhou
Tengyu Ma
Quoc V. Le
LRM
90
80
0
15 May 2023
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and Feedback
Shang-ling Hsu
Raj Sanjay Shah
Prathik Senthil
Zahra Ashktorab
Casey Dugan
Werner Geyer
Diyi Yang
121
24
0
15 May 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
Le Xue
Ning Yu
Shu Zhen Zhang
Artemis Panagopoulou
Junnan Li
...
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
127
130
0
14 May 2023
Learning to Generalize for Cross-domain QA
Learning to Generalize for Cross-domain QA
Yingjie Niu
Linyi Yang
Ruihai Dong
Yue Zhang
96
6
0
14 May 2023
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Chunhui Zhang
Li Liu
Yawen Cui
Guanjie Huang
Weilin Lin
Yiqian Yang
Yuehong Hu
VLM
102
101
0
14 May 2023
Learning to Simulate Natural Language Feedback for Interactive Semantic
  Parsing
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing
Hao Yan
Saurabh Srivastava
Yintao Tai
Sida I. Wang
Wen-tau Yih
Ziyu Yao
73
19
0
14 May 2023
CodeT5+: Open Code Large Language Models for Code Understanding and
  Generation
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Yue Wang
Hung Le
Akhilesh Deepak Gotmare
Nghi D. Q. Bui
Junnan Li
Steven C. H. Hoi
ALM
158
504
0
13 May 2023
Scalable Educational Question Generation with Pre-trained Language
  Models
Scalable Educational Question Generation with Pre-trained Language Models
Sahan Bulathwela
Hamze Muse
Emine Yilmaz
AI4EdELM
64
23
0
13 May 2023
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain
  Dialogue Systems
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems
Sarik Ghazarian
Yijia Shao
Rujun Han
Aram Galstyan
Nanyun Peng
86
7
0
12 May 2023
What are the Desired Characteristics of Calibration Sets? Identifying
  Correlates on Long Form Scientific Summarization
What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization
Griffin Adams
Bichlien H. Nguyen
Jake A. Smith
Yingce Xia
Shufang Xie
Anna Ostropolets
Budhaditya Deb
Yuan Chen
Tristan Naumann
Noémie Elhadad
102
8
0
12 May 2023
Surfacing Biases in Large Language Models using Contrastive Input
  Decoding
Surfacing Biases in Large Language Models using Contrastive Input Decoding
G. Yona
Or Honovich
Itay Laish
Roee Aharoni
65
12
0
12 May 2023
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
ZARA: Improving Few-Shot Self-Rationalization for Small Language Models
Wei-Lin Chen
An-Zi Yen
Cheng-Kuang Wu
Hen-Hsen Huang
Hsin-Hsi Chen
ReLMLRM
54
11
0
12 May 2023
Open-WikiTable: Dataset for Open Domain Question Answering with Complex
  Reasoning over Table
Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table
Sunjun Kweon
Yeonsu Kwon
Seonhee Cho
Yohan Jo
E. Choi
LMTDRALM
70
23
0
12 May 2023
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving
  Human Utility of Free-Text Rationales
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales
Brihi Joshi
Ziyi Liu
Sahana Ramnath
Aaron Chan
Zhewei Tong
Shaoliang Nie
Qifan Wang
Yejin Choi
Xiang Ren
HAILRM
95
35
0
11 May 2023
Think Twice: Measuring the Efficiency of Eliminating Prediction
  Shortcuts of Question Answering Models
Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models
Lukávs Mikula
Michal vStefánik
Marek Petrovivc
Petr Sojka
79
4
0
11 May 2023
InstructBLIP: Towards General-purpose Vision-Language Models with
  Instruction Tuning
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
Wenliang Dai
Junnan Li
Dongxu Li
A. M. H. Tiong
Junqi Zhao
Weisheng Wang
Boyang Albert Li
Pascale Fung
Steven C. H. Hoi
MLLMVLM
279
2,105
0
11 May 2023
Bot or Human? Detecting ChatGPT Imposters with A Single Question
Bot or Human? Detecting ChatGPT Imposters with A Single Question
Hong Wang
Xuan Luo
Weizhi Wang
Xifeng Yan
DeLMO
70
27
0
10 May 2023
Privacy-Preserving Prompt Tuning for Large Language Model Services
Privacy-Preserving Prompt Tuning for Large Language Model Services
Yansong Li
Zhixing Tan
Yang Liu
SILMVLM
117
69
0
10 May 2023
Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems
  using Differentially Private Language Models
Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems using Differentially Private Language Models
Aldo G. Carranza
Rezsa Farahani
Natalia Ponomareva
Alexey Kurakin
Matthew Jagielski
Milad Nasr
SyDa
87
7
0
10 May 2023
Multilingual LLMs are Better Cross-lingual In-context Learners with
  Alignment
Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
Eshaan Tanwar
Subhabrata Dutta
Manish Borthakur
Tanmoy Chakraborty
107
57
0
10 May 2023
Multi-hop Commonsense Knowledge Injection Framework for Zero-Shot
  Commonsense Question Answering
Multi-hop Commonsense Knowledge Injection Framework for Zero-Shot Commonsense Question Answering
Xin Guan
Biwei Cao
Qingqing Gao
Zheng Yin
Bo Liu
Jiuxin Cao
81
5
0
10 May 2023
Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact
  Verification
Decker: Double Check with Heterogeneous Knowledge for Commonsense Fact Verification
Anni Zou
Zhuosheng Zhang
Hai Zhao
HILM
79
6
0
10 May 2023
CodeIE: Large Code Generation Models are Better Few-Shot Information
  Extractors
CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors
Peng Li
Tianxiang Sun
Qiong Tang
Hang Yan
Yuanbin Wu
Xuanjing Huang
Technology
SyDa
84
77
0
09 May 2023
An Exploration of Encoder-Decoder Approaches to Multi-Label
  Classification for Legal and Biomedical Text
An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text
Yova Kementchedjhieva
Ilias Chalkidis
96
24
0
09 May 2023
Consistent Text Categorization using Data Augmentation in e-Commerce
Consistent Text Categorization using Data Augmentation in e-Commerce
G. Horowitz
Stav Yanovsky Daye
Noa Avigdor-Elgrabli
Ariel Raviv
71
4
0
09 May 2023
The Vault: A Comprehensive Multilingual Dataset for Advancing Code
  Understanding and Generation
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Dũng Nguyễn Mạnh
Nam Le Hai
An Dau
A. Nguyen
Khanh N. Nghiem
Jingnan Guo
Nghi D. Q. Bui
94
18
0
09 May 2023
Distilling Script Knowledge from Large Language Models for Constrained
  Language Planning
Distilling Script Knowledge from Large Language Models for Constrained Language Planning
Siyu Yuan
Jiangjie Chen
Ziquan Fu
Xuyang Ge
Soham Shah
C. R. Jankowski
Yanghua Xiao
Deqing Yang
116
56
0
09 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with
  Large Language Models
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Liang Lin
112
41
0
09 May 2023
Recommender Systems with Generative Retrieval
Recommender Systems with Generative Retrieval
Shashank Rajput
Nikhil Mehta
Anima Singh
Raghunandan H. Keshavan
T. Vu
...
Vinh Q. Tran
Jonah Samost
Maciej Kula
Ed H. Chi
M. Sathiamoorthy
RALM3DV
103
90
0
08 May 2023
Web Content Filtering through knowledge distillation of Large Language
  Models
Web Content Filtering through knowledge distillation of Large Language Models
Tamás Vörös
Sean P. Bergeron
Konstantin Berlin
85
7
0
08 May 2023
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge
  Distillation
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation
Rongzhi Zhang
Jiaming Shen
Tianqi Liu
Jia-Ling Liu
Michael Bendersky
Marc Najork
Chao Zhang
106
20
0
08 May 2023
GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from
  Doctor-Patient Conversations through Fine-tuning and In-context Learning
GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning
Xiangru Tang
Andrew Tran
Jeffrey Tan
Mark B. Gerstein
71
7
0
08 May 2023
The Current State of Summarization
The Current State of Summarization
Fabian Retkowski
83
6
0
08 May 2023
Differentially Private Attention Computation
Differentially Private Attention Computation
Yeqi Gao
Zhao Song
Xin Yang
92
21
0
08 May 2023
Enhancing Knowledge Graph Construction Using Large Language Models
Enhancing Knowledge Graph Construction Using Large Language Models
Milena Trajanoska
Riste Stojanov
D. Trajanov
92
56
0
08 May 2023
Improving Cross-Task Generalization with Step-by-Step Instructions
Improving Cross-Task Generalization with Step-by-Step Instructions
Yang Wu
Yanyan Zhao
Zhongyang Li
Bing Qin
Kai Xiong
LRMALM
80
9
0
08 May 2023
Leveraging Synthetic Targets for Machine Translation
Leveraging Synthetic Targets for Machine Translation
Sarthak Mittal
Oleksii Hrinchuk
Oleksii Kuchaiev
72
2
0
07 May 2023
Analysis of Climate Campaigns on Social Media using Bayesian Model
  Averaging
Analysis of Climate Campaigns on Social Media using Bayesian Model Averaging
Tunazzina Islam
Ruqi Zhang
Dan Goldwasser
80
4
0
06 May 2023
Two to Five Truths in Non-Negative Matrix Factorization
Two to Five Truths in Non-Negative Matrix Factorization
John M. Conroy
Neil P. Molino
Brian Baughman
Rod Gomez
Ryan Kaliszewski
Nicholas A. Lines
81
0
0
06 May 2023
Previous
123...132133134...198199200
Next