ResearchTrend.AI
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 9,984 papers shown
Title
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Tim Dettmers
Ruslan Svirschevski
Vage Egiazarian
Denis Kuznedelev
Elias Frantar
Saleh Ashkboos
Alexander Borzunov
Torsten Hoefler
Dan Alistarh
MQ
83
257
0
05 Jun 2023
Interactive Editing for Text Summarization
Yujia Xie
Xun Wang
Si-Qing Chen
Wayne Xiong
Pengcheng He
KELM
337
2
0
05 Jun 2023
Classification of Edge-dependent Labels of Nodes in Hypergraphs
Minyoung Choe
Sunwoo Kim
Jaemin Yoo
Kijung Shin
GNN
93
13
0
05 Jun 2023
Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Junling Liu
Peilin Zhou
Yining Hua
Dading Chong
Zhongyu Tian
...
Helin Wang
Chenyu You
Zhenhua Guo
Lei Zhu
Michael Lingzhi Li
LM&MA, ELM
155
80
0
05 Jun 2023
Using Sequences of Life-events to Predict Human Lives
Germans Savcisens
Tina Eliassi-Rad
L. K. Hansen
L. Mortensen
Lau Lilleholt
Anna Rogers
Ingo Zettler
Sune Lehmann
AI4TS
98
46
0
05 Jun 2023
KNOW How to Make Up Your Mind! Adversarially Detecting and Alleviating Inconsistencies in Natural Language Explanations
Myeongjun Jang
Bodhisattwa Prasad Majumder
Julian McAuley
Thomas Lukasiewicz
Oana-Maria Camburu
AAML
51
4
0
05 Jun 2023
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
Ali Modarressi
Mohsen Fayyaz
Ehsan Aghazadeh
Yadollah Yaghoobzadeh
Mohammad Taher Pilehvar
104
28
0
05 Jun 2023
On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Alham Fikri Aji
Genta Indra Winata
Radityo Eko Prasojo
Phil Blunsom
A. Kuncoro
72
8
0
05 Jun 2023
Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance
Jinwoo Kim
Tien Dat Nguyen
Ayhan Suleymanzade
Hyeokjun An
Seunghoon Hong
123
24
0
05 Jun 2023
Leveraging Large Language Models for Topic Classification in the Domain of Public Affairs
Alejandro Peña
Aythami Morales
Julian Fierrez
Ignacio Serna
J. Ortega-Garcia
Iñigo Puente
Jorge Cordova
Gonzalo Cordova
87
20
0
05 Jun 2023
Learning to Substitute Spans towards Improving Compositional Generalization
Zhaoyi Li
Ying Wei
Defu Lian
94
10
0
05 Jun 2023
Enhancing Language Representation with Constructional Information for Natural Language Understanding
Lvxiaowei Xu
Jian Wu
Jiawei Peng
Zhilin Gong
Ming Cai
Tianxiang Wang
71
3
0
05 Jun 2023
PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients' Problems and Data Augmentation with Black-box Large Language Models
Hao Li
Yuping Wu
Viktor Schlegel
Riza Batista-Navarro
Thanh-Tung Nguyen
Abhinav Ramesh Kashyap
Xiaojun Zeng
Daniel Beck
Stefan Winkler
Goran Nenadic
LM&MA
85
9
0
05 Jun 2023
CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels
Hyunsoo Cho
Youna Kim
Sang-goo Lee
54
3
0
05 Jun 2023
Uncertainty in Natural Language Processing: Sources, Quantification, and Applications
Mengting Hu
Zhen Zhang
Shiwan Zhao
Minlie Huang
Bingzhe Wu
BDL
103
39
0
05 Jun 2023
Enhance Diffusion to Improve Robust Generalization
Jianhui Sun
Sanchit Sinha
Aidong Zhang
87
4
0
05 Jun 2023
Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications
Han Xie
Da Zheng
Jun Ma
Houyu Zhang
V. Ioannidis
...
Sheng Wang
Carl Yang
Yi Xu
Belinda Zeng
Trishul Chilimbi
AI4CE
117
40
0
05 Jun 2023
Prompt to be Consistent is Better than Self-Consistent? Few-Shot and Zero-Shot Fact Verification with Pre-trained Language Models
Fengzhu Zeng
Wei Gao
82
7
0
05 Jun 2023
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
ELM
182
334
0
05 Jun 2023
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
Yizhe Zhang
Jiatao Gu
Zhuofeng Wu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
DiffM
135
28
0
05 Jun 2023
SamToNe: Improving Contrastive Loss for Dual Encoder Retrieval Models with Same Tower Negatives
Fedor Moiseev
Gustavo Hernández Ábrego
Peter Dornbach
I. Zitouni
Enrique Alfonseca
Zhe Dong
79
7
0
05 Jun 2023
Deep learning powered real-time identification of insects using citizen science data
Shivani Chiranjeevi
Mojdeh Sadaati
Ziqing Deng
Jayanth Koushik
Talukder Z Jubery
...
Aarti Singh
Ashutosh Kumar Singh
Soumik Sarkar
Arti Singh
Baskar Ganapathysubramanian
23
14
0
04 Jun 2023
Modeling Cross-Cultural Pragmatic Inference with Codenames Duet
Omar Shaikh
Caleb Ziems
William B. Held
Aryan Pariani
Fred Morstatter
Diyi Yang
85
14
0
04 Jun 2023
Commonsense Knowledge Transfer for Pre-trained Language Models
Wangchunshu Zhou
Ronan Le Bras
Yejin Choi
KELM, LRM
77
4
0
04 Jun 2023
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Wangchunshu Zhou
Ronan Le Bras
Yejin Choi
58
1
0
04 Jun 2023
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Momchil Hardalov
Pepa Atanasova
Todor Mihaylov
G. Angelova
K. Simov
P. Osenova
Ves Stoyanov
Ivan Koychev
Preslav Nakov
Dragomir R. Radev
ELM, FedML
88
4
0
04 Jun 2023
Exploring the Impact of Model Scaling on Parameter-Efficient Tuning
Yusheng Su
Chi-Min Chan
Jiali Cheng
Yujia Qin
Yankai Lin
...
Ning Ding
Xingzhi Sun
Guotong Xie
Zhiyuan Liu
Maosong Sun
104
6
0
04 Jun 2023
Exploring and Verbalizing Academic Ideas by Concept Co-occurrence
Yi Xu
Shuqian Sheng
Bo Xue
Luoyi Fu
Xinbing Wang
Cheng Zhou
65
9
0
04 Jun 2023
OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models
Changhun Lee
Jungyu Jin
Taesu Kim
Hyungjun Kim
Eunhyeok Park
MQ
96
62
0
04 Jun 2023
Sen2Pro: A Probabilistic Perspective to Sentence Embedding from Pre-trained Language Model
Lingfeng Shen
Haiyun Jiang
Lemao Liu
Shuming Shi
68
2
0
04 Jun 2023
Detector Guidance for Multi-Object Text-to-Image Generation
Luping Liu
Zijian Zhang
Yi Ren
Rongjie Huang
Xiang Yin
Zhou Zhao
DiffM
96
10
0
04 Jun 2023
TART: Improved Few-shot Text Classification Using Task-Adaptive Reference Transformation
Shuo Lei
Xuchao Zhang
Jianfeng He
Fanglan Chen
Chang-Tien Lu
VLM
41
16
0
03 Jun 2023
Benchmarking Robustness of Adaptation Methods on Pre-trained Vision-Language Models
Shuo Chen
Jindong Gu
Zhen Han
Yunpu Ma
Philip Torr
Volker Tresp
VPVLM, VLM
129
21
0
03 Jun 2023
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos
S. Symeonidis
Pranav Kasela
G. Pasi
LM&MA
59
13
0
03 Jun 2023
MultiLegalPile: A 689GB Multilingual Legal Corpus
Joel Niklaus
Veton Matoshi
Matthias Sturmer
Ilias Chalkidis
Daniel E. Ho
AILaw, ELM
135
44
0
03 Jun 2023
Span Identification of Epistemic Stance-Taking in Academic Written English
Masaki Eguchi
K. Kyle
48
6
0
03 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen, DiffM
123
341
0
03 Jun 2023
Probabilistic Adaptation of Text-to-Video Models
Mengjiao Yang
Yilun Du
Bo Dai
Dale Schuurmans
J. Tenenbaum
Pieter Abbeel
VGen, DiffM
139
26
0
02 Jun 2023
5IDER: Unified Query Rewriting for Steering, Intent Carryover, Disfluencies, Entity Carryover and Repair
Jiarui Lu
Bo-Hsiang Tseng
Joel Ruben Antony Moniz
Site Li
Xueyun Zhu
Hong-ye Yu
Murat Akbacak
81
1
0
02 Jun 2023
DocFormerv2: Local Features for Document Understanding
Srikar Appalaraju
Peng Tang
Qi Dong
Nishant Sankaran
Yichu Zhou
R. Manmatha
109
41
0
02 Jun 2023
Improving Generalization in Task-oriented Dialogues with Workflows and Action Plans
Stefania Raimondo
C. Pal
Xiaotian Liu
David Vazquez
Héctor Palacios
65
2
0
02 Jun 2023
TIES-Merging: Resolving Interference When Merging Models
Prateek Yadav
Derek Tam
Leshem Choshen
Colin Raffel
Joey Tianyi Zhou
MoMe
162
319
0
02 Jun 2023
The Information Pathways Hypothesis: Transformers are Dynamic Self-Ensembles
Md Shamim Hussain
Mohammed J Zaki
D. Subramanian
179
3
0
02 Jun 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu
Yushi Hu
Weijia Shi
Nouha Dziri
Alane Suhr
Prithviraj Ammanabrolu
Noah A. Smith
Mari Ostendorf
Hannaneh Hajishirzi
ALM
186
336
0
02 Jun 2023
Harnessing large-language models to generate private synthetic text
Alexey Kurakin
Natalia Ponomareva
Umar Syed
Liam MacDermed
Andreas Terzis
SILM, SyDa
85
42
0
02 Jun 2023
Learning from Partially Annotated Data: Example-aware Creation of Gap-filling Exercises for Language Learning
Semere Kiros Bitew
Johannes Deleu
A. Seza Doğruöz
Chris Develder
Thomas Demeester
57
2
0
02 Jun 2023
EmoUS: Simulating User Emotions in Task-Oriented Dialogues
Hsien-chin Lin
Shutong Feng
Christian Geishauser
Nurul Lubis
Carel van Niekerk
Michael Heck
Benjamin Ruppik
Renato Vukovic
Milica Gašić
71
12
0
02 Jun 2023
Enhancing the Protein Tertiary Structure Prediction by Multiple Sequence Alignment Generation
Le Zhang
Jiayang Chen
Tao Shen
Yu Li
S. Sun
58
5
0
02 Jun 2023
Data-Efficient French Language Modeling with CamemBERTa
Wissam Antoun
Benoît Sagot
Djamé Seddah
70
7
0
02 Jun 2023
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration
Aleksandra Piktus
Odunayo Ogundepo
Christopher Akiki
Akintunde Oladipo
Xinyu Crystina Zhang
Hailey Schoelkopf
Stella Biderman
Martin Potthast
Jimmy J. Lin
CVBM
80
10
0
02 Jun 2023