Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,966 papers shown
Title
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Ting Jiang
Shaohan Huang
Shengyue Luo
Zihan Zhang
Haizhen Huang
...
Weiwei Deng
Feng Sun
Qi Zhang
Deqing Wang
Fuzhen Zhuang
92
31
0
20 May 2024
Imp: Highly Capable Large Multimodal Models for Mobile Devices
Zhenwei Shao
Zhou Yu
Jun Yu
Xuecheng Ouyang
Lihao Zheng
Zhenbiao Gai
Mingyang Wang
Jiajun Ding
78
11
0
20 May 2024
Continuous Sign Language Recognition with Adapted Conformer via Unsupervised Pretraining
Neena Aloysius
M. Geetha
Prema Nedungadi
SLR
72
3
0
20 May 2024
A review on the use of large language models as virtual tutors
Silvia García-Méndez
Francisco de Arriba-Pérez
Maria del Carmen Lopez-Perez
LLMAG
3DV
AI4Ed
VLM
KELM
62
19
0
20 May 2024
Multiple-Choice Questions are Efficient and Robust LLM Evaluators
Ziyin Zhang
Zhaokun Jiang
Lizhen Xu
Hong-ping Hao
Rui Wang
100
19
0
20 May 2024
A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers
Tom Roth
Inigo Jauregi Unanue
A. Abuadbba
Massimo Piccardi
AAML
SILM
89
1
0
20 May 2024
DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation
Kounianhua Du
Jizheng Chen
Jianghao Lin
Yunjia Xi
Hangyu Wang
Xinyi Dai
Bo Chen
Ruiming Tang
Weinan Zhang
90
7
0
20 May 2024
Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities
Junqi Wang
Chunhui Zhang
Jiapeng Li
Yuxi Ma
Lixing Niu
Jiaheng Han
Yujia Peng
Yixin Zhu
Lifeng Fan
ELM
ALM
96
4
0
20 May 2024
Learning Regularities from Data using Spiking Functions: A Theory
Canlin Zhang
Xiuwen Liu
78
0
0
19 May 2024
Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications
Subhankar Maity
Aniket Deroy
Sudeshna Sarkar
AI4Ed
ELM
91
12
0
19 May 2024
"Previously on ..." From Recaps to Story Summarization
Aditya Kumar Singh
Dhruv Srivastava
Makarand Tapaswi
87
1
0
19 May 2024
MICap: A Unified Model for Identity-aware Movie Descriptions
Haran Raajesh
Naveen Reddy Desanur
Zeeshan Khan
Makarand Tapaswi
82
4
0
19 May 2024
Efficient Prompt Tuning by Multi-Space Projection and Prompt Fusion
Pengxiang Lan
Enneng Yang
Yuting Liu
Guibing Guo
Linying Jiang
Jianzhe Zhao
Xingwei Wang
VLM
AAML
88
1
0
19 May 2024
Unifying 3D Vision-Language Understanding via Promptable Queries
Ziyu Zhu
Zhuofan Zhang
Xiaojian Ma
Xuesong Niu
Yixin Chen
Baoxiong Jia
Zhidong Deng
Siyuan Huang
Qing Li
118
32
0
19 May 2024
EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations
Chiyu Zhang
Yifei Sun
Minghao Wu
Jun Chen
Jie Lei
...
Angli Liu
Ji Zhu
Sem Park
Ning Yao
Bo Long
OffRL
118
6
0
19 May 2024
LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions
Victor Agostinelli
Sanghyun Hong
Lizhong Chen
KELM
81
2
0
18 May 2024
CoLay: Controllable Layout Generation through Multi-conditional Latent Diffusion
Chin-Yi Cheng
Ruiqi Gao
Forrest Huang
Yang Li
DiffM
74
2
0
18 May 2024
Identifying and Aligning Medical Claims Made on Social Media with Medical Evidence
Anthony James Hughes
Xingyi Song
61
1
0
18 May 2024
LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs
Yongrae Jo
Seongyun Lee
Minju Seo
Sung Ju Hwang
Moontae Lee
69
3
0
18 May 2024
Towards Modular LLMs by Building and Reusing a Library of LoRAs
O. Ostapenko
Zhan Su
Edoardo Ponti
Laurent Charlin
Nicolas Le Roux
Matheus Pereira
Lucas Caccia
Alessandro Sordoni
MoMe
111
37
0
18 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications
S. S. Sengar
Affan Bin Hasan
Sanjay Kumar
Fiona Carroll
MedIm
80
75
0
17 May 2024
The Future of Large Language Model Pre-training is Federated
Lorenzo Sani
Alexandru Iacob
Zeyu Cao
Bill Marino
Yan Gao
...
Wanru Zhao
William F. Shen
Preslav Aleksandrov
Xinchi Qiu
Nicholas D. Lane
AI4CE
163
21
0
17 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
118
79
0
17 May 2024
INDUS: Effective and Efficient Language Models for Scientific Applications
Bishwaranjan Bhattacharjee
Aashka Trivedi
Masayasu Muraoka
Muthukumaran Ramasubramanian
Takuma Udagawa
...
Peter W. J. Staar
S. Vahidinia
Ryan McGranaghan
A. Mehrabian
Tsendgar Lee
AI4CE
99
6
0
17 May 2024
SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation
Ziyao Xu
Houfeng Wang
53
2
0
17 May 2024
Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi
Zeming Liu
Li Du
Yuxuan Wang
Hongru Wang
Yuhang Guo
Tong Ruan
Jie Xu
Shaoting Zhang
LM&MA
ELM
107
2
0
17 May 2024
RDRec: Rationale Distillation for LLM-based Recommendation
Xinfeng Wang
Jin Cui
Yoshimi Suzuki
Fumiyo Fukumoto
LRM
119
13
0
17 May 2024
Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Anwoy Chatterjee
Eshaan Tanwar
Subhabrata Dutta
Tanmoy Chakraborty
LRM
125
11
0
17 May 2024
Towards Better Question Generation in QA-based Event Extraction
Zijin Hong
Jian Liu
112
9
0
17 May 2024
In-context Contrastive Learning for Event Causality Identification
Chao Liang
Wei Xiang
Bang Wang
73
1
0
17 May 2024
Multi-Evidence based Fact Verification via A Confidential Graph Neural Network
Yuqing Lan
Zhenghao Liu
Yu Gu
Xiaoyuan Yi
Xiaohua Li
Liner Yang
Ge Yu
97
1
0
17 May 2024
A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision
Charles Raude
Prajwal K R
Liliane Momeni
Hannah Bull
Samuel Albanie
Andrew Zisserman
Gül Varol
SLR
110
5
0
16 May 2024
Keep It Private: Unsupervised Privatization of Online Text
Calvin Bao
Marine Carpuat
DeLMO
94
3
0
16 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
136
21
0
16 May 2024
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks
Xuanfan Ni
Piji Li
ELM
LRM
69
9
0
16 May 2024
Low-Rank Adaptation of Time Series Foundational Models for Out-of-Domain Modality Forecasting
Divij Gupta
Anubhav Bhatti
Surajsinh Parmar
Chen Dan
Yuwei Liu
Bingjie Shen
San Lee
AI4TS
78
4
0
16 May 2024
MarkLLM: An Open-Source Toolkit for LLM Watermarking
Leyi Pan
Aiwei Liu
Zhiwei He
Zitian Gao
Xuandong Zhao
...
Shuliang Liu
Xuming Hu
Lijie Wen
Irwin King
Philip S. Yu
144
37
0
16 May 2024
Relative Counterfactual Contrastive Learning for Mitigating Pretrained Stance Bias in Stance Detection
Jiarui Zhang
Shaojuan Wu
Xiaowang Zhang
Zhiyong Feng
92
0
0
16 May 2024
IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining
Dawei Feng
Yihai Zhang
Zhixuan Xu
SyDa
55
0
0
16 May 2024
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
Guangmin Zheng
Jin Wang
Xiaobing Zhou
Xuejie Zhang
LRM
63
2
0
16 May 2024
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
Zeyi Zhang
Tenglong Ao
Yuyao Zhang
Qingzhe Gao
Chuan Lin
Baoquan Chen
Libin Liu
SLR
79
17
0
16 May 2024
NIFTY Financial News Headlines Dataset
Raeid Saqur
Ken Kato
Nicholas Vinden
Frank Rudzicz
AIFin
81
1
0
16 May 2024
Mitigating Text Toxicity with Counterfactual Generation
Milan Bhan
Jean-Noel Vittaut
Nina Achache
Victor Legrand
Nicolas Chesneau
A. Blangero
Juliette Murris
Marie-Jeanne Lesot
MedIm
217
0
0
16 May 2024
Prompting-based Synthetic Data Generation for Few-Shot Question Answering
Maximilian Schmidt
Andrea Bartezzaghi
Ngoc Thang Vu
SyDa
94
6
0
15 May 2024
A Survey of Generative Techniques for Spatial-Temporal Data Mining
Qianru Zhang
Haixin Wang
Cheng Long
Liangcai Su
Xingwei He
...
Tailin Wu
Hongzhi Yin
Siu-Ming Yiu
Qi Tian
Christian S. Jensen
AI4TS
95
9
0
15 May 2024
Sign of the Times: Evaluating the use of Large Language Models for Idiomaticity Detection
Dylan Phelps
Thomas Pickard
Maggie Mi
Edward Gow-Smith
Aline Villavicencio
84
4
0
15 May 2024
New Textual Corpora for Serbian Language Modeling
Mihailo Škorić
Nikola Janković
87
1
0
15 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
100
2
0
15 May 2024
A Systematic Analysis on the Temporal Generalization of Language Models in Social Media
Asahi Ushio
Jose Camacho-Collados
53
0
0
15 May 2024
Improving Transformers using Faithful Positional Encoding
Tsuyoshi Idé
Jokin Labaien
Pin-Yu Chen
70
0
0
15 May 2024
Previous
1
2
3
...
59
60
61
...
198
199
200
Next