Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,955 papers shown
Title
M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering
Anand Subramanian
Viktor Schlegel
Abhinav Ramesh Kashyap
Thanh-Tung Nguyen
Vijay Prakash Dwivedi
Stefan Winkler
ELM
LM&MA
AI4MH
66
3
0
06 Jun 2024
SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound
Rishit Dagli
Shivesh Prakash
Robert Wu
H. Khosravani
145
6
0
06 Jun 2024
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
Zeyue Tian
Zhaoyang Liu
Ruibin Yuan
Jiahao Pan
Xiaoqiang Huang
Xu Tan
Xu Tan
Qifeng Chen
Yu Guo
VGen
278
17
0
06 Jun 2024
NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Shuo Huang
William MacLean
Xiaoxi Kang
Qiongkai Xu
Zhuang Li
Xingliang Yuan
Zhuang Li
Lizhen Qu
141
0
0
06 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
84
1
0
05 Jun 2024
Wings: Learning Multimodal LLMs without Text-only Forgetting
Yi-Kai Zhang
Shiyin Lu
Yang Li
Yanqing Ma
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
VLM
131
10
0
05 Jun 2024
PatentEval: Understanding Errors in Patent Generation
You Zuo
Kim Gerdes
Eric Villemonte de la Clergerie
Benoît Sagot
72
1
0
05 Jun 2024
Document-level Claim Extraction and Decontextualisation for Fact-Checking
Zhenyun Deng
Michael Schlichtkrull
Andreas Vlachos
HILM
88
3
0
05 Jun 2024
Reconstructing training data from document understanding models
Jérémie Dentan
Arnaud Paran
A. Shabou
AAML
SyDa
80
1
0
05 Jun 2024
Missci: Reconstructing Fallacies in Misrepresented Science
Max Glockner
Yufang Hou
Preslav Nakov
Iryna Gurevych
99
6
0
05 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
38
1
0
05 Jun 2024
PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs
Charlie Hou
Akshat Shrivastava
Hongyuan Zhan
Rylan Conway
Trang Le
Adithya Sagar
Giulia Fanti
Daniel Lazar
118
15
0
05 Jun 2024
Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models
Peijie Dong
Lujun Li
Zhenheng Tang
Xiang Liu
Xinglin Pan
Qiang-qiang Wang
Xiaowen Chu
159
33
0
05 Jun 2024
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
Wentao Guo
Jikai Long
Yimeng Zeng
Zirui Liu
Xinyu Yang
...
Osbert Bastani
Christopher De Sa
Xiaodong Yu
Beidi Chen
Zhaozhuo Xu
100
21
0
05 Jun 2024
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
Rongzhi Zhang
Jiaming Shen
Tianqi Liu
Haorui Wang
Zhen Qin
Feng Han
Jialu Liu
Simon Baumgartner
Michael Bendersky
Chao Zhang
91
8
0
05 Jun 2024
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
Peng-Fei Xing
Ning Wang
Jianbo Ouyang
Zechao Li
DiffM
79
1
0
05 Jun 2024
DREW : Towards Robust Data Provenance by Leveraging Error-Controlled Watermarking
Mehrdad Saberi
Vinu Sankar Sadasivan
Arman Zarei
Hessam Mahdavifar
Soheil Feizi
68
1
0
05 Jun 2024
Loki: Low-Rank Keys for Efficient Sparse Attention
Prajwal Singhania
Siddharth Singh
Shwai He
Soheil Feizi
A. Bhatele
115
22
0
04 Jun 2024
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices
Ruslan Svirschevski
Avner May
Zhuoming Chen
Beidi Chen
Zhihao Jia
Max Ryabinin
140
21
0
04 Jun 2024
Block Transformer: Global-to-Local Language Modeling for Fast Inference
Namgyu Ho
Sangmin Bae
Taehyeon Kim
Hyunjik Jo
Yireun Kim
Tal Schuster
Adam Fisch
James Thorne
Se-Young Yun
116
9
0
04 Jun 2024
A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting
Remi Genet
Hugo Inzirillo
AI4TS
140
47
0
04 Jun 2024
Landscape-Aware Growing: The Power of a Little LAG
Stefani Karp
Nikunj Saunshi
Sobhan Miryoosefi
Sashank J. Reddi
Sanjiv Kumar
82
1
0
04 Jun 2024
Learning to Edit Visual Programs with Self-Supervision
R. K. Jones
Renhao Zhang
Aditya Ganeshan
Daniel E. Ritchie
SSL
86
3
0
04 Jun 2024
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
Guangliang Liu
Haitao Mao
Bochuan Cao
Zhiyu Xue
K. Johnson
Jiliang Tang
Rongrong Wang
LRM
112
10
0
04 Jun 2024
An Independence-promoting Loss for Music Generation with Language Models
Jean-Marie Lemercier
Simon Rouard
Jade Copet
Yossi Adi
Alexandre Défossez
162
1
0
04 Jun 2024
Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor
Chuankai Xu
Dongming Zhao
Bo Wang
Hanwen Xing
RALM
37
0
0
04 Jun 2024
Description Boosting for Zero-Shot Entity and Relation Classification
Gabriele Picco
Leopold Fuchs
Marcos Martínez Galindo
Alberto Purpura
V. López
Hoang Thanh Lam
VLM
64
3
0
04 Jun 2024
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Andi Han
Jiaxiang Li
Wei Huang
Mingyi Hong
Akiko Takeda
Pratik Jawanpuria
Bamdev Mishra
115
16
0
04 Jun 2024
LongSSM: On the Length Extension of State-space Models in Language Modelling
Shida Wang
100
1
0
04 Jun 2024
Multimodal Reasoning with Multimodal Knowledge Graph
Junlin Lee
Yequan Wang
Jing Li
Min Zhang
102
23
0
04 Jun 2024
Zyda: A 1.3T Dataset for Open Language Modeling
Yury Tokpanov
Beren Millidge
Paolo Glorioso
Jonathan Pilault
Adam Ibrahim
James Whittington
Quentin Anthony
95
2
0
04 Jun 2024
Conditional Language Learning with Context
X. Zhang
Miao Li
Ji Wu
96
4
0
04 Jun 2024
Dishonesty in Helpful and Harmless Alignment
Youcheng Huang
Jingkun Tang
Duanyu Feng
Zheng Zhang
Wenqiang Lei
Jiancheng Lv
Anthony G. Cohn
LLMSV
100
4
0
04 Jun 2024
Explicitly Encoding Structural Symmetry is Key to Length Generalization in Arithmetic Tasks
Mahdi Sabbaghi
George Pappas
Hamed Hassani
Surbhi Goel
120
6
0
04 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
102
1
0
04 Jun 2024
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
Marianna Nezhurina
Lucia Cipolina-Kun
Mehdi Cherti
J. Jitsev
LLMAG
LRM
ELM
ReLM
199
37
0
04 Jun 2024
Prototypical Transformer as Unified Motion Learners
Cheng Han
Yawen Lu
Guohao Sun
James Liang
Zhiwen Cao
...
S. Dianat
Raghuveer M. Rao
Tong Geng
Zhiqiang Tao
Dongfang Liu
ViT
96
3
0
03 Jun 2024
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM
Quandong Wang
Yuxuan Yuan
Xiaoyu Yang
Ruike Zhang
Kang Zhao
Wei Liu
Jian Luan
Daniel Povey
Bin Wang
86
0
0
03 Jun 2024
Mixture of Rationale: Multi-Modal Reasoning Mixture for Visual Question Answering
Tao Li
Linjun Shou
Xuejun Liu
75
0
0
03 Jun 2024
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
Weichao Zhao
Hao Feng
Qi Liu
Jingqun Tang
Shubo Wei
...
Lei Liao
Yongjie Ye
Hao Liu
Houqiang Li
Can Huang
LMTD
102
24
0
03 Jun 2024
On the Nonlinearity of Layer Normalization
Yunhao Ni
Yuxin Guo
Junlong Jia
Lei Huang
175
5
0
03 Jun 2024
A Survey of Generative Information Retrieval
Tzu-Lin Kuo
Tzu-Wei Chiu
Tzung-Sheng Lin
Sheng-Yang Wu
Chao-Wei Huang
Yun-Nung Chen
SyDa
134
2
0
03 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
128
8
0
03 Jun 2024
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs
Fatemeh Shiri
Van Nguyen
Farhad Moghimifar
John Yoo
Gholamreza Haffari
Yuan-Fang Li
ReLM
142
6
0
03 Jun 2024
Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media
Nikhil Mehta
Dan Goldwasser
57
0
0
03 Jun 2024
Predicting drug-gene relations via analogy tasks with word embeddings
Hiroaki Yamagiwa
Ryoma Hashimoto
Kiwamu Arakane
Ken Murakami
Shou Soeda
Momose Oyama
Yihua Zhu
Mariko Okada
Hidetoshi Shimodaira
205
0
0
03 Jun 2024
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models
Kaixin Lan
Tao Fang
Derek F. Wong
Yabo Xu
Lidia S. Chao
Cecilia G. Zhao
116
4
0
02 Jun 2024
MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization
Aozhong Zhang
Naigang Wang
Yanxia Deng
Xin Li
Zi Yang
Penghang Yin
MQ
86
8
0
02 Jun 2024
Transforming Computer Security and Public Trust Through the Exploration of Fine-Tuning Large Language Models
Garrett Crumrine
I. Alsmadi
Jesus Guerrero
Yuvaraj Munian
89
1
0
02 Jun 2024
Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions
Yihan Wu
Ruibo Chen
Zhengmian Hu
Yanshuo Chen
Junfeng Guo
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
110
5
0
02 Jun 2024
Previous
1
2
3
...
55
56
57
...
198
199
200
Next