Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,901 papers shown
Title
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
148
0
0
13 Dec 2024
The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
Changan Chen
Juze Zhang
S. K. Lakshmikanth
Yusu Fang
Ruizhi Shao
Gordon Wetzstein
L. Fei-Fei
Ehsan Adeli
VGen
135
5
0
13 Dec 2024
Neptune: The Long Orbit to Benchmarking Long Video Understanding
Arsha Nagrani
Ruotong Wang
Ramin Mehran
Rachel Hornung
N. B. Gundavarapu
...
Boqing Gong
Cordelia Schmid
Mikhail Sirotenko
Yukun Zhu
Tobias Weyand
179
8
0
12 Dec 2024
Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy
Alistair Plum
Tharindu Ranasinghe
Christoph Purschke
126
3
0
12 Dec 2024
UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer
Delong Liu
Zhaohui Hou
Mingjie Zhan
Shihao Han
Zhicheng Zhao
Fei Su
VGen
111
0
0
12 Dec 2024
Neural Text Normalization for Luxembourgish using Real-Life Variation Data
Anne-Marie Lutgen
Alistair Plum
Christoph Purschke
Barbara Plank
109
1
0
12 Dec 2024
What Makes Cryptic Crosswords Challenging for LLMs?
Abdelrahman Sadallah
Daria Kotova
Ekaterina Kochmar
AAML
164
0
0
12 Dec 2024
SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization
Kwangryeol Park
Seulki Lee
88
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
212
2
0
12 Dec 2024
Code LLMs: A Taxonomy-based Survey
Nishat Raihan
Christian D. Newman
Marcos Zampieri
148
1
0
11 Dec 2024
DocSum: Domain-Adaptive Pre-training for Document Abstractive Summarization
Phan Phuong Mai Chau
Souhail Bakkali
Antoine Doucet
131
0
0
11 Dec 2024
NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language
Yuanyuan Liang
Tingyu Xie
Gan Peng
Zihao Huang
Yunshi Lan
Weining Qian
LLMAG
117
2
0
11 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
122
0
0
10 Dec 2024
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Xi Chen
Zhifei Zhang
He Zhang
Yuqian Zhou
Seunggeun Kim
...
Nanxuan Zhao
Yilin Wang
Hui Ding
Zhe Lin
Hengshuang Zhao
VGen
DiffM
187
29
0
10 Dec 2024
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models
Haoran Lian
Junmin Chen
Wei Huang
Yizhe Xiong
Wenping Hu
...
Hui Chen
Jianwei Niu
Zijia Lin
Fuzheng Zhang
Di Zhang
127
0
0
10 Dec 2024
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yize Zhang
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
120
2
0
10 Dec 2024
ArtFormer: Controllable Generation of Diverse 3D Articulated Objects
Jiayi Su
Youhe Feng
Zheng Li
Jinhua Song
Yangfan He
Botao Ren
Botian Xu
AI4CE
158
3
0
10 Dec 2024
AutoReason: Automatic Few-Shot Reasoning Decomposition
Arda Sevinc
A. Gumus
ReLM
LRM
102
0
0
09 Dec 2024
GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary
F. Almeman
Luis Espinosa-Anke
104
0
0
09 Dec 2024
KITE-DDI: A Knowledge graph Integrated Transformer Model for accurately predicting Drug-Drug Interaction Events from Drug SMILES and Biomedical Knowledge Graph
Azwad Tamir
Jiann-Shiun Yuan
95
0
0
08 Dec 2024
Taming Sensitive Weights : Noise Perturbation Fine-tuning for Robust LLM Quantization
Dongwei Wang
Huanrui Yang
MQ
184
1
0
08 Dec 2024
Flex Attention: A Programming Model for Generating Optimized Attention Kernels
Juechu Dong
Boyuan Feng
Driss Guessous
Yanbo Liang
Horace He
142
31
0
07 Dec 2024
SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision
Kangjie Zheng
Siyue Liang
Junwei Yang
Bin Feng
Zequn Liu
Wei Ju
Zhiping Xiao
Ming Zhang
164
2
0
07 Dec 2024
Diversity Over Quantity: A Lesson From Few Shot Relation Classification
Amir D. N. Cohen
Shauli Ravfogel
Shaltiel Shmidman
Yoav Goldberg
103
0
0
06 Dec 2024
SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization
Runsheng Bai
Qiang Liu
B. Liu
MQ
135
2
0
05 Dec 2024
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Ruili Feng
Han Zhang
Zhantao Yang
Jie Xiao
Zhilei Shu
Zhiheng Liu
Andy Zheng
Yukun Huang
Yu Liu
Han Zhang
VGen
154
20
0
04 Dec 2024
FANAL -- Financial Activity News Alerting Language Modeling Framework
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
Hari Nalluri
AIFin
113
0
0
04 Dec 2024
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
143
2
0
04 Dec 2024
AntLM: Bridging Causal and Masked Language Models
Xinru Yu
Bin Guo
Shiwei Luo
Jiadong Wang
Tao Ji
Yuanbin Wu
CLL
135
1
0
04 Dec 2024
CredID: Credible Multi-Bit Watermark for Large Language Models Identification
Haoyu Jiang
Xuhong Wang
Ping Yi
Shanzhe Lei
Yilun Lin
WaLM
161
1
0
04 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Qu He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Yang Liu
Yun Wang
Chengjie Wang
Xuelong Li
Jing Zhang
DiffM
215
1
0
04 Dec 2024
Video LLMs for Temporal Reasoning in Long Videos
Fawad Javed Fateh
Umer Ahmed
Hamza Khan
M. Zia
Quoc-Huy Tran
VLM
186
1
0
04 Dec 2024
Robust Multi-bit Text Watermark with LLM-based Paraphrasers
Xiaojun Xu
Jinghan Jia
Yuanshun Yao
Yang Liu
Hang Li
116
0
0
04 Dec 2024
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts
Dmitry Petrov
Pradyumn Goyal
Divyansh Shivashok
Yuanming Tao
Melinos Averkiou
E. Kalogerakis
129
0
0
03 Dec 2024
Does Few-Shot Learning Help LLM Performance in Code Synthesis?
Derek Xu
Tong Xie
Botao Xia
Haoyu Li
Yunsheng Bai
Yizhou Sun
Wei Wang
192
1
0
03 Dec 2024
SyncFlow: Toward Temporally Aligned Joint Audio-Video Generation from Text
Haohe Liu
Gaël Le Lan
Xinhao Mei
Zhaoheng Ni
Anurag Kumar
Varun K. Nagaraja
Wenwu Wang
Mark D. Plumbley
Yangyang Shi
Vikas Chandra
VGen
159
1
0
03 Dec 2024
FathomGPT: A Natural Language Interface for Interactively Exploring Ocean Science Data
Nabin Khanal
Chun Meng Yu
Jui-Cheng Chiu
Anav Chaudhary
Ziyue Zhang
K. Katija
A. Forbes
AI4CE
114
4
0
03 Dec 2024
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
Quang Hoang Trung
Nguyen Van Hoang Phuc
Le Trung Hoang
Quang Huu Hieu
Vo Nguyen Le Duy
AILaw
RALM
117
0
0
03 Dec 2024
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey
Yunkai Dang
Kaichen Huang
Jiahao Huo
Yibo Yan
Shijie Huang
...
Kun Wang
Yong Liu
Jing Shao
Hui Xiong
Xuming Hu
LRM
170
22
0
03 Dec 2024
SiTSE: Sinhala Text Simplification Dataset and Evaluation
Surangika Ranathunga
Rumesh Sirithunga
Himashi Rathnayake
Lahiru De Silva
Thamindu Aluthwala
Saman Peramuna
Ravi Shekhar
169
1
0
02 Dec 2024
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
Ruichen Wang
Junliang Zhang
Qingsong Xie
Chen Chen
H. Lu
DiffM
129
1
0
02 Dec 2024
The Evolution and Future Perspectives of Artificial Intelligence Generated Content
Chengzhang Zhu
Luobin Cui
Ying Tang
Jiacun Wang
161
1
0
02 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
137
20
0
02 Dec 2024
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost
Sen Xing
Muyan Zhong
Zeqiang Lai
Liangchen Li
Jing Liu
Yaohui Wang
Jifeng Dai
Wenhai Wang
213
2
0
02 Dec 2024
PGSO: Prompt-based Generative Sequence Optimization Network for Aspect-based Sentiment Analysis
Hao Dong
Wei Wei
102
1
0
01 Dec 2024
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
Jiahao Cui
Hui Li
Yun Zhan
Hanlin Shang
K. Cheng
Yuqi Ma
Shan Mu
Hang Zhou
Jingdong Wang
Siyu Zhu
ViT
VGen
201
9
0
01 Dec 2024
ARMOR: Egocentric Perception for Humanoid Robot Collision Avoidance and Motion Planning
Daehwa Kim
Mario Srouji
Chen Chen
Jian Zhang
148
3
0
30 Nov 2024
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
82
0
0
29 Nov 2024
ChineseWebText 2.0: Large-Scale High-quality Chinese Web Text with Multi-dimensional and fine-grained information
Wanyue Zhang
Ziyong Li
Wen Yang
Chunlin Leng
Yinan Bai
Qianlong Du
Chengqing Zong
Jiajun Zhang
114
0
0
29 Nov 2024
Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising
A. Tavanaei
Kee Kiat Koo
Hayreddin Ceker
Shaobai Jiang
Qi Li
Julien Han
Karim Bouyarmane
99
1
0
28 Nov 2024
Previous
1
2
3
...
26
27
28
...
197
198
199
Next