arXiv:1910.10683 (v4, latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,972 papers shown
Title
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Zhimin Li
Jianwei Zhang
Qin Lin
Jiangfeng Xiong
Yanxin Long
...
Wei Liu
Dingyong Wang
Yong Yang
Jie Jiang
Qinglin Lu
ViT
141
120
0
14 May 2024
Full Line Code Completion: Bringing AI to Desktop
Anton Semenkin
Vitaliy Bibaev
Yaroslav Sokolov
Kirill Krylov
Alexey Kalina
...
Mikhail Podvitskii
Petr Surkov
Yaroslav Golubev
Nikita Povarov
T. Bryksin
106
2
0
14 May 2024
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models
Wei Wang
Zhaowei Li
Qi Xu
Yiqing Cai
Hang Song
Qi Qi
Ran Zhou
Zhida Huang
Tao Wang
Li Xiao
ALM
91
1
0
14 May 2024
Improving Transformers with Dynamically Composable Multi-Head Attention
Da Xiao
Qingye Meng
Shengping Li
Xingyuan Yuan
65
4
0
14 May 2024
Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models
Agne Knietaite
Adam Allsebrook
Anton Minkov
Adam Tomaszewski
Norbert Slinko
Richard Johnson
Thomas Pickard
Dylan Phelps
Aline Villavicencio
67
2
0
14 May 2024
Understanding the performance gap between online and offline alignment algorithms
Yunhao Tang
Daniel Guo
Zeyu Zheng
Daniele Calandriello
Yuan Cao
...
Rémi Munos
Bernardo Avila-Pires
Michal Valko
Yong Cheng
Will Dabney
OffRL
OnRL
114
75
0
14 May 2024
TFWT: Tabular Feature Weighting with Transformer
Xinhao Zhang
Zaitian Wang
Lu Jiang
Wanfu Gao
Pengfei Wang
Kunpeng Liu
LMTD
87
18
0
14 May 2024
SpeechVerse: A Large-scale Generalizable Audio Language Model
Nilaksh Das
Saket Dingliwal
S. Ronanki
Rohit Paturi
David Huang
...
Monica Sunkara
S. Srinivasan
Kyu J. Han
Katrin Kirchhoff
124
44
0
14 May 2024
Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis
Yifan Wang
Aleksander Holynski
Brian L. Curless
Steven M. Seitz
69
2
0
13 May 2024
Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp
Rachel Hong
William Agnew
Tadayoshi Kohno
Jamie Morgenstern
107
15
0
13 May 2024
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
Mengkang Hu
Haoyu Dong
Ping Luo
Shi Han
Dongmei Zhang
LMTD
RALM
73
3
0
13 May 2024
MambaOut: Do We Really Need Mamba for Vision?
Weihao Yu
Xinchao Wang
Mamba
99
60
0
13 May 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Liam Dugan
Alyssa Hwang
Filip Trhlik
Josh Magnus Ludan
Andrew Zhu
Hainiu Xu
Daphne Ippolito
Christopher Callison-Burch
DeLMO
AAML
124
52
0
13 May 2024
Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Xiaoyu Chen
Changde Du
Che Liu
Yizhe Wang
Huiguang He
70
3
0
13 May 2024
Localizing Task Information for Improved Model Merging and Compression
Ke Wang
Nikolaos Dimitriadis
Guillermo Ortiz-Jimenez
François Fleuret
Pascal Frossard
MoMe
94
60
0
13 May 2024
Generating Human Motion in 3D Scenes from Text Descriptions
Zhi Cen
Huaijin Pi
Sida Peng
Zehong Shen
Minghui Yang
Shuai Zhu
Hujun Bao
Xiaowei Zhou
84
21
0
13 May 2024
Synthetic Test Collections for Retrieval Evaluation
Hossein A. Rahmani
Nick Craswell
Emine Yilmaz
Bhaskar Mitra
Daniel Fernando Campos
75
23
0
13 May 2024
ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source
Hung Tuan Le
Long Truong To
Manh Trong Nguyen
Kiet Van Nguyen
119
3
0
13 May 2024
CataLM: Empowering Catalyst Design Through Large Language Models
Ludi Wang
Xueqing Chen
Yi Du
Yuanchun Zhou
Yang Gao
Wenjuan Cui
70
4
0
13 May 2024
The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective
Andrew Shin
Yusuke Mori
Kunitake Kaneko
VGen
EGVM
58
2
0
13 May 2024
DEPTH: Discourse Education through Pre-Training Hierarchically
Zachary Bamberger
Ofek Glick
Chaim Baskin
Yonatan Belinkov
128
0
0
13 May 2024
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning
Masane Fuchi
Tomohiro Takagi
DiffM
VLM
117
15
0
12 May 2024
Bottleneck-Minimal Indexing for Generative Document Retrieval
Xin Du
Lixin Xiu
Kumiko Tanaka-Ishii
99
2
0
12 May 2024
InsightNet: Structured Insight Mining from Customer Feedback
Sandeep Sricharan Mukku
Manan Soni
Jitenkumar Rana
Chetan Aggarwal
Promod Yenigalla
Rashmi Patange
Shyam Mohan
143
3
0
12 May 2024
MUD: Towards a Large-Scale and Noise-Filtered UI Dataset for Modern Style UI Modeling
Sidong Feng
Suyu Ma
Han Wang
David Kong
Chunyang Chen
107
11
0
11 May 2024
Event GDR: Event-Centric Generative Document Retrieval
Yong Guan
Dingxiao Liu
Jinchen Ma
Hao Peng
Xiaozhi Wang
Lei Hou
Ru Li
73
1
0
11 May 2024
Word-specific tonal realizations in Mandarin
Yu-Ying Chuang
Melanie J. Bell
Yu-Hsiang Tseng
R. Baayen
148
5
0
11 May 2024
The Ghanaian NLP Landscape: A First Look
Sheriff Issaka
Zhaoyi Zhang
Mihir Heda
Keyi Wang
Yinka Ajibola
Ryan DeMar
Xuefeng Du
77
2
0
10 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CE
MedIm
LM&MA
124
12
0
10 May 2024
CANAL – Cyber Activity News Alerting Language Model: Empirical Approach vs. Expensive LLM
Urjitkumar Patel
Fang-Chun Yeh
Chinmay Gondhalekar
73
3
0
10 May 2024
A Survey of Large Language Models for Graphs
Xubin Ren
Jiabin Tang
D. Yin
Nitesh Chawla
Chao Huang
110
47
0
10 May 2024
Federated Document Visual Question Answering: A Pilot Study
Khanh Nguyen
Dimosthenis Karatzas
FedML
90
0
0
10 May 2024
ATSumm: Auxiliary information enhanced approach for abstractive disaster Tweet Summarization with sparse training data
Piyush Garg
Roshni Chakraborty
Sourav Kumar Dandapat
60
2
0
10 May 2024
Are EEG-to-Text Models Working?
Hyejeong Jo
Yiqian Yang
Juhyeok Han
Yiqun Duan
Hui Xiong
Won Hee Lee
105
19
0
10 May 2024
E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction
Mohammad Ghiasvand Mohammadkhani
Niloofar Ranjbar
S. Momtazi
103
1
0
10 May 2024
FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based Optimization
Zhiyuan Ning
Chunlin Tian
Meng Xiao
Wei Fan
Pengyang Wang
Li Li
P. Wang
Yuanchun Zhou
77
9
0
10 May 2024
Aspect-oriented Consumer Health Answer Summarization
Rochana Chaturvedi
Abari Bhattacharya
S. Yadav
92
4
0
10 May 2024
Pruning as a Domain-specific LLM Extractor
Nan Zhang
Yanchi Liu
Xujiang Zhao
Wei Cheng
Runxue Bao
Rui Zhang
Prasenjit Mitra
Haifeng Chen
66
14
0
10 May 2024
Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process
Tong Xiao
Jia-Yin Liu
Zhenya Huang
Jinze Wu
Jing Sha
Shijin Wang
Enhong Chen
AI4CE
80
4
0
10 May 2024
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models
Wenqi Fan
Yujuan Ding
Liang-bo Ning
Shijie Wang
Hengyun Li
D. Yin
Tat-Seng Chua
Qing Li
RALM
3DV
170
260
0
10 May 2024
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds
Christopher Cui
Xiangyu Peng
Mark O. Riedl
LLMAG
OffRL
MoE
94
1
0
09 May 2024
Bayesian Prediction-Powered Inference
R. A. Hofer
Joshua Maynez
Bhuwan Dhingra
Adam Fisch
Amir Globerson
William W. Cohen
84
3
0
09 May 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Peng Gao
Le Zhuo
Ziyi Lin
Ruoyi Du
Xu Luo
...
Weicai Ye
He Tong
Jingwen He
Yu Qiao
Hongsheng Li
VGen
107
91
0
09 May 2024
DOLOMITES: Domain-Specific Long-Form Methodical Tasks
Chaitanya Malaviya
Priyanka Agrawal
Kuzman Ganchev
Pranesh Srinivasan
Fantine Huot
Jonathan Berant
Mark Yatskar
Dipanjan Das
Mirella Lapata
Chris Alberti
71
6
0
09 May 2024
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
Ruihao Gong
Yang Yong
Shiqiao Gu
Yushi Huang
Chentao Lv
Yunchen Zhang
Xianglong Liu
Dacheng Tao
MQ
116
10
0
09 May 2024
An Automatic Prompt Generation System for Tabular Data Tasks
Ashlesha Akella
Abhijit Manatkar
Brij Chavda
Hima Patel
LMTD
54
0
0
09 May 2024
Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
Shibo Jie
Yehui Tang
Ning Ding
Zhi-Hong Deng
Kai Han
Yunhe Wang
VLM
128
11
0
09 May 2024
Automated Program Repair: Emerging trends pose and expose problems for benchmarks
J. Renzullo
Pemma Reiter
Westley Weimer
Stephanie Forrest
95
3
0
08 May 2024
The Power of Absence: Thinking with Archival Theory in Algorithmic Design
Jihan Sherman
Romi Morrison
Lauren Klein
Daniela Rosner
AI4CE
82
8
0
08 May 2024
You Only Cache Once: Decoder-Decoder Architectures for Language Models
Yutao Sun
Li Dong
Yi Zhu
Shaohan Huang
Wenhui Wang
Shuming Ma
Quanlu Zhang
Jianyong Wang
Furu Wei
VLM
122
64
0
08 May 2024