Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,903 papers shown
Title
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
Joy Mahapatra
Utpal Garain
HILM
ALM
100
2
0
28 Nov 2024
Efficient Learning Content Retrieval with Knowledge Injection
Batuhan Sariturk
Rabia Bayraktar
Merve Elmas Erdem
121
0
0
28 Nov 2024
OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
Hui Li
Mingwang Xu
Yun Zhan
Shan Mu
Jiaye Li
...
Yukang Chen
Tan Chen
Mao Ye
Jingdong Wang
Siyu Zhu
VGen
210
7
0
28 Nov 2024
DRC-Coder: Automated DRC Checker Code Generation Using LLM Autonomous Agent
Chen-Chia Chang
Chia-Tung Ho
Yaguang Li
Yuxiao Chen
Haoxing Ren
3DV
139
2
0
28 Nov 2024
Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
Tianyi Wei
Dongdong Chen
Yifan Zhou
Xingang Pan
EGVM
137
3
0
27 Nov 2024
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
125
0
0
27 Nov 2024
CoA: Chain-of-Action for Generative Semantic Labels
Meng Wei
Zhongnian Li
Peng Ying
Xinzheng Xu
VLM
119
0
0
26 Nov 2024
PIM-AI: A Novel Architecture for High-Efficiency LLM Inference
Cristobal Ortega
Yann Falevoz
Renaud Ayrignac
114
3
0
26 Nov 2024
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
Claudia Cuttano
Gabriele Trivigno
Gabriele Rosi
Carlo Masone
Giuseppe Averta
VOS
204
2
0
26 Nov 2024
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning
Zhen Sun
Tianshuo Cong
Yule Liu
Chenhao Lin
Xinlei He
Rongmao Chen
Xingshuo Han
Xinyi Huang
AAML
174
6
0
26 Nov 2024
SoK: Decentralized AI (DeAI)
Zhipeng Wang
Rui Sun
Elizabeth Lui
Vatsal Shah
Xihan Xiong
Jiahao Sun
Davide Crapis
William Knottenbelt
196
2
0
26 Nov 2024
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
S. Ramprasad
Byron C. Wallace
LLMAG
HILM
146
3
0
25 Nov 2024
FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web
Cheng-Wei Lin
Wan-Hsuan Hsieh
Kai-Xin Guan
Chan-Jan Hsu
Chia-Chen Kuo
Chuan-Lin Lai
Chung-Wei Chung
Ming-Jen Wang
Da-shan Shiu
74
1
0
25 Nov 2024
Cautious Optimizers: Improving Training with One Line of Code
Kaizhao Liang
Lizhang Chen
B. Liu
Qiang Liu
ODL
261
9
0
25 Nov 2024
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Sohee Yang
Nora Kassner
E. Gribovskaya
Sebastian Riedel
Mor Geva
LRM
KELM
ReLM
173
9
0
25 Nov 2024
Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing
Kaifeng Gao
Jiaxin Shi
Hanwang Zhang
Chunping Wang
Jun Xiao
Long Chen
VGen
DiffM
211
4
0
25 Nov 2024
Anda: Unlocking Efficient LLM Inference with a Variable-Length Grouped Activation Data Format
Chao Fang
Man Shi
Robin Geens
Arne Symons
Zhongfeng Wang
Marian Verhelst
154
2
0
24 Nov 2024
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference
Yuhang Yang
Jinhong Deng
Wen Li
Lixin Duan
VLM
108
1
0
24 Nov 2024
LoRA-Mini : Adaptation Matrices Decomposition and Selective Training
Ayush Singh
Rajdeep Aher
Shivank Garg
129
1
0
24 Nov 2024
State-Space Large Audio Language Models
Saurabhchand Bhati
Yuan Gong
Leonid Karlinsky
Hilde Kuehne
Rogerio Feris
James Glass
153
1
0
24 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
104
0
0
22 Nov 2024
Financial Risk Assessment via Long-term Payment Behavior Sequence Folding
Yiran Qiao
Yateng Tang
Xiang Ao
Qi Yuan
Ziming Liu
Chen Shen
Xuehao Zheng
98
0
0
22 Nov 2024
Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
Jeeyung Kim
Erfan Esmaeili
Qiang Qiu
DiffM
137
1
0
21 Nov 2024
Efficient Aspect-Based Summarization of Climate Change Reports with Small Language Models
Iacopo Ghinassi
Leonardo Catalano
Tommaso Colella
105
1
0
21 Nov 2024
Rethinking the Intermediate Features in Adversarial Attacks: Misleading Robotic Models via Adversarial Distillation
Ke Zhao
Huayang Huang
Miao Li
Yu Wu
AAML
117
1
0
21 Nov 2024
Quantization without Tears
Minghao Fu
Hao Yu
Jie Shao
Junjie Zhou
Ke Zhu
Jianxin Wu
MQ
201
3
0
21 Nov 2024
FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers
Zehua Pei
Hui-Ling Zhen
Xianzhi Yu
Sinno Jialin Pan
Mingxuan Yuan
Bei Yu
AI4CE
251
3
0
21 Nov 2024
Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong
Y. Fu
Shizhe Diao
Wonmin Byeon
Zijia Chen
...
Min-Hung Chen
Yoshi Suhara
Y. Lin
Jan Kautz
Pavlo Molchanov
Mamba
164
27
0
20 Nov 2024
LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement
Siwen Jiao
Yangyi Fang
Baoyun Peng
Wangqun Chen
Bharadwaj Veeravalli
227
5
0
20 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
199
1
0
20 Nov 2024
MAS-Attention: Memory-Aware Stream Processing for Attention Acceleration on Resource-Constrained Edge Devices
Mohammadali Shakerdargah
Shan Lu
Chao Gao
Di Niu
174
0
0
20 Nov 2024
Signformer is all you need: Towards Edge AI for Sign Language
Eta Yang
SLR
141
0
0
19 Nov 2024
Selective Attention: Enhancing Transformer through Principled Context Control
Xuechen Zhang
Xiangyu Chang
Mingchen Li
Amit K. Roy-Chowdhury
Jiasi Chen
Samet Oymak
131
3
0
19 Nov 2024
Generalized Prompt Tuning: Adapting Frozen Univariate Time Series Foundation Models for Multivariate Healthcare Time Series
Mingzhu Liu
Angela H. Chen
George H. Chen
AI4TS
112
1
0
19 Nov 2024
Heuristic-Free Multi-Teacher Learning
Huy Thong Nguyen
En-Hung Chu
Lenord Melvix
Jazon Jiao
Chunglin Wen
Benjamin Louie
150
0
0
19 Nov 2024
Neon: News Entity-Interaction Extraction for Enhanced Question Answering
Sneha Singhania
Silviu Cucerzan
Allen Herring
S. Jauhar
KELM
137
0
0
19 Nov 2024
GRL-Prompt: Towards Knowledge Graph based Prompt Optimization via Reinforcement Learning
Yuze Liu
Tingjie Liu
Tiehua Zhang
Youhua Xia
Jinze Wang
Zhishu Shen
Jiong Jin
Fei Richard Yu
132
0
0
19 Nov 2024
Bi-Mamba: Towards Accurate 1-Bit State Space Models
Shengkun Tang
Liqun Ma
Haoyang Li
Mingjie Sun
Zhiqiang Shen
Mamba
127
3
0
18 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
89
8
0
18 Nov 2024
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
Tianyi Yan
Dongming Wu
Wencheng Han
Junpeng Jiang
Xia Zhou
Kun Zhan
Cheng-Zhong Xu
Jianbing Shen
131
7
0
18 Nov 2024
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements
M. Arda Aydın
Efe Mert Çırpar
Elvin Abdinli
Gözde B. Ünal
Y. Sahin
VLM
293
1
0
18 Nov 2024
The Promises and Pitfalls of LLM Annotations in Dataset Labeling: a Case Study on Media Bias Detection
Tomas Horych
Christoph Mandl
Terry Ruas
André Greiner-Petter
Bela Gipp
Akiko Aizawa
Timo Spinde
185
7
0
17 Nov 2024
Bias in Large Language Models: Origin, Evaluation, and Mitigation
Yufei Guo
Muzhe Guo
Juntao Su
Zhou Yang
Mengqiu Zhu
Hongfei Li
Mengyang Qiu
Shuo Shuo Liu
AILaw
106
22
0
16 Nov 2024
HIST-AID: Leveraging Historical Patient Reports for Enhanced Multi-Modal Automatic Diagnosis
Haoxu Huang
Cem M. Deniz
K. Cho
S. Chopra
Divyam Madaan
72
1
0
16 Nov 2024
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment
Y. Fu
Zhongzhi Yu
Junwei Li
Jiayi Qian
Yongan Zhang
Xiangchi Yuan
Dachuan Shi
Roman Yakunin
Y. Lin
98
4
0
15 Nov 2024
On the Privacy Risk of In-context Learning
Haonan Duan
Adam Dziedzic
Mohammad Yaghini
Nicolas Papernot
Franziska Boenisch
SILM
PILM
131
42
0
15 Nov 2024
Prompting and Fine-tuning Large Language Models for Automated Code Review Comment Generation
Md. Asif Haider
Ayesha Binte Mostofa
Sk. Sabit Bin Mosaddek
Anindya Iqbal
Toufique Ahmed
ALM
92
3
0
15 Nov 2024
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Qu He
C. Xu
Jinlong Peng
Jing Zhang
Chengjie Wang
Yunsheng Wu
Yanwei Fu
DiffM
92
12
0
15 Nov 2024
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems
Taaha Kazi
Ruiliang Lyu
Sizhe Zhou
Dilek Hakkani-Tur
Gokhan Tur
ELM
LLMAG
68
2
0
15 Nov 2024
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
Youpeng Wen
Junfan Lin
Yinlin Zhu
Jiawei Han
Hang Xu
Shen Zhao
Xiaodan Liang
VGen
DiffM
100
5
0
14 Nov 2024
Previous
1
2
3
...
27
28
29
...
197
198
199
Next