Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,910 papers shown
Title
Assessment of Transformer-Based Encoder-Decoder Model for Human-Like Summarization
Sindhu Nair
Y. S. Rao
Radha Shankarmani
53
1
0
22 Oct 2024
Can Large Language Models Act as Ensembler for Multi-GNNs?
Hanqi Duan
Yao Cheng
Jianxiang Yu
Xiang Li
AI4CE
77
0
0
22 Oct 2024
Correct after Answer: Enhancing Multi-Span Question Answering with Post-Processing Method
Jiayi Lin
Chenyang Zhang
Haibo Tong
Dongyu Zhang
Qingqing Hong
Bingxuan Hou
Junli Wang
84
0
0
22 Oct 2024
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems
Krishna Sayana
Raghavendra Vasudeva
Yuri Vasilevski
Kun Su
Liam Hebert
H. Pham
Ambarish Jash
Sukhdeep S. Sodhi
3DV
74
4
0
22 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
176
2
0
22 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
492
0
0
22 Oct 2024
SoK: Dataset Copyright Auditing in Machine Learning Systems
L. Du
Xuanru Zhou
M. Chen
Chusong Zhang
Zhou Su
Peng Cheng
Jiming Chen
Zhikun Zhang
MLAU
128
6
0
22 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
80
1
0
22 Oct 2024
Foundation Models for Rapid Autonomy Validation
Alec Farid
Peter Schleede
Aaron Huang
Christoffer Heckman
103
0
0
22 Oct 2024
DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding
Manan Suri
Puneet Mathur
Franck Dernoncourt
R. Jain
Vlad I. Morariu
Ramit Sawhney
Preslav Nakov
Dinesh Manocha
122
3
0
21 Oct 2024
Elucidating the design space of language models for image generation
Xuantong Liu
Shaozhe Hao
Xianbiao Qi
Tianyang Hu
Jun Wang
Rong Xiao
Yuan Yao
VLM
80
3
0
21 Oct 2024
Building A Coding Assistant via the Retrieval-Augmented Language Model
Xinze Li
Hanbin Wang
Zhenghao Liu
S. Yu
Shuo Wang
Yukun Yan
Yukai Fu
Yu Gu
Ge Yu
3DV
RALM
55
4
0
21 Oct 2024
Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance
Mostafa Hussien
Mahmoud Afifi
K. Nguyen
M. Cheriet
94
0
0
21 Oct 2024
Natural GaLore: Accelerating GaLore for memory-efficient LLM Training and Fine-tuning
Arijit Das
36
2
0
21 Oct 2024
PROMPTHEUS: A Human-Centered Pipeline to Streamline SLRs with LLMs
João Pedro Fernandes Torres
Catherine Mulligan
Joaquim A. Jorge
Catarina Moreira
80
3
0
21 Oct 2024
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
Junsheng Zhou
Yu-Shen Liu
Zhizhong Han
ViT
118
11
0
21 Oct 2024
Mitigating Object Hallucination via Concentric Causal Attention
Yun Xing
Yiheng Li
Ivan Laptev
Shijian Lu
108
23
0
21 Oct 2024
Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs
Xin Ma
Yang Liu
Qingbin Liu
Xiaoxu Ma
46
1
0
21 Oct 2024
Improve Dense Passage Retrieval with Entailment Tuning
Lu Dai
Hao Liu
Hui Xiong
RALM
125
4
0
21 Oct 2024
Reducing Hallucinations in Vision-Language Models via Latent Space Steering
Sheng Liu
Haotian Ye
Lei Xing
James Zou
VLM
LLMSV
167
9
0
21 Oct 2024
A Survey of Conversational Search
Fengran Mo
Kelong Mao
Ziliang Zhao
Hongjin Qian
Haonan Chen
Yiruo Cheng
Xiaochen Li
Yinlin Zhu
Zhicheng Dou
Jian-Yun Nie
KELM
96
6
0
21 Oct 2024
Pruning Foundation Models for High Accuracy without Retraining
Pu Zhao
Fei Sun
Xuan Shen
Pinrui Yu
Zhenglun Kong
Yanzhi Wang
Xue Lin
82
13
0
21 Oct 2024
LDAdam: Adaptive Optimization from Low-Dimensional Gradient Statistics
Thomas Robert
M. Safaryan
Ionut-Vlad Modoranu
Dan Alistarh
ODL
81
7
0
21 Oct 2024
Allegro: Open the Black Box of Commercial-Level Video Generation Model
Yuan Zhou
Qiuyue Wang
Yuxuan Cai
Huan Yang
VGen
VLM
155
37
0
20 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation
Ronan Docherty
Antonis Vamvakeros
Samuel J. Cooper
69
2
0
20 Oct 2024
Causality for Large Language Models
Anpeng Wu
Kun Kuang
Minqin Zhu
Yingrong Wang
Yujia Zheng
Kairong Han
Yangqiu Song
Guangyi Chen
Leilei Gan
Kun Zhang
LRM
115
9
0
20 Oct 2024
Contextual Augmented Multi-Model Programming (CAMP): A Hybrid Local-Cloud Copilot Framework
Yuchen Wang
Shangxin Guo
C. Tan
86
0
0
20 Oct 2024
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
Mamadou K. Keita
Christopher Homan
Sofiane Abdoulaye Hamani
Adwoa Bremang
Marcos Zampieri
Habibatou Abdoulaye Alfari
Elysabhete Amadou Ibrahim
127
0
0
20 Oct 2024
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Alan Dao
Dinh Bach Vu
Huy Hoang Ha
AuLLM
VLM
141
5
0
20 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD
VLM
155
0
0
20 Oct 2024
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
Yuankai Li
Jia-Chen Gu
Di Wu
Kai-Wei Chang
Nanyun Peng
RALM
MQ
72
0
0
20 Oct 2024
A survey of neural-network-based methods utilising comparable data for finding translation equivalents
Michaela Denisová
Pavel Rychlý
81
0
0
19 Oct 2024
Are LLMs Good Zero-Shot Fallacy Classifiers?
Fengjun Pan
Xiaobao Wu
Zongrui Li
Anh Tuan Luu
LRM
137
14
0
19 Oct 2024
Group Diffusion Transformers are Unsupervised Multitask Learners
Lianghua Huang
Wei Wang
Zhi-Fan Wu
Huanzhang Dou
Yupeng Shi
Yutong Feng
C. Liang
Yu Liu
Jingren Zhou
VLM
126
13
0
19 Oct 2024
Cross-Document Event-Keyed Summarization
William Walden
Pavlo Kuchmiichuk
Alexander Martin
Chihsheng Jin
Angela Cao
Claire Sun
Curisia Allen
Aaron Steven White
RALM
53
0
0
18 Oct 2024
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities
Shaozhe Hao
Xuantong Liu
Xianbiao Qi
Shihao Zhao
Bojia Zi
Rong Xiao
Kai Han
Kwan-Yee K. Wong
198
3
0
18 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
135
3
0
18 Oct 2024
Large Language Models Are Overparameterized Text Encoders
Thennal D K
Tim Fischer
Chris Biemann
85
2
0
18 Oct 2024
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
R. Teo
Tan M. Nguyen
MoE
94
3
0
18 Oct 2024
LEAD: Latent Realignment for Human Motion Diffusion
Nefeli Andreou
Xi Wang
Victoria Fernandez-Abrevaya
Marie-Paule Cani
Y. Chrysanthou
Vicky Kalogeiton
VGen
DiffM
88
4
0
18 Oct 2024
HYPNOS : Highly Precise Foreground-focused Diffusion Finetuning for Inanimate Objects
Oliverio Theophilus Nathanael
Jonathan Samuel Lumentut
Nicholas Hans Muliawan
Edbert Valencio Angky
Felix Indra Kurniadi
Alfi Yusrotis Zakiyyah
Jeklin Harefa
DiffM
49
1
0
18 Oct 2024
Speciesism in Natural Language Processing Research
Masashi Takeshita
Rafal Rzepka
68
2
0
18 Oct 2024
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training
Muhe Ding
Yang Ma
Pengda Qin
Jianlong Wu
Yuhong Li
Liqiang Nie
78
1
0
18 Oct 2024
Leveraging Large Language Models for Enhancing Public Transit Services
Jiahao Wang
Amer Shalaby
51
2
0
18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Hieu Nguyen
Tho Thanh Quan
Ngan Luu-Thuy Nguyen
75
0
0
18 Oct 2024
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling
Denis Kuznedelev
Eldar Kurtic
Dan Alistarh
MQ
71
5
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
94
3
0
18 Oct 2024
Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts
German Gritsai
Anastasia Voznyuk
Andrey Grabovoy
Yury Chekhovich
DeLMO
143
2
0
18 Oct 2024
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Lijie Fan
Tianhong Li
Siyang Qin
Yuanzhen Li
Chen Sun
Michael Rubinstein
Deqing Sun
Kaiming He
Yonglong Tian
VLM
DiffM
131
57
0
17 Oct 2024
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
Runsen Xu
Zhiwei Huang
Tai Wang
Yuxiao Chen
Jiangmiao Pang
Dahua Lin
VGen
97
18
0
17 Oct 2024
Previous
1
2
3
...
31
32
33
...
197
198
199
Next