Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 9,942 papers shown
Title
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Wenbin An
Feng Tian
Jiahao Nie
Wenkai Shi
Haonan Lin
Yan Chen
Qianying Wang
Y. Wu
Guang Dai
Ping Chen
VLM
96
4
0
22 Jul 2024
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
A. Anandkumar
99
4
0
22 Jul 2024
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models
Amir Mohammad Karimi Mamaghan
Samuele Papa
Karl Henrik Johansson
Stefan Bauer
Andrea Dittadi
OCL
180
9
0
22 Jul 2024
dMel: Speech Tokenization made Simple
Richard He Bai
Tatiana Likhomanenko
Ruixiang Zhang
Zijin Gu
Zakaria Aldeneh
Navdeep Jaitly
113
6
0
22 Jul 2024
VideoGameBunny: Towards vision assistants for video games
Mohammad Reza Taesiri
Cor-Paul Bezemer
VLM
MLLM
81
2
0
21 Jul 2024
Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis
Guang-Da Liu
Haitao Mao
Jiliang Tang
K. Johnson
LRM
97
8
0
21 Jul 2024
Two eyes, Two views, and finally, One summary! Towards Multi-modal Multi-tasking Knowledge-Infused Medical Dialogue Summarization
Anisha Saha
Abhisek Tiwari
Bokkasam Venkata Sai Ruthvik
Sriparna Saha
52
0
0
21 Jul 2024
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Yiyang Jiang
Wengyu Zhang
Xu-Lu Zhang
Xiaoyong Wei
Chang Wen Chen
Qing Li
88
4
0
21 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
116
28
0
20 Jul 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons
Shayne Longpre
Robert Mahari
Ariel N. Lee
Campbell Lund
Hamidah Oderinwale
...
Hanlin Li
Daphne Ippolito
Sara Hooker
Jad Kabbara
Sandy Pentland
129
43
0
20 Jul 2024
Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks
Jiayu Lin
Guanrong Chen
Bojun Jin
Chenyang Li
Shutong Jia
...
R. Xu
Long Zhang
Jiuxin Cao
Ting Jin
Zhongyu Wei
84
1
0
20 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
114
4
0
20 Jul 2024
Composer's Assistant 2: Interactive Multi-Track MIDI Infilling with Fine-Grained User Control
Martin E. Malandro
81
3
0
19 Jul 2024
LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits
Chen-Chia Chang
Yikang Shan
Shaoze Fan
Jing Li
Shun Zhang
Ningyuan Cao
Yiran Chen
Xin Zhang
59
14
0
19 Jul 2024
Advancing Chart Question Answering with Robust Chart Component Recognition
Hanwen Zheng
Sijia Wang
Chris Thomas
Lifu Huang
89
1
0
19 Jul 2024
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seung-geun Chi
Hyung-Gun Chi
Hengbo Ma
Nakul Agarwal
Faizan Siddiqui
Karthik Ramani
Kwonjoon Lee
DiffM
120
13
0
19 Jul 2024
Evaluating the Reliability of Self-Explanations in Large Language Models
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
LRM
138
1
0
19 Jul 2024
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service
HamidReza Imani
Abdolah Amirany
Tarek A. El-Ghazawi
MoE
98
10
0
19 Jul 2024
Stable Audio Open
Zach Evans
Julian Parker
CJ Carr
Zack Zukowski
Josiah Taylor
Jordi Pons
268
53
0
19 Jul 2024
Voices in a Crowd: Searching for Clusters of Unique Perspectives
Nikolas Vitsakis
Amit Parekh
Ioannis Konstas
93
1
0
19 Jul 2024
Unlearning Concepts from Text-to-Video Diffusion Models
Shiqi Liu
Yihua Tan
DiffM
72
0
0
19 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
92
10
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm
AI4CE
61
0
0
19 Jul 2024
Watermark Smoothing Attacks against Language Models
Hongyan Chang
Hamed Hassani
Reza Shokri
WaLM
143
3
0
19 Jul 2024
Learning Visual Grounding from Generative Vision and Language Model
Shijie Wang
Dahun Kim
A. Taalimi
Chen Sun
Weicheng Kuo
ObjD
116
7
0
18 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
99
3
0
18 Jul 2024
Compressed models are NOT miniature versions of large models
Rohit Raj Rai
Rishant Pal
Amit Awekar
60
0
0
18 Jul 2024
MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking
Ting-Chih Chen
Chia-Wei Tang
Chris Thomas
96
5
0
18 Jul 2024
Establishing Knowledge Preference in Language Models
Sizhe Zhou
Sha Li
Yu Meng
Yizhu Jiao
Heng Ji
Jiawei Han
KELM
135
0
0
17 Jul 2024
SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization
Rui Xie
Asad Ul Haq
Linsen Ma
Krystal Sun
Sanchari Sen
Swagath Venkataramani
Liu Liu
Tong Zhang
MQ
37
1
0
17 Jul 2024
Audio Conditioning for Music Generation via Discrete Bottleneck Features
Simon Rouard
Yossi Adi
Jade Copet
Axel Roebel
Alexandre Défossez
MGen
105
1
0
17 Jul 2024
On Initializing Transformers with Pre-trained Embeddings
Ha Young Kim
Niranjan Balasubramanian
Byungkon Kang
66
1
0
17 Jul 2024
GeoHard
\textit{GeoHard}
GeoHard
: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Fengyu Cai
Xinran Zhao
Hongming Zhang
Iryna Gurevych
Heinz Koeppl
59
0
0
17 Jul 2024
Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference
Ghadeer Jaradat
M. Tolba
Ghada Alsuhli
Hani Saleh
Mahmoud Al-Qutayri
Thanos Stouraitis
Baker Mohammad
75
0
0
17 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
60
4
0
17 Jul 2024
Navigating the Noisy Crowd: Finding Key Information for Claim Verification
Haisong Gong
Huanhuan Ma
Qiang Liu
Shu Wu
Liang Wang
96
1
0
17 Jul 2024
M2DS: Multilingual Dataset for Multi-document Summarisation
Kushan Hewapathirana
Nisansa de Silva
Sri Lanka
81
1
0
17 Jul 2024
Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
Shuli Jiang
S. Kadhe
Yi Zhou
Farhan Ahmed
Ling Cai
Nathalie Baracaldo
SILM
AAML
93
6
0
17 Jul 2024
When can transformers compositionally generalize in-context?
Seijin Kobayashi
Simon Schug
Yassir Akram
Florian Redhardt
J. Oswald
Razvan Pascanu
Guillaume Lajoie
João Sacramento
ViT
93
2
0
17 Jul 2024
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
Ofir Abramovich
Niv Nayman
Sharon Fogel
I. Lavi
Ron Litman
Shahar Tsiper
Royee Tichauer
Srikar Appalaraju
Shai Mazor
R. Manmatha
VLM
109
3
0
17 Jul 2024
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Sherwin Bahmani
Ivan Skorokhodov
Aliaksandr Siarohin
Willi Menapace
Guocheng Qian
...
Chaoyang Wang
Jiaxu Zou
Andrea Tagliasacchi
David B. Lindell
Sergey Tulyakov
VGen
DiffM
207
50
0
17 Jul 2024
MASIVE: Open-Ended Affective State Identification in English and Spanish
Nicholas Deas
Elsbeth Turcan
Iván Pérez Mejía
Kathleen McKeown
CVBM
65
1
0
16 Jul 2024
What's Wrong? Refining Meeting Summaries with LLM Feedback
Frederic Kirstein
Terry Ruas
Bela Gipp
111
6
0
16 Jul 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
Yujia Hu
Zhiqiang Hu
C. Seah
Roy Ka-wei Lee
72
0
0
16 Jul 2024
Scaling Sign Language Translation
Biao Zhang
Garrett Tanzer
Orhan Firat
LRM
VLM
SLR
85
1
0
16 Jul 2024
Scaling Diffusion Transformers to 16 Billion Parameters
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
DiffM
MoE
115
21
0
16 Jul 2024
How Control Information Influences Multilingual Text Image Generation and Editing?
Boqiang Zhang
Zuan Gao
Yadong Qu
Hongtao Xie
DiffM
95
5
0
16 Jul 2024
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
Nuo Chen
Yan Wang
Yang Deng
Jia Li
128
21
0
16 Jul 2024
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
Shunqi Mao
Chaoyi Zhang
Hang Su
Hwanjun Song
Igor Shalyminov
Weidong Cai
84
1
0
16 Jul 2024
Genomic Language Models: Opportunities and Challenges
Gonzalo Benegas
Chengzhong Ye
C. Albors
Jianan Canal Li
Yun S. Song
AI4CE
LM&MA
ELM
135
26
0
16 Jul 2024
Previous
1
2
3
...
45
46
47
...
197
198
199
Next