2210.16174
Cited By
Multimodal Transformer for Parallel Concatenated Variational Autoencoders
28 October 2022
Stephen D. Liang, J. Mendel
ViT
Papers citing "Multimodal Transformer for Parallel Concatenated Variational Autoencoders"
4 / 4 papers shown
Title
Comparison of Autoencoders for tokenization of ASL datasets
Vouk Praun-Petrovic, Aadhvika Koundinya, Lavanya Prahallad
DiffM
42
0
0
12 Jan 2025
Masked Autoencoders Are Scalable Vision Learners
Kaiming He, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, Ross B. Girshick
ViT, TPM
305
7,443
0
11 Nov 2021
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Huayu Chen, Boqing Gong
ViT
248
577
0
22 Apr 2021
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers
Lisa Anne Hendricks, John F. J. Mellor, R. Schneider, Jean-Baptiste Alayrac, Aida Nematzadeh
79
110
0
31 Jan 2021