2202.04200
MaskGIT: Masked Generative Image Transformer
8 February 2022
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
Papers citing
"MaskGIT: Masked Generative Image Transformer"
50 / 482 papers shown
Title
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
102
1
0
13 Mar 2024
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
Xinjie Zhang
Shenyuan Gao
Zhening Liu
Jiawei Shao
Xingtong Ge
Dailan He
Tongda Xu
Yan Wang
Jun Zhang
48
1
0
13 Mar 2024
FrameQuant: Flexible Low-Bit Quantization for Transformers
Harshavardhan Adepu
Zhanpeng Zeng
Li Zhang
Vikas Singh
MQ
45
5
0
10 Mar 2024
Extreme Precipitation Nowcasting using Transformer-based Generative Models
Cristian Meo
Ankush Roy
Mircea Lica
Junzhe Yin
Zeineb Bou Che
Yanbo Wang
R. Imhoff
R. Uijlenhoet
Justin Dauwels
23
3
0
06 Mar 2024
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer
Naifu Xue
Qi Mao
Zijian Wang
Yuan Zhang
Siwei Ma
33
5
0
06 Mar 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Zeqian Ju
Yuancheng Wang
Kai Shen
Xu Tan
Detai Xin
...
Shikun Zhang
Jiang Bian
Lei He
Jinyu Li
Sheng Zhao
DiffM
49
145
0
05 Mar 2024
Large Language Models and Games: A Survey and Roadmap
Roberto Gallotta
Graham Todd
Marvin Zammit
Sam Earle
Antonios Liapis
Julian Togelius
Georgios N. Yannakakis
LLMAG
LM&MA
AI4CE
LRM
50
73
0
28 Feb 2024
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
31
46
0
27 Feb 2024
Genie: Generative Interactive Environments
Jake Bruce
Michael Dennis
Ashley D. Edwards
Jack Parker-Holder
Yuge Shi
...
Konrad Zolna
Jeff Clune
Nando de Freitas
Satinder Singh
Tim Rocktäschel
VGen
VLM
74
146
0
23 Feb 2024
Attention-Guided Masked Autoencoders For Learning Image Representations
Leon Sick
Dominik Engel
Pedro Hermosilla
Timo Ropinski
34
1
0
23 Feb 2024
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability
Xue-Qing Qian
Yu Wang
Simian Luo
Yinda Zhang
Ying Tai
...
Xiangyang Xue
Bo Zhao
Tiejun Huang
Yunsheng Wu
Yanwei Fu
29
6
0
19 Feb 2024
WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection
Yan Hong
Jianfu Zhang
74
9
0
19 Feb 2024
Fast Timing-Conditioned Latent Audio Diffusion
Zach Evans
CJ Carr
Josiah Taylor
Scott H. Hawley
Jordi Pons
DiffM
82
102
0
07 Feb 2024
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
Jian Chen
Ruiyi Zhang
Yufan Zhou
Rajiv Jain
Zhiqiang Xu
Ryan A. Rossi
Changyou Chen
DiffM
47
12
0
07 Feb 2024
Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations
Álvaro Martín-Cortinas
Daniel Sáez-Trigueros
Iván Vallés-Pérez
Biel Tura Vecino
Piotr Bilinski
Mateusz Lajszczak
Grzegorz Beringer
Roberto Barra-Chicote
Jaime Lorenzo-Trueba
21
5
0
05 Feb 2024
Cross-view Masked Diffusion Transformers for Person Image Synthesis
T. Pham
Zhang Kang
Chang D. Yoo
53
6
0
02 Feb 2024
Machine Unlearning for Image-to-Image Generative Models
Guihong Li
Hsiang Hsu
Chun-Fu Chen
R. Marculescu
MU
VLM
70
25
0
01 Feb 2024
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
Zecheng Tang
Chenfei Wu
Zekai Zhang
Mingheng Ni
Sheng-Siang Yin
...
Zhengyuan Yang
Lijuan Wang
Zicheng Liu
Juntao Li
Nan Duan
25
10
0
30 Jan 2024
Residual Quantization with Implicit Neural Codebooks
Iris A. M. Huijben
Matthijs Douze
Matthew Muckley
Ruud J. G. van Sloun
Jakob Verbeek
MQ
29
11
0
26 Jan 2024
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models
Senthil Purushwalkam
Akash Gokul
Chenyu You
Nikhil Naik
DiffM
39
17
0
25 Jan 2024
Dream360: Diverse and Immersive Outdoor Virtual Scene Creation via Transformer-Based 360 Image Outpainting
Hao Ai
Zidong Cao
H. Lu
Chen Chen
Jiancang Ma
Pengyuan Zhou
Tae-Kyun Kim
Pan Hui
Lin Wang
43
3
0
19 Jan 2024
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Xiaofeng Wang
Zheng Zhu
Guan Huang
Boyuan Wang
Xinze Chen
Jiwen Lu
VGen
40
32
0
18 Jan 2024
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGen
DiffM
43
35
0
17 Jan 2024
SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers
Nanye Ma
Mark Goldstein
M. S. Albergo
Nicholas M. Boffi
Eric Vanden-Eijnden
Saining Xie
DiffM
40
169
0
16 Jan 2024
Masked Audio Generation using a Single Non-Autoregressive Transformer
Alon Ziv
Itai Gat
Gaël Le Lan
Tal Remez
Felix Kreuk
Alexandre Défossez
Jade Copet
Gabriel Synnaeve
Yossi Adi
54
36
0
09 Jan 2024
Pheme: Efficient and Conversational Speech Generation
Paweł Budzianowski
Taras Sereda
Tomasz Cichy
Ivan Vulić
32
7
0
05 Jan 2024
Efficient Parallel Audio Generation using Group Masked Language Modeling
Myeonghun Jeong
Minchan Kim
Joun Yeop Lee
Nam Soo Kim
30
5
0
02 Jan 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
42
14
0
31 Dec 2023
Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy
Weijian Mai
Jian Zhang
Pengfei Fang
Zhijun Zhang
54
9
0
31 Dec 2023
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
51
11
0
31 Dec 2023
Emage: Non-Autoregressive Text-to-Image Generation
Zhangyin Feng
Runyi Hu
Liangxin Liu
Fan Zhang
Duyu Tang
Yong Dai
Xiaocheng Feng
Jiwei Li
Bing Qin
Shuming Shi
DiffM
VLM
22
0
0
22 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
20
241
0
21 Dec 2023
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Haoyu Ma
Shahin Mahdizadehaghdam
Bichen Wu
Zhipeng Fan
Yuchao Gu
Wenliang Zhao
Lior Shapira
Xiaohui Xie
DiffM
VGen
30
4
0
19 Dec 2023
Anomaly Score: Evaluating Generative Models and Individual Generated Images based on Complexity and Vulnerability
Jaehui Hwang
Junghyuk Lee
Jong-Seok Lee
EGVM
32
2
0
17 Dec 2023
SeiT++: Masked Token Modeling Improves Storage-efficient Training
Min-Seob Lee
Song Park
Byeongho Heo
Dongyoon Han
Hyunjung Shim
MQ
VLM
26
1
0
15 Dec 2023
Fast Sampling via Discrete Non-Markov Diffusion Models
Zixiang Chen
Huizhuo Yuan
Yongqian Li
Yiwen Kou
Junkai Zhang
Quanquan Gu
DiffM
32
6
0
14 Dec 2023
MinD-3D: Reconstruct High-quality 3D objects in Human Brain
Jianxiong Gao
Yu Fu
Yun Wang
Xuelin Qian
Jianfeng Feng
Yanwei Fu
DiffM
27
6
0
12 Dec 2023
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
Shentong Mo
Enze Xie
Yue Wu
Junsong Chen
Matthias Nießner
Zhenguo Li
26
5
0
12 Dec 2023
Photorealistic Video Generation with Diffusion Models
Agrim Gupta
Lijun Yu
Kihyuk Sohn
Xiuye Gu
Meera Hahn
Fei-Fei Li
Irfan Essa
Lu Jiang
José Lezama
VGen
59
177
0
11 Dec 2023
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Oğuzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
50
64
0
11 Dec 2023
TabMT: Generating tabular data with masked transformers
Manbir Gulati
Paul F. Roysdon
LMTD
50
33
0
11 Dec 2023
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu
B. Gao
Jincheng Gao
Jianhao Yu
Hongqing Chu
...
Xun Gong
Yi Chang
H. E. Tseng
Hong Chen
Jie Chen
45
3
0
08 Dec 2023
Free3D: Consistent Novel View Synthesis without 3D Representation
Chuanxia Zheng
Andrea Vedaldi
3DV
45
48
0
07 Dec 2023
MMM: Generative Masked Motion Model
Ekkasit Pinyoanuntapong
Pu Wang
Minwoo Lee
Chong Chen
DiffM
VGen
40
45
0
06 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
38
58
0
06 Dec 2023
Does Vector Quantization Fail in Spatio-Temporal Forecasting? Exploring a Differentiable Sparse Soft-Vector Quantization Approach
Chao Chen
Tian Zhou
Yanjun Zhao
Hui Liu
Liang Sun
Rong Jin
40
0
0
06 Dec 2023
DiffiT: Diffusion Vision Transformers for Image Generation
Ali Hatamizadeh
Jiaming Song
Guilin Liu
Jan Kautz
Arash Vahdat
39
67
0
04 Dec 2023
GIVT: Generative Infinite-Vocabulary Transformers
Michael Tschannen
Cian Eastwood
Fabian Mentzer
31
33
0
04 Dec 2023
IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks
Jiarui Xu
Yossi Gandelsman
Amir Bar
Jianwei Yang
Jianfeng Gao
Trevor Darrell
Xiaolong Wang
VLM
28
3
0
04 Dec 2023
Improve Supervised Representation Learning with Masked Image Modeling
Kaifeng Chen
Daniel M. Salz
Huiwen Chang
Kihyuk Sohn
Dilip Krishnan
Mojtaba Seyedhosseini
SSL
ViT
45
3
0
01 Dec 2023