Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,744 papers shown
Title
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
92
1,696
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Y. Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
36
1,786
0
02 Aug 2022
AI Augmented Edge and Fog Computing: Trends and Challenges
Shreshth Tuli
Fatemeh Mirhakimi
Samodha Pallewatta
Syed Zawad
G. Casale
B. Javadi
Feng Yan
Rajkumar Buyya
N. Jennings
21
56
0
01 Aug 2022
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
152
64
0
29 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
18
70
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. OÑeil
Sotirios A. Tsaftaris
MedIm
DiffM
26
63
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
24
28
0
25 Jul 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
43
102
0
23 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
22
9
0
22 Jul 2022
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
Ming-Yu Liu
Yuxiang Wei
Xiaohe Wu
Wangmeng Zuo
Lei Zhang
35
1
0
21 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
36
296
0
20 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
27
72
0
20 Jul 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
39
79
0
19 Jul 2022
Mimetic Models: Ethical Implications of AI that Acts Like You
Reid McIlroy-Young
Jon M. Kleinberg
S. Sen
Solon Barocas
Ashton Anderson
13
16
0
19 Jul 2022
Progressive Deblurring of Diffusion Models for Coarse-to-Fine Image Synthesis
Sangyun Lee
Hyungjin Chung
Jaehyeon Kim
Jong Chul Ye
DiffM
29
45
0
16 Jul 2022
How to Reuse and Compose Knowledge for a Lifetime of Tasks: A Survey on Continual Learning and Functional Composition
Jorge Armando Mendez Mendez
Eric Eaton
KELM
CLL
32
27
0
15 Jul 2022
WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation
Mengping Yang
Zhe Wang
Ziqiu Chi
Wenyi Feng
25
49
0
15 Jul 2022
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval
Jinbin Bai
Chunhui Liu
Feiyue Ni
Haofan Wang
Mengying Hu
Xiaofeng Guo
Lele Cheng
45
11
0
11 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
158
437
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
20
18
0
09 Jul 2022
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
Matteo Manica
Jannis Born
Joris Cadow
Dimitrios Christofidellis
A. Dave
...
Lauren N. McHugh
Alexy Khrabrov
Payel Das
Seiji Takeda
John Smith
22
26
0
08 Jul 2022
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
32
0
0
08 Jul 2022
Exploring Generative Adversarial Networks for Text-to-Image Generation with Evolution Strategies
Victor G. Turrisi da Costa
Nuno Lourenço
João Correia
Penousal Machado
GAN
17
3
0
06 Jul 2022
Can Language Understand Depth?
Renrui Zhang
Ziyao Zeng
Ziyu Guo
Yafeng Li
VLM
MDE
39
71
0
03 Jul 2022
American == White in Multimodal Language-and-Image AI
Robert Wolfe
Aylin Caliskan
VLM
27
46
0
01 Jul 2022
Deep Learning and Symbolic Regression for Discovering Parametric Equations
Michael Zhang
Samuel Kim
Peter Y. Lu
M. Soljavcić
29
18
0
01 Jul 2022
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
A. Madry
23
90
0
29 Jun 2022
Beyond neural scaling laws: beating power law scaling via data pruning
Ben Sorscher
Robert Geirhos
Shashank Shekhar
Surya Ganguli
Ari S. Morcos
22
418
0
29 Jun 2022
Memory Safe Computations with XLA Compiler
A. Artemev
Tilman Roeder
Mark van der Wilk
29
8
0
28 Jun 2022
Studying Generalization Through Data Averaging
C. Gomez-Uribe
FedML
24
0
0
28 Jun 2022
Perspective (In)consistency of Paint by Text
Hany Farid
DiffM
25
36
0
27 Jun 2022
Repository-Level Prompt Generation for Large Language Models of Code
Disha Shrivastava
Hugo Larochelle
Daniel Tarlow
28
137
0
26 Jun 2022
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge Belongie
Sagie Benaim
VGen
DiffM
27
16
0
24 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
42
288
0
23 Jun 2022
The ArtBench Dataset: Benchmarking Generative Models with Artworks
Peiyuan Liao
Xiuyu Li
Xihui Liu
Kurt Keutzer
22
47
0
22 Jun 2022
A Study on the Evaluation of Generative Models
Eyal Betzalel
Coby Penso
Aviv Navon
Ethan Fetaya
EGVM
25
48
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
107
1,066
0
22 Jun 2022
EpiGRAF: Rethinking training of 3D GANs
Ivan Skorokhodov
Sergey Tulyakov
Yiqun Wang
Peter Wonka
DiffM
33
125
0
21 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
19
109
0
21 Jun 2022
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang
Joonghyuk Shin
Jaesik Park
EGVM
16
67
0
19 Jun 2022
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
BDL
31
15
0
18 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
56
393
0
17 Jun 2022
Lossy Compression with Gaussian Diffusion
Lucas Theis
Tim Salimans
Matthew D. Hoffman
Fabian Mentzer
DiffM
33
78
0
17 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
51
352
0
17 Jun 2022
MixGen: A New Multi-Modal Data Augmentation
Xiaoshuai Hao
Yi Zhu
Srikar Appalaraju
Aston Zhang
Wanqian Zhang
Boyang Li
Mu Li
VLM
22
83
0
16 Jun 2022
Know your audience: specializing grounded language models with listener subtraction
Aaditya K. Singh
David Ding
Andrew M. Saxe
Felix Hill
Andrew Kyle Lampinen
27
2
0
16 Jun 2022
Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning
Anastasia Koloskova
Sebastian U. Stich
Martin Jaggi
FedML
27
78
0
16 Jun 2022
On Privacy and Personalization in Cross-Silo Federated Learning
Ziyu Liu
Shengyuan Hu
Zhiwei Steven Wu
Virginia Smith
FedML
22
52
0
16 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLM
AI4CE
22
16
0
15 Jun 2022
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELM
ReLM
LRM
69
2,354
0
15 Jun 2022
Previous
1
2
3
...
92
93
94
95
Next