Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,897 papers shown

Language Models Understand Us, Poorly Jared Moore LRM 50 4 0 19 Oct 2022
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models Royi Rassin Shauli Ravfogel Yoav Goldberg 74 61 0 19 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models Ricardo Kleinlein Cristina Luna Jiménez Fernando Fernández-Martínez DiffM 47 3 0 19 Oct 2022
Optimizing Hierarchical Image VAEs for Sample Quality Eric Luhman Troy Luhman DRL 75 5 0 18 Oct 2022
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data Zichen Jeff Cui Yibin Wang Nur Muhammad (Mahi) Shafiullah Lerrel Pinto LM&Ro VGen OffRL 100 95 0 18 Oct 2022
Differentially Private Diffusion Models Tim Dockhorn Tianshi Cao Arash Vahdat Karsten Kreis DiffM 89 100 0 18 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation Rui Li Weihua Li Yi Yang Hanyu Wei Jianhua Jiang Quan-wei Bai DiffM 150 11 0 18 Oct 2022
Using Language to Extend to Unseen Domains Lisa Dunlap Clara Mohri Devin Guillory Han Zhang Trevor Darrell Joseph E. Gonzalez Aditi Raghunathan Anja Rohrbach VLM 96 35 0 18 Oct 2022
UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image Dani Valevski Matan Kalman Eyal Molad Eyal Segalis Yossi Matias Yaniv Leviathan DiffM 102 41 0 17 Oct 2022
Bridging the Gap between Artificial Intelligence and Artificial General Intelligence: A Ten Commandment Framework for Human-Like Intelligence Ananta Nair F. Kashani 69 2 0 17 Oct 2022
Non-Contrastive Learning Meets Language-Image Pre-Training Jinghao Zhou Li Dong Zhe Gan Lijuan Wang Furu Wei VLM CLIP 75 26 0 17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models Bahjat Kawar Shiran Zada Oran Lang Omer Tov Hui-Tang Chang Tali Dekel Inbar Mosseri Michal Irani 136 1,105 0 17 Oct 2022
Principled Pruning of Bayesian Neural Networks through Variational Free Energy Minimization Jim Beckers Bart Van Erp Ziyue Zhao K. Kondrashov Bert De Vries AAML 71 6 0 17 Oct 2022
Meta-Learning via Classifier(-free) Diffusion Guidance Elvis Nava Seijin Kobayashi Yifei Yin Robert K. Katzschmann Benjamin Grewe VLM 71 6 0 17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models Shansan Gong Mukai Li Jiangtao Feng Zhiyong Wu Lingpeng Kong 96 334 0 17 Oct 2022
Large-scale Text-to-Image Generation Models for Visual Artists' Creative Works Hyung-Kwon Ko Gwanmo Park Hyeon Jeon Jaemin Jo Juho Kim Jinwook Seo 107 142 0 16 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models Christoph Schuhmann Romain Beaumont Richard Vencu Cade Gordon Ross Wightman ... Srivatsa Kundurthy Katherine Crowson Ludwig Schmidt R. Kaczmarczyk J. Jitsev VLM MLLM CLIP 231 3,520 0 16 Oct 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations Yi-Chun Zhu Hongyu Liu Yibing Song Ziyang Yuan Xintong Han Chun Yuan Qifeng Chen Jue Wang VLM DiffM 113 32 0 14 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion Matthew Baas Kevin Eloff Herman Kamper DiffM 31 4 0 14 Oct 2022
Is synthetic data from generative models ready for image recognition? Ruifei He Shuyang Sun Xin Yu Chuhui Xue Wenqing Zhang Philip Torr Song Bai Xiaojuan Qi 132 302 0 14 Oct 2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning Mahmoud Assran Randall Balestriero Quentin Duval Florian Bordes Ishan Misra Piotr Bojanowski Pascal Vincent Michael G. Rabbat Nicolas Ballas SSL 96 50 0 13 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models Zeyang Sha Zheng Li Ning Yu Yang Zhang DiffM 106 135 0 13 Oct 2022
Self-Guided Diffusion Models Vincent Tao Hu David W. Zhang Yuki M. Asano Gertjan J. Burghouts Cees G. M. Snoek 126 33 0 12 Oct 2022
GOTCHA: Real-Time Video Deepfake Detection via Challenge-Response Govind Mittal Chinmay Hegde Nasir Memon 100 8 0 12 Oct 2022
Modular Flows: Differential Molecular Generation Yogesh Verma Samuel Kaski Markus Heinonen Vikas Garg 88 14 0 12 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation Fangyin Wei Arash Vahdat Francis Williams Zan Gojcic Or Litany Sanja Fidler Karsten Kreis DiffM 157 506 0 12 Oct 2022
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation Chaerin Kong D. Jeon Oh-Hun Kwon Nojun Kwak DiffM 77 17 0 12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks Ben Hutchinson Jason Baldridge Vinodkumar Prabhakaran DiffM 128 34 0 11 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance Chen Henry Wu Fernando de la Torre DiffM 112 69 0 11 Oct 2022
Robust and Controllable Object-Centric Learning through Energy-based Models Ruixiang Zhang Tong Che Boris Ivanovic Renhao Wang Marco Pavone Yoshua Bengio Liam Paull OCL 97 8 0 11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers Tim Dockhorn Arash Vahdat Karsten Kreis DiffM 109 114 0 11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models Matthew Baas Herman Kamper DiffM 86 8 0 11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling Yuntian Deng Noriyuki Kojima Alexander M. Rush DiffM 86 4 0 11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation Jiatao Gu Shuangfei Zhai Yizhe Zhang Miguel Angel Bautista J. Susskind DiffM 103 27 0 10 Oct 2022
Meta-Principled Family of Hyperparameter Scaling Strategies Sho Yaida 111 16 0 10 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention Raphael Tang Linqing Liu Akshat Pandey Zhiying Jiang Gefei Yang K. Kumar Pontus Stenetorp Jimmy J. Lin Ferhan Ture 175 177 0 10 Oct 2022
FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings Jean Ogier du Terrail Samy Ayed Edwige Cyffers Felix Grimberg Chaoyang He ... Sai Praneeth Karimireddy Marco Lorenzi Giovanni Neglia Marc Tommasi M. Andreux FedML 133 158 0 10 Oct 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning Shi-You Xu VLM DiffM 90 14 0 10 Oct 2022
Bridging CLIP and StyleGAN through Latent Alignment for Image Editing Wanfeng Zheng Qiang Li Xiaoyan Guo Pengfei Wan Zhong-ming Wang 122 14 0 10 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains Pierre J. Chambon Christian Blüthgen C. Langlotz Akshay S. Chaudhari DiffM MedIm LM&MA 61 117 0 09 Oct 2022
Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs Taojiannan Yang Haokui Zhang Wenze Hu Chen Chen Xiaoyu Wang ViT 69 0 0 08 Oct 2022
CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable, and Controllable Text-Guided Face Manipulation Chenliang Zhou Fangcheng Zhong Cengiz Öztireli CLIP 145 20 0 08 Oct 2022
Can Artificial Intelligence Reconstruct Ancient Mosaics? Fernando Moral-Andrés Elena Merino-Gómez Pedro Reviriego Fabrizio Lombardi 34 7 0 07 Oct 2022
TAN Without a Burn: Scaling Laws of DP-SGD Tom Sander Pierre Stock Alexandre Sablayrolles FedML 86 43 0 07 Oct 2022
GNM: A General Navigation Model to Drive Any Robot Dhruv Shah A. Sridhar Arjun Bhorkar Noriaki Hirose Sergey Levine 120 119 0 07 Oct 2022
Efficient Diffusion Models for Vision: A Survey Anwaar Ulhaq Naveed Akhtar MedIm 155 68 0 07 Oct 2022
On Distillation of Guided Diffusion Models Chenlin Meng Robin Rombach Ruiqi Gao Diederik P. Kingma Stefano Ermon Jonathan Ho Tim Salimans VLM DiffM 89 536 0 06 Oct 2022
Content-Based Search for Deep Generative Models Daohan Lu Sheng-Yu Wang Nupur Kumari Rohan Agarwal Mia Tang David Bau Jun-Yan Zhu DiffM SyDa 101 6 0 06 Oct 2022
Env-Aware Anomaly Detection: Ignore Style Changes, Stay True to Content! Stefan Smeu Elena Burceanu Andrei Liviu Nicolicioiu Emanuela Haller 81 4 0 06 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts Yunfan Jiang Agrim Gupta Zichen Zhang Guanzhi Wang Yongqiang Dou Yanjun Chen Li Fei-Fei Anima Anandkumar Yuke Zhu Linxi Fan LM&Ro 117 355 0 06 Oct 2022