ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLMDiffM
ArXiv (abs)PDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,897 papers shown
Title
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
125
2
0
01 Jul 2025
HumanGif: Single-View Human Diffusion with Generative Prior
HumanGif: Single-View Human Diffusion with Generative Prior
Shoukang Hu
Takuya Narihira
Kazumi Fukuda
Ryosuke Sawata
Takashi Shibuya
Yuki Mitsufuji
201
2
0
01 Jul 2025
Dense Feature Interaction Network for Image Inpainting Localization
Dense Feature Interaction Network for Image Inpainting Localization
Ye Yao
Tingfeng Han
Shan Jia
Siwei Lyu
70
1
0
01 Jul 2025
Pixel super-resolved virtual staining of label-free tissue using diffusion models
Pixel super-resolved virtual staining of label-free tissue using diffusion models
Yijie Zhang
Luzhe Huang
N. Pillar
Yuezun Li
Hanlong Chen
Aydogan Ozcan
76
3
0
01 Jul 2025
A Narrative Review on Large AI Models in Lung Cancer Screening, Diagnosis, and Treatment Planning
A Narrative Review on Large AI Models in Lung Cancer Screening, Diagnosis, and Treatment Planning
Jiachen Zhong
Yiting Wang
Di Zhu
Ziwei Wang
LM&MAAI4CE
30
1
0
01 Jul 2025
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
How to Train your Text-to-Image Model: Evaluating Design Choices for Synthetic Training Captions
Manuel Brack
Sudeep Katakol
Felix Friedrich
P. Schramowski
Hareesh Ravi
Kristian Kersting
Ajinkya Kale
9
0
0
20 Jun 2025
Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models
Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models
Semin Kim
Yeonwoo Cha
Jaehoon Yoo
Seunghoon Hong
EGVM
27
0
0
20 Jun 2025
Noise-Informed Diffusion-Generated Image Detection with Anomaly Attention
Noise-Informed Diffusion-Generated Image Detection with Anomaly Attention
Weinan Guan
Wei Wang
Bo Peng
Ziwen He
Jing Dong
Haonan Cheng
DiffM
16
0
0
20 Jun 2025
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
Wenyang Luo
Haina Qin
Zewen Chen
L. xilinx Wang
Dandan Zheng
Yuming Li
Yufan Liu
B. Li
Weiming Hu
14
0
0
20 Jun 2025
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation
Giulia Bertazzini
Chiara Albisani
Daniele Baracchi
Dasara Shullani
Roberto Verdecchia
9
0
0
20 Jun 2025
Watermarking Autoregressive Image Generation
Watermarking Autoregressive Image Generation
Nikola Jovanović
Ismail Labiad
Tomáš Souček
Martin Vechev
Pierre Fernandez
WIGM
21
0
0
19 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
43
0
0
18 Jun 2025
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models
Tian Xia
Fabio De Sousa Ribeiro
Rajat Rasal
Avinash Kori
Raghav Mehta
Ben Glocker
DiffM
24
0
0
17 Jun 2025
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Black Forest Labs
Stephen Batifol
A. Blattmann
Frederic Boesel
Saksham Consul
...
Dustin Podell
Robin Rombach
Harry Saini
Axel Sauer
Luke Smith
DiffM
15
0
0
17 Jun 2025
Toward Rich Video Human-Motion2D Generation
Toward Rich Video Human-Motion2D Generation
Ruihao Xi
Xuekuan Wang
Yongcheng Li
Shuhua Li
Zichen Wang
Yiwei Wang
Feng Wei
Cairong Zhao
VGen
16
0
0
17 Jun 2025
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection
ASMR: Augmenting Life Scenario using Large Generative Models for Robotic Action Reflection
Shang-Chi Tsai
Seiya Kawano
Angel García Contreras
Koichiro Yoshino
Yun-Nung Chen
LM&Ro
17
2
0
16 Jun 2025
Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching
Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching
Weimin Bai
Yubo Li
Wenzheng Chen
Weijian Luo
H. Sun
15
0
0
16 Jun 2025
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention
Jeonghoon Park
Juyoung Lee
Chaeyeon Chung
Jaeseong Lee
Jaegul Choo
Jindong Gu
9
0
0
16 Jun 2025
Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing
Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing
Zhuoying Li
Zhu Xu
Yuxin Peng
Yang Liu
7
0
0
15 Jun 2025
Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models
Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models
Zongyu Wu
Minhua Lin
Zhiwei Zhang
Fali Wang
Xianren Zhang
Xiang Zhang
Suhang Wang
13
0
0
14 Jun 2025
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
Sibo Dong
Ismail Shaheen
Maggie Shen
Rupayan Mallick
Sarah Adel Bargal
DiffM
16
0
0
13 Jun 2025
CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection
CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection
Byeongchan Lee
John Won
Seunghyun Lee
Jinwoo Shin
22
0
0
13 Jun 2025
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation
Zhiyang Xu
Jiuhai Chen
Zhaojiang Lin
Xichen Pan
Lifu Huang
...
Di Jin
Michihiro Yasunaga
Lili Yu
Xi Lin
Shaoliang Nie
111
1
0
12 Jun 2025
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Chun-Mei Feng
Kai-An Yu
Xinxing Xu
Salman Khan
Rick Siow Mong Goh
Wangmeng Zuo
Yong Liu
VLM
133
0
0
12 Jun 2025
DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning
DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning
Dongxu Liu
Yuang Peng
Haomiao Tang
Yuwei Chen
Chunrui Han
Zheng Ge
Daxin Jiang
Mingxue Liao
DiffM
74
0
0
11 Jun 2025
Consistent Story Generation with Asymmetry Zigzag Sampling
Consistent Story Generation with Asymmetry Zigzag Sampling
Mingxiao Li
Mang Ning
Marie-Francine Moens
DiffM
77
0
0
11 Jun 2025
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
Yukang Feng
Jianwen Sun
Chuanhao Li
Zizhen Li
Jiaxin Ai
...
Yifan Chang
Sizhuo Zhou
Shenglin Zhang
Yu Dai
Kaipeng Zhang
MLLMEGVM
80
0
0
11 Jun 2025
Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models
Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models
Defang Chen
Zhenyu Zhou
C. Wang
Siwei Lyu
DiffM
58
0
0
11 Jun 2025
SAGE: Exploring the Boundaries of Unsafe Concept Domain with Semantic-Augment Erasing
SAGE: Exploring the Boundaries of Unsafe Concept Domain with Semantic-Augment Erasing
Hongguang Zhu
Y. X. Wei
Mengyu Wang
Siyu Jiao
Yan Fang
Jiannan Huang
Yao Zhao
59
0
0
11 Jun 2025
Bias Analysis in Unconditional Image Generative Models
Xiaofeng Zhang
Michelle Lin
Simon Lacoste-Julien
Aaron Courville
Yash Goyal
18
0
0
10 Jun 2025
MagCache: Fast Video Generation with Magnitude-Aware Cache
Zehong Ma
Longhui Wei
Feng Wang
Shiliang Zhang
Q. Tian
26
0
0
10 Jun 2025
Generative Modeling of Weights: Generalization or Memorization?
Generative Modeling of Weights: Generalization or Memorization?
Boya Zeng
Yida Yin
Zhiqiu Xu
Zhuang Liu
DiffM
17
0
0
09 Jun 2025
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
William Ljungbergh
Bernardo Taveira
Wenzhao Zheng
Adam Tonderski
Chensheng Peng
...
Christoffer Petersson
Michael Felsberg
Kurt Keutzer
Masayoshi Tomizuka
Wei Zhan
14
0
0
09 Jun 2025
Evaluating Robustness in Latent Diffusion Models via Embedding Level Augmentation
Evaluating Robustness in Latent Diffusion Models via Embedding Level Augmentation
Boris Martirosyan
Alexey Karmanov
DiffM
10
0
0
09 Jun 2025
VIVAT: Virtuous Improving VAE Training through Artifact Mitigation
VIVAT: Virtuous Improving VAE Training through Artifact Mitigation
Lev Novitskiy
Viacheslav Vasilev
Maria Kovaleva
V. Arkhipkin
Denis Dimitrov
VGen
16
0
0
09 Jun 2025
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems
Aniket Rege
Zinnia Nie
Mahesh Ramesh
Unmesh Raskar
Zhuoran Yu
Aditya Kusupati
Yong Jae Lee
Ramya Korlakai Vinayak
12
0
0
09 Jun 2025
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation
Jingjing Chang
Yixiao Fang
Peng Xing
Shuhan Wu
Wei Cheng
Rui Wang
Xianfang Zeng
Gang Yu
H. Chen
EGVMVLM
18
0
0
09 Jun 2025
Reinforcing Multimodal Understanding and Generation with Dual Self-rewards
Reinforcing Multimodal Understanding and Generation with Dual Self-rewards
Jixiang Hong
Yiran Zhang
Guanzhong Wang
Yi Liu
Ji-Rong Wen
Rui Yan
LRM
16
0
0
09 Jun 2025
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
H. Kim
Donghyun Kim
Suhyun Kim
DiffM
23
1
0
09 Jun 2025
Dreamland: Controllable World Creation with Simulator and Generative Models
Dreamland: Controllable World Creation with Simulator and Generative Models
Sicheng Mo
Ziyang Leng
Leon Liu
Weizhen Wang
Honglin He
Bolei Zhou
VGen
10
0
0
09 Jun 2025
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation
Andrew Z. Wang
Songwei Ge
Tero Karras
Ming-Yu Liu
Yogesh Balaji
23
0
0
09 Jun 2025
Hidden Bias in the Machine: Stereotypes in Text-to-Image Models
Hidden Bias in the Machine: Stereotypes in Text-to-Image Models
Sedat Porikli
Vedat Porikli
9
0
0
09 Jun 2025
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Kevin Rojas
Yuchen Zhu
Sichen Zhu
Felix X.-F. Ye
Molei Tao
DiffM
19
0
0
09 Jun 2025
Path Integral Optimiser: Global Optimisation via Neural Schrödinger-Föllmer Diffusion
Path Integral Optimiser: Global Optimisation via Neural Schrödinger-Föllmer Diffusion
Max McGuinness
Eirik Fladmark
Francisco Vargas
13
0
0
07 Jun 2025
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Yao Ni
Song Wen
Piotr Koniusz
A. Cherian
9
0
0
06 Jun 2025
Learning to Weight Parameters for Data Attribution
Learning to Weight Parameters for Data Attribution
Shuangqi Li
Hieu M. Le
Jingyi Xu
Mathieu Salzmann
TDIDiffM
61
0
0
06 Jun 2025
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Jiatao Gu
Tianrong Chen
David Berthelot
Huangjie Zheng
Yuyang Wang
Ruixiang Zhang
Laurent Dinh
Miguel Angel Bautista
Josh Susskind
Shuangfei Zhai
41
0
0
06 Jun 2025
Exponential Family Variational Flow Matching for Tabular Data Generation
Exponential Family Variational Flow Matching for Tabular Data Generation
Andrés Guzmán-Cordero
Floor Eijkelboom
Jan-Willem van de Meent
50
0
0
06 Jun 2025
Gen-n-Val: Agentic Image Data Generation and Validation
Jing-En Huang
I-Sheng Fang
Tzuhsuan Huang
Chih-Yu Wang
Jun-Cheng Chen
VLM
110
0
0
05 Jun 2025
MARBLE: Material Recomposition and Blending in CLIP-Space
Ta-Ying Cheng
Prafull Sharma
Mark Boss
Varun Jampani
DiffM
90
0
0
05 Jun 2025
1234...969798
Next