ResearchTrend.AI
Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,744 papers shown
Title
Color Alignment in Diffusion
Ka Chun Shum
Binh-Son Hua
Duc Thanh Nguyen
Sai-Kit Yeung
65
0
0
09 Mar 2025
Fine-Grained Alignment and Noise Refinement for Compositional Text-to-Image Generation
Amir Mohammad Izadi
Seyed Mohsen Hosseini
Soroush Vafaie Tabar
Ali Abdollahi
Armin Saghafian
M. Baghshah
EGVM
45
0
0
09 Mar 2025
Synthetic Data Generation for Minimum-Exposure Navigation in a Time-Varying Environment using Generative AI Models
Nachiket U. Bapat
Randy C. Paffenroth
Raghvendra V. Cowlagi
52
0
0
09 Mar 2025
Towards More Accurate Personalized Image Generation: Addressing Overfitting and Evaluation Bias
Mingxiao Li
Tingyu Qu
Tinne Tuytelaars
Marie-Francine Moens
EGVM
46
0
0
09 Mar 2025
M³amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification
Mingxiang Cao
Weiying Xie
Xin Zhang
Jiaqing Zhang
Kai Jiang
Jie Lei
Yunsong Li
Mamba
48
0
0
09 Mar 2025
D3DR: Lighting-Aware Object Insertion in Gaussian Splatting
Vsevolod Skorokhodov
N. Durasov
Pascal Fua
3DGS
48
0
0
09 Mar 2025
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
Jian Ma
Qirong Peng
Xu Guo
Chen Chen
H. Lu
Zhenyu Yang
VLM
72
1
0
08 Mar 2025
Boosting the Local Invariance for Better Adversarial Transferability
Bohan Liu
Xiaosen Wang
AAML
65
0
0
08 Mar 2025
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
Xiang Gao
Shuai Yang
Jiaying Liu
DiffM
51
0
0
08 Mar 2025
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
Zengqun Zhao
Ziquan Liu
Yu Cao
Shaogang Gong
Ioannis Patras
50
0
0
07 Mar 2025
Frequency Autoregressive Image Generation with Continuous Tokens
Hu Yu
Hao Luo
Hangjie Yuan
Yu Rong
Feng Zhao
VGen
44
3
0
07 Mar 2025
Accelerating db-A* for Kinodynamic Motion Planning Using Diffusion
Julius Franke
A. Moldagalieva
Pia Hanfeld
Wolfgang Hönig
DiffM
75
0
0
07 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
95
0
0
06 Mar 2025
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
55
0
0
06 Mar 2025
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models
Rui Jiang
Xinghe Fu
Guangcong Zheng
Teng Li
Taiping Yao
Xi Li
DiffM
70
0
0
06 Mar 2025
scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge
Zhen Yu
Jianan Han
Yang Liu
Qingchao Chen
62
0
0
06 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
66
0
0
05 Mar 2025
WarmFed: Federated Learning with Warm-Start for Globalization and Personalization Via Personalized Diffusion Models
Tao Feng
Jie Zhang
Xiangjian Li
Rong Huang
Huashan Liu
Zhijie Wang
FedML
62
0
0
05 Mar 2025
MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao
Qi Zhang
Z. Gong
Zhuojia Wu
Duoqian Miao
38
0
0
04 Mar 2025
LangGas: Introducing Language in Selective Zero-Shot Background Subtraction for Semi-Transparent Gas Leak Detection with a New Dataset
Wenqi Guo
Yiyang Du
Shan Du
75
1
0
04 Mar 2025
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation
Yi Wang
Mushui Liu
Wanggui He
Longxiang Zhang
Z. Huang
...
Yiming Li
Weilong Dai
Mingli Song
Jie Song
Hao Jiang
MLLM
MoE
LRM
83
1
0
03 Mar 2025
WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
Zhipeng Huang
Shaobin Zhuang
Canmiao Fu
Binxin Yang
Ying Zhang
Chong Sun
Zhizheng Zhang
Yali Wang
Chen Li
Zheng-Jun Zha
DiffM
69
2
0
03 Mar 2025
One-shot In-context Part Segmentation
Zhenqi Dai
Ting Liu
X. Zhang
Y. X. Wei
Yanning Zhang
VLM
82
1
0
03 Mar 2025
CacheQuant: Comprehensively Accelerated Diffusion Models
Xuewen Liu
Zhikai Li
Qingyi Gu
DiffM
40
0
0
03 Mar 2025
Interactive Gadolinium-Free MRI Synthesis: A Transformer with Localization Prompt Learning
Linhao Li
Changhui Su
Yu Guo
Huimao Zhang
Dong Liang
K. Shang
MedIm
56
0
0
03 Mar 2025
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation
Jiantao Lin
Xin Yang
Meixi Chen
Yingjie Xu
D. Yan
Leyi Wu
Xinli Xu
Lie Xu
Shunsi Zhang
Ying-Cong Chen
60
1
0
03 Mar 2025
Fine-Grained Controllable Apparel Showcase Image Generation via Garment-Centric Outpainting
Rong Zhang
Jun Wang
Zhiwen Zuo
Jianfeng Dong
W. Li
Chi-Yin Wang
Wenyuan Xu
Xun Wang
DiffM
69
0
0
03 Mar 2025
FaceShot: Bring Any Character into Life
Junyao Gao
Yanan Sun
Fei Shen
Xin Jiang
Zhening Xing
Kai-xiang Chen
Cairong Zhao
CVBM
3DH
47
1
0
02 Mar 2025
Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
Jie Tian
Xiaoye Qu
Zhenyi Lu
Wei Wei
Sichen Liu
Yu-Xi Cheng
DiffM
VGen
44
0
0
02 Mar 2025
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning
Shashank Gupta
Chaitanya Ahuja
Tsung-Yu Lin
Sreya Dutta Roy
Harrie Oosterhuis
Maarten de Rijke
Satya Narayan Shukla
46
1
0
02 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
39
0
0
02 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
56
1
0
02 Mar 2025
Periodic Materials Generation using Text-Guided Joint Diffusion Model
Kishalay Das
Subhojyoti Khastagir
Pawan Goyal
Seung-Cheol Lee
S. Bhattacharjee
Niloy Ganguly
DiffM
34
0
0
01 Mar 2025
Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA
Ojonugwa Oluwafemi Ejiga Peter
Md Mahmudur Rahman
Fahmi Khalifa
DiffM
MedIm
41
1
0
28 Feb 2025
MFSR: Multi-fractal Feature for Super-resolution Reconstruction with Fine Details Recovery
Lianping Yang
Peng Jiao
Jinshan Pan
Hegui Zhu
Su Guo
43
0
0
27 Feb 2025
Tight Inversion: Image-Conditioned Inversion for Real Image Editing
Edo Kadosh
Nir Goren
Or Patashnik
Daniel Garibi
Daniel Cohen-Or
DiffM
74
0
0
27 Feb 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
L. Chen
S. Bai
Wenhao Chai
Weichu Xie
Haozhe Zhao
Leon Vinci
Junyang Lin
Baobao Chang
DiffM
90
4
0
27 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
68
1
0
27 Feb 2025
Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation
Zhi Cen
Huaijin Pi
Sida Peng
Qing Shuai
Yujun Shen
Hujun Bao
Xiaowei Zhou
Ruizhen Hu
VGen
OffRL
67
1
0
27 Feb 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
Thomas Norrenbrock
T. Kaiser
Sovan Biswas
R. Manuvinakurike
Bodo Rosenhahn
62
0
0
27 Feb 2025
Intent Tagging: Exploring Micro-Prompting Interactions for Supporting Granular Human-GenAI Co-Creation Workflows
Frederic Gmeiner
Nicolai Marquardt
Michael Bentley
Hugo Romat
M. Pahud
...
Asta Roseway
Nikolas Martelaro
Kenneth Holstein
K. Hinckley
N. Riche
55
0
0
26 Feb 2025
Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv10
Ranjan Sapkota
Manoj Karkee
45
4
0
26 Feb 2025
Optimal Stochastic Trace Estimation in Generative Modeling
Xinyang Liu
Hengrong Du
Wei Deng
Ruqi Zhang
AI4TS
52
0
0
26 Feb 2025
Diffusion-based Planning with Learned Viability Filters
Nicholas Ioannidis
Daniele Reda
S. Cohan
M. van de Panne
76
0
0
26 Feb 2025
On the Interpolation Effect of Score Smoothing
Zhengdao Chen
DiffM
83
0
0
26 Feb 2025
HRR: Hierarchical Retrospection Refinement for Generated Image Detection
Peipei Yuan
Zijing Xie
Shuo Ye
Hong Chen
Yulong Wang
DiffM
154
1
0
25 Feb 2025
Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training
Botao Ye
Sifei Liu
Xueting Li
Marc Pollefeys
Ming Yang
69
0
0
25 Feb 2025
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Mingkun Zhang
Keping Bi
Wei Chen
J. Guo
Xueqi Cheng
BDL
VLM
52
1
0
25 Feb 2025
FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
Mintong Kang
Vinayshekhar Bannihatti Kumar
Shamik Roy
Abhishek Kumar
Sopan Khosla
Balakrishnan Narayanaswamy
Rashmi Gangadharaiah
50
0
0
25 Feb 2025
Bayesian Optimization for Controlled Image Editing via LLMs
Chengkun Cai
Haoliang Liu
Xu Zhao
Zhongyu Jiang
Tianfang Zhang
Zongkai Wu
Lei Li
BDL
OffRL
103
2
0
25 Feb 2025