ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLMDiffM
ArXiv (abs)PDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,897 papers shown
Title
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
150
1
0
03 Feb 2025
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
88
1
0
02 Feb 2025
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Shape from Semantics: 3D Shape Generation from Multi-View Semantics
Liangchen Li
Caoliwen Wang
Yuqi Zhou
Bailin Deng
Juyong Zhang
3DV
129
0
0
01 Feb 2025
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Nuwan T. Attygalle
M. Kljun
Aaron Quigley
Klen Copic Pucihar
Jens Grubert
...
Juri Yoneyama
Alice Toniolo
Angela Miguel
Hirokazu Kato
M. Weerasinghe
DiffM
175
0
0
28 Jan 2025
Visual Generation Without Guidance
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
166
2
0
28 Jan 2025
An analysis of the noise schedule for score-based generative models
An analysis of the noise schedule for score-based generative models
SU StanislasStrasman
Antonio Ocello
Claire Boyer Lpsm
Sylvain Le Corff Lpsm
Vincent Lemaire
DiffM
189
7
0
28 Jan 2025
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Make-A-Texture: Fast Shape-Aware Texture Generation in 3 Seconds
Xiaoyu Xiang
Liat Sless Gorelik
Yuchen Fan
Omri Armstrong
Forrest N. Iandola
Yilei Li
Ita Lifshitz
Rakesh Ranjan
3DGSDiffM
183
5
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
119
1
0
28 Jan 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Daniel Gehrig
Yuqian Fu
Luc Van Gool
133
11
0
28 Jan 2025
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
Jinwei Dong
Xinsheng Wang
Qirong Mao
143
1
0
28 Jan 2025
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
Adil Kaan Akan
Yucel Yemez
DiffMOCL
82
0
0
27 Jan 2025
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance
Yinan Zheng
Ruiming Liang
Kexin Zheng
Jinliang Zheng
Liyuan Mao
...
Weihao Gu
Rui Ai
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
131
17
0
26 Jan 2025
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
Zehong Yan
Peng Qi
Wynne Hsu
Mong Li Lee
85
0
0
24 Jan 2025
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols
John Joon Young Chung
Melissa Roemmele
Max Kreminski
VGen
124
0
0
23 Jan 2025
Neural Radiance Fields for the Real World: A Survey
Neural Radiance Fields for the Real World: A Survey
Wenhui Xiao
Remi Chierchia
Rodrigo Santa Cruz
Xuesong Li
David Ahmedt-Aristizabal
Olivier Salvado
Clinton Fookes
Léo Lebrat
AI4CE
174
0
0
22 Jan 2025
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
M. Gwilliam
Han Cai
Di Wu
Abhinav Shrivastava
Zhiyu Cheng
224
1
0
22 Jan 2025
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
MLLMVLMLRM
166
11
0
21 Jan 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi
Shahar Yadin
Roni Paiss
Omer Tov
Shiran Zada
Ariel Ephrat
T. Michaeli
Inbar Mosseri
Tali Dekel
DiffM
140
5
0
21 Jan 2025
Block Flow: Learning Straight Flow on Data Blocks
Block Flow: Learning Straight Flow on Data Blocks
Zibin Wang
Zhiyuan Ouyang
Xiangyun Zhang
78
0
0
20 Jan 2025
Nested Annealed Training Scheme for Generative Adversarial Networks
Nested Annealed Training Scheme for Generative Adversarial Networks
Chang Wan
Ming-Hsuan Yang
Minglu Li
Yunliang Jiang
Zhonglong Zheng
GAN
129
0
0
20 Jan 2025
Model Synthesis for Zero-Shot Model Attribution
Model Synthesis for Zero-Shot Model Attribution
Tianyun Yang
Juan Cao
Danding Wang
Chang Xu
WIGM
151
4
0
20 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
160
4
0
20 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
103
1
0
20 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
118
1
0
20 Jan 2025
DPCL-Diff: The Temporal Knowledge Graph Reasoning Based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning
DPCL-Diff: The Temporal Knowledge Graph Reasoning Based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning
Yukun Cao
Lisheng Wang
Luobing Huang
DiffM
110
2
0
20 Jan 2025
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style
Haohan Wang
Wei Feng
Yang Lu
Yaoyu Li
Zheng Zhang
Jingjing Lv
Xin Zhu
Jun-Jun Shen
DiffM
177
5
0
20 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CELM&MAVLM
282
27
0
17 Jan 2025
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
Yong-Hyun Park
Sangdoo Yun
Jin-Hwa Kim
Junho Kim
Geonhui Jang
Yonghyun Jeong
Junghyo Jo
Gayoung Lee
167
19
0
17 Jan 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Simplified and Generalized Masked Diffusion for Discrete Data
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
223
105
0
17 Jan 2025
TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping
TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping
Despina Konstantinidou
C. Koutlis
Symeon Papadopoulos
142
3
0
17 Jan 2025
Joint Learning of Depth and Appearance for Portrait Image Animation
Joint Learning of Depth and Appearance for Portrait Image Animation
Xinya Ji
Gaspard Zoss
Prashanth Chandran
Lingchen Yang
Xun Cao
B. Solenthaler
D. Bradley
3DHMDE
133
1
0
15 Jan 2025
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints
Jonathan Nöther
Adish Singla
Goran Radanović
AAML
163
0
0
14 Jan 2025
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
Tharun Anand
Aryan Garg
Kaushik Mitra
VGenDiffM
88
0
0
13 Jan 2025
Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning
Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning
Maomao Li
Lijian Lin
Yunfei Liu
Ye Zhu
Yu Li
DiffMVGen
108
0
0
11 Jan 2025
Has an AI model been trained on your images?
Has an AI model been trained on your images?
Matyáš Boháček
Hany Farid
111
0
0
11 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
178
34
0
10 Jan 2025
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation
Minxing Luo
Zixun Xia
L. Chen
Zhenhang Li
Weichao Zeng
Jinqiao Wang
Wentao Cheng
Yaxing Wang
Yu Zhou
Jian Yang
DiffM
149
1
0
10 Jan 2025
TextToucher: Fine-Grained Text-to-Touch Generation
TextToucher: Fine-Grained Text-to-Touch Generation
Jiahang Tu
Hao Fu
Fengyu Yang
Hanbin Zhao
Chao Zhang
Hui Qian
VLMDiffM
157
12
0
10 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
Xinyu Wang
DiffM
89
6
0
08 Jan 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Zhiyong Yang
P. Wang
Junyang Lin
Xinyu Wang
Wenyu Liu
DiffM
88
0
0
08 Jan 2025
Concept Matching with Agent for Out-of-Distribution Detection
Concept Matching with Agent for Out-of-Distribution Detection
YuXiao Lee
Xiaofeng Cao
Jingcai Guo
Wei Ye
Qing Guo
Yi Chang
92
0
0
08 Jan 2025
XGeM: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
XGeM: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation
Daniele Molino
Francesco Di Feola
E. Faiella
Deborah Fazzini
D. Santucci
Linlin Shen
V. Guarrasi
Paolo Soda
SyDaMedIm
127
1
0
08 Jan 2025
Unity by Diversity: Improved Representation Learning in Multimodal VAEs
Unity by Diversity: Improved Representation Learning in Multimodal VAEs
Thomas M. Sutter
Yang Meng
Andrea Agostini
Daphné Chopard
Norbert Fortin
Julia E. Vogt
Bahbak Shahbaba
Stephan Mandt
SSL
111
2
0
08 Jan 2025
Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models
Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models
Gen Li
Yuling Yan
DiffM
117
23
0
03 Jan 2025
SOEDiff: Efficient Distillation for Small Object Editing
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
173
0
0
03 Jan 2025
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Ethical-Lens: Curbing Malicious Usages of Open-Source Text-to-Image Models
Yuzhu Cai
Sheng Yin
Yuxi Wei
Chenxin Xu
Weibo Mao
Felix Juefei Xu
Siheng Chen
Yanfeng Wang
EGVM
200
3
0
03 Jan 2025
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
Rahul Sajnani
Jeroen Vanbaar
Jie Min
Kapil D. Katyal
Srinath Sridhar
DiffM
169
13
0
03 Jan 2025
Population Aware Diffusion for Time Series Generation
Yang Li
Han Meng
Zhenyu Bi
Ingolv T. Urnes
Haipeng Chen
AI4TS
71
0
0
03 Jan 2025
Neural Network Diffusion
Neural Network Diffusion
Kaili Wang
Dongwen Tang
Boya Zeng
Yida Yin
Zhaopan Xu
Yukun Zhou
Zelin Zang
Trevor Darrell
Zhuang Liu
Yang You
DiffM
137
26
0
03 Jan 2025
DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Feng Han
Kai-xiang Chen
Chao Gong
Zhipeng Wei
Jingjing Chen
Yu-Gang Jiang
89
3
0
03 Jan 2025
Previous
123...111213...969798
Next