ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM
ArXivPDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,756 papers shown
Title
LLM-SmartAudit: Advanced Smart Contract Vulnerability Detection
LLM-SmartAudit: Advanced Smart Contract Vulnerability Detection
Zhiyuan Wei
Jing Sun
Zijiang Zhang
Xianhao Zhang
Meng Li
Zhe Hou
41
4
0
12 Oct 2024
Toward Guidance-Free AR Visual Generation via Condition Contrastive
  Alignment
Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Huayu Chen
Hang Su
Peize Sun
Jun Zhu
VLM
56
3
0
12 Oct 2024
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
Yifeng Xu
Zhenliang He
Shiguang Shan
Xilin Chen
DiffM
32
3
0
12 Oct 2024
MiRAGeNews: Multimodal Realistic AI-Generated News Detection
MiRAGeNews: Multimodal Realistic AI-Generated News Detection
Runsheng Huang
Liam Dugan
Yue Yang
Chris Callison-Burch
44
3
0
11 Oct 2024
RealEra: Semantic-level Concept Erasure via Neighbor-Concept Mining
RealEra: Semantic-level Concept Erasure via Neighbor-Concept Mining
Yufan Liu
Jinyang An
Wanqian Zhang
Ming Li
Dayan Wu
Jingzi Gu
Zheng Lin
Weiping Wang
32
4
0
11 Oct 2024
Audio Description Generation in the Era of LLMs and VLMs: A Review of
  Transferable Generative AI Technologies
Audio Description Generation in the Era of LLMs and VLMs: A Review of Transferable Generative AI Technologies
Yingqiang Gao
Lukas Fischer
Alexa Lintner
Sarah Ebling
41
0
0
11 Oct 2024
Natural Language Induced Adversarial Images
Natural Language Induced Adversarial Images
Xiaopei Zhu
Peiyang Xu
Guanning Zeng
Yingpeng Dong
Xiaolin Hu
AAML
35
0
0
11 Oct 2024
Diffusion-Based Depth Inpainting for Transparent and Reflective Objects
Diffusion-Based Depth Inpainting for Transparent and Reflective Objects
Tianyu Sun
Dingchang Hu
Yixiang Dai
Guijin Wang
DiffM
52
5
0
11 Oct 2024
Diffusion Models Need Visual Priors for Image Generation
Diffusion Models Need Visual Priors for Image Generation
Xiaoyu Yue
Zidong Wang
Zeyu Lu
S. Sun
Meng Wei
Wanli Ouyang
Junlin Wu
Luping Zhou
VLM
58
1
0
11 Oct 2024
Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image
  Generative Models
Generated Bias: Auditing Internal Bias Dynamics of Text-To-Image Generative Models
Abhishek Mandal
Susan Leavy
Suzanne Little
30
1
0
10 Oct 2024
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with
  Heterogeneous Graph Adapter
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter
Yumiao Zhao
Bo Jiang
Xiao Wang
Qin Xu
Jin Tang
VLM
38
0
0
10 Oct 2024
Teddy: Efficient Large-Scale Dataset Distillation via
  Taylor-Approximated Matching
Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching
Ruonan Yu
Songhua Liu
Jingwen Ye
Xinchao Wang
DD
45
4
0
10 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Hefei Ling
Zhen Dong
Lei Zhu
71
14
0
10 Oct 2024
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models
Xiaoxiao He
Ligong Han
Quan Dao
Song Wen
Minhao Bai
...
Hongdong Li
Junzhou Huang
Faez Ahmed
Akash Srivastava
Dimitris Metaxas
DiffM
SyDa
42
5
0
10 Oct 2024
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation
Susan Liang
Chao Huang
Yapeng Tian
Anurag Kumar
Chenliang Xu
DiffM
34
7
0
09 Oct 2024
Positive-Augmented Contrastive Learning for Vision-and-Language
  Evaluation and Training
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training
Sara Sarto
Nicholas Moratelli
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
50
3
0
09 Oct 2024
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Yukang Cao
Liang Pan
Kai Han
Kwan-Yee K. Wong
Ziwei Liu
VGen
43
6
0
09 Oct 2024
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Fu-Yun Wang
Ling Yang
Zhaoyang Huang
Mengdi Wang
Hongsheng Li
47
15
0
09 Oct 2024
Enhancing Vision-Language Model Pre-training with Image-text Pair
  Pruning Based on Word Frequency
Enhancing Vision-Language Model Pre-training with Image-text Pair Pruning Based on Word Frequency
Mingliang Liang
Martha Larson
VLM
CLIP
26
0
0
09 Oct 2024
Patterns of Creativity: How User Input Shapes AI-Generated Visual
  Diversity
Patterns of Creativity: How User Input Shapes AI-Generated Visual Diversity
Maria-Teresa De Rosa Palmini
Eva Cetinic
41
3
0
09 Oct 2024
InstantIR: Blind Image Restoration with Instant Generative Reference
InstantIR: Blind Image Restoration with Instant Generative Reference
Jen-Yuan Huang
Haofan Wang
Qixun Wang
Xu Bai
Hao Ai
Peng-Fei Xing
Jen-tse Huang
30
1
0
09 Oct 2024
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning
Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning
Qianli Ma
Xuefei Ning
Dongrui Liu
Li Niu
Linfeng Zhang
MoMe
62
0
0
09 Oct 2024
Temporal Image Caption Retrieval Competition -- Description and Results
Temporal Image Caption Retrieval Competition -- Description and Results
Jakub Pokrywka
Piotr Wierzchoñ
Kornel Weryszko
Krzysztof Jassem
52
0
0
08 Oct 2024
RelitLRM: Generative Relightable Radiance for Large Reconstruction
  Models
RelitLRM: Generative Relightable Radiance for Large Reconstruction Models
Tianyuan Zhang
Zhengfei Kuang
Haian Jin
Zexiang Xu
Sai Bi
...
Yiwei Hu
Miloš Hašan
William T. Freeman
Kai Zhang
Fujun Luan
3DGS
34
2
0
08 Oct 2024
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing
  Images
Prompting DirectSAM for Semantic Contour Extraction in Remote Sensing Images
Shiyu Miao
Delong Chen
Fan Liu
Chuanyi Zhang
Yanhui Gu
Shengjie Guo
Jun Zhou
34
1
0
08 Oct 2024
Diversity-Rewarded CFG Distillation
Diversity-Rewarded CFG Distillation
Geoffrey Cideron
A. Agostinelli
Johan Ferret
Sertan Girgin
Romuald Elie
Olivier Bachem
Sarah Perrin
Alexandre Ramé
49
2
0
08 Oct 2024
FINALLY: fast and universal speech enhancement with studio-like quality
FINALLY: fast and universal speech enhancement with studio-like quality
Nicholas Babaev
Kirill Tamogashev
Azat Saginbaev
Ivan Shchekotov
Hanbin Bae
Hosang Sung
WonJun Lee
Hoon-Young Cho
Pavel Andreev
39
2
0
08 Oct 2024
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image
  Editing
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
June Suk Choi
Kyungmin Lee
Jongheon Jeong
Saining Xie
Jinwoo Shin
Kimin Lee
DiffM
AAML
46
3
0
08 Oct 2024
TeaserGen: Generating Teasers for Long Documentaries
TeaserGen: Generating Teasers for Long Documentaries
Weihan Xu
Paul Pu Liang
Haven Kim
Julian McAuley
Taylor Berg-Kirkpatrick
Hao-Wen Dong
VGen
VLM
DiffM
34
0
0
08 Oct 2024
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
Saemi Moon
M. Lee
Sangdon Park
Dongwoo Kim
44
1
0
08 Oct 2024
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation
Gihyun Kwon
Jong Chul Ye
DiffM
66
3
0
08 Oct 2024
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
GS-VTON: Controllable 3D Virtual Try-on with Gaussian Splatting
Yukang Cao
Masoud Hadi
Liang Pan
Ziwei Liu
3DGS
DiffM
58
4
0
07 Oct 2024
MetaDD: Boosting Dataset Distillation with Neural Network
  Architecture-Invariant Generalization
MetaDD: Boosting Dataset Distillation with Neural Network Architecture-Invariant Generalization
Yunlong Zhao
Xiaoheng Deng
Xiu Su
Hongyan Xu
Xiuxing Li
Yijing Liu
Shan You
FedML
DD
41
1
0
07 Oct 2024
Compositional Diffusion Models for Powered Descent Trajectory Generation
  with Flexible Constraints
Compositional Diffusion Models for Powered Descent Trajectory Generation with Flexible Constraints
Julia Briden
Yilun Du
Enrico M. Zucchelli
Richard Linares
47
0
0
05 Oct 2024
ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models
  through Reinforcement Learning
ShieldDiff: Suppressing Sexual Content Generation from Diffusion Models through Reinforcement Learning
Dong Han
Salaheldin Mohamed
Yong Li
31
2
0
04 Oct 2024
MDMP: Multi-modal Diffusion for supervised Motion Predictions with
  uncertainty
MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty
Leo Bringer
Joey Wilson
Kira Barton
Maani Ghaffari
DiffM
36
0
0
04 Oct 2024
Estimating Body and Hand Motion in an Ego-sensed World
Estimating Body and Hand Motion in an Ego-sensed World
Brent Yi
Vickie Ye
Maya Zheng
Lea Müller
Georgios Pavlakos
Yi Ma
Jitendra Malik
Angjoo Kanazawa
DiffM
55
6
0
04 Oct 2024
Conditional Enzyme Generation Using Protein Language Models with
  Adapters
Conditional Enzyme Generation Using Protein Language Models with Adapters
Jason Yang
Aadyot Bhatnagar
Jeffrey A. Ruffolo
Ali Madani
36
5
0
04 Oct 2024
CalliffusionV2: Personalized Natural Calligraphy Generation with
  Flexible Multi-modal Control
CalliffusionV2: Personalized Natural Calligraphy Generation with Flexible Multi-modal Control
Qisheng Liao
Liang Li
Yulang Fei
Gus Xia
DiffM
VLM
36
0
0
03 Oct 2024
SteerDiff: Steering towards Safe Text-to-Image Diffusion Models
SteerDiff: Steering towards Safe Text-to-Image Diffusion Models
Hongxiang Zhang
Yifeng He
Hao Chen
33
3
0
03 Oct 2024
NL-Eye: Abductive NLI for Images
NL-Eye: Abductive NLI for Images
Mor Ventura
Michael Toker
Nitay Calderon
Zorik Gekhman
Yonatan Bitton
Roi Reichart
38
1
0
03 Oct 2024
Event-Customized Image Generation
Event-Customized Image Generation
Zhen Wang
Yilei Jiang
Dong Zheng
Jun Xiao
Long Chen
DiffM
31
1
0
03 Oct 2024
Eliminating Oversaturation and Artifacts of High Guidance Scales in
  Diffusion Models
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models
Seyedmorteza Sadat
Otmar Hilliges
Romann M. Weber
DiffM
23
8
0
03 Oct 2024
Stochastic Sampling from Deterministic Flow Models
Stochastic Sampling from Deterministic Flow Models
Saurabh Singh
Ian S. Fischer
41
2
0
03 Oct 2024
CaLMFlow: Volterra Flow Matching using Causal Language Models
CaLMFlow: Volterra Flow Matching using Causal Language Models
Shiyang Zhang
Daniel Levine
Ivan Vrkic
Marco Francesco Bressana
David Zhang
S. Rizvi
Yangtian Zhang
E. Zappala
David van Dijk
27
0
0
03 Oct 2024
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized
  Image Generation
DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He
Haodong Li
Yongzhe Hu
Guibao Shen
Yingjie Cai
Weichao Qiu
Ying-Cong Chen
DiffM
34
2
0
02 Oct 2024
DreamGarden: A Designer Assistant for Growing Games from a Single Prompt
DreamGarden: A Designer Assistant for Growing Games from a Single Prompt
Sam Earle
Samyak Parajuli
Andrzej Banburski-Fahey
39
2
0
02 Oct 2024
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal
Adi Haviv
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Gal Chechik
DiffM
39
3
0
02 Oct 2024
COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based
  Video Generation
COMUNI: Decomposing Common and Unique Video Signals for Diffusion-based Video Generation
Mingzhen Sun
Weining Wang
Xinxin Zhu
Jing Liu
VGen
DiffM
36
0
0
02 Oct 2024
Data Extrapolation for Text-to-image Generation on Small Datasets
Data Extrapolation for Text-to-image Generation on Small Datasets
Senmao Ye
Fei Liu
33
0
0
02 Oct 2024
Previous
123...151617...949596
Next