

Hierarchical Text-Conditional Image Generation with CLIP Latents


13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,759 papers shown
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
32
18
0
08 Jul 2024
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis
Emaad Khwaja
Abdullah Rashwan
Ting Chen
Oliver Wang
Suraj Kothawade
Yeqing Li
DiffM
48
0
0
08 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
84
26
0
08 Jul 2024
Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder
Jia Liu
Changlin Li
Qirui Sun
Jiahui Ming
Chen Fang
Jue Wang
Bing Zeng
Shuaicheng Liu
DiffM
42
3
0
08 Jul 2024
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry X. Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Misha Sra
Pradeep Sen
42
0
0
08 Jul 2024
Image-Conditional Diffusion Transformer for Underwater Image Enhancement
Xingyang Nie
Su Pan
Xiaoyu Zhai
Shifei Tao
Fengzhong Qu
Biao Wang
Huilin Ge
Guojie Xiao
49
2
0
07 Jul 2024
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Danni Yang
Ruohan Dong
Jiayi Ji
Yiwei Ma
Haowei Wang
Xiaoshuai Sun
Rongrong Ji
62
3
0
07 Jul 2024
An Improved Method for Personalizing Diffusion Models
Yan Zeng
Masanori Suganuma
Takayuki Okatani
DiffM
52
1
0
07 Jul 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Haozhe Zhao
Xiaojian Ma
Liang Chen
Shuzheng Si
Rujie Wu
Kaikai An
Peiyu Yu
Minjia Zhang
Qing Li
Baobao Chang
71
46
0
07 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
103
8
0
07 Jul 2024
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
Cheng-Han Chiang
Wei-Chih Chen
Chun-Yi Kuan
Chienchou Yang
Hung-yi Lee
ELM
AI4Ed
54
5
0
07 Jul 2024
Synthetic Data Aided Federated Learning Using Foundation Models
Fatima Abacha
Sin G. Teo
Lucas C. Cordeiro
Mustafa A. Mustafa
FedML
44
2
0
06 Jul 2024
FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning
Boyu Fan
Chenrui Wu
Xiang Su
Pan Hui
FedML
69
2
0
06 Jul 2024
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
Zhekai Chen
Wen Wang
Zhen Yang
Zeqing Yuan
Hao Chen
Chunhua Shen
DiffM
63
1
0
06 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
MLLM
76
30
0
05 Jul 2024
PartCraft: Crafting Creative Objects by Parts
Kam Woh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
56
6
0
05 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
54
7
0
05 Jul 2024
VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Shang Liu
Chaohui Yu
Chenjie Cao
Wen Qian
Fan Wang
DiffM
47
3
0
05 Jul 2024
GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction
Yuxuan Mu
Wei Ji
Chuan Guo
Yilin Wang
Juwei Lu
Xiaofeng Wu
Songcen Xu
Peng Dai
Youliang Yan
Li Cheng
3DGS
67
5
0
05 Jul 2024
Leveraging Latent Diffusion Models for Training-Free In-Distribution Data Augmentation for Surface Defect Detection
Federico Girella
Ziyue Liu
Franco Fummi
Francesco Setti
Marco Cristani
Luigi Capogrosso
64
3
0
04 Jul 2024
Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration
Yuhong Zhang
Hengsheng Zhang
Xinning Chai
Zhengxue Cheng
Rong Xie
Li Song
Wenjun Zhang
DiffM
55
4
0
04 Jul 2024
Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
Benno Krojer
Dheeraj Vattikonda
Luis Lara
Varun Jampani
Eva Portelance
Christopher Pal
Siva Reddy
EGVM
VGen
54
4
0
03 Jul 2024
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu
Gabriele Corso
Tommi Jaakkola
Arash Vahdat
Karsten Kreis
49
13
0
03 Jul 2024
Improved Noise Schedule for Diffusion Training
Tiankai Hang
Shuyang Gu
DiffM
39
9
0
03 Jul 2024
Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation
Xiang Gao
Zhengbo Xu
Junhan Zhao
Jiaying Liu
DiffM
42
8
0
03 Jul 2024
No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models
Seyedmorteza Sadat
Manuel Kansy
Otmar Hilliges
Romann M. Weber
49
11
0
02 Jul 2024
Meta 3D Gen
Raphael Bensadoun
Tom Monnier
Yanir Kleiman
Filippos Kokkinos
Yawar Siddiqui
...
Antoine Toisoul
David Novotny
Oran Gafni
Natalia Neverova
Andrea Vedaldi
54
1
0
02 Jul 2024
Magic Insert: Style-Aware Drag-and-Drop
Nataniel Ruiz
Yuanzhen Li
Neal Wadhwa
Yael Pritch
Michael Rubinstein
David E. Jacobs
Shlomi Fruchter
DiffM
70
7
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
92
35
0
02 Jul 2024
Meta 3D TextureGen: Fast and Consistent Texture Generation for 3D Objects
Raphael Bensadoun
Yanir Kleiman
Idan Azuri
Omri Harosh
Andrea Vedaldi
Natalia Neverova
Oran Gafni
50
27
0
02 Jul 2024
Text-Aware Diffusion for Policy Learning
Calvin Luo
Mandy He
Zilai Zeng
Chen Sun
40
4
0
02 Jul 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
VLM
DiffM
97
6
0
02 Jul 2024
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Dewei Zhou
Yuchen Li
Fan Ma
Zongxin Yang
Yue Yang
104
11
0
02 Jul 2024
Label-free Neural Semantic Image Synthesis
Jiayi Wang
Kevin Laube
Yumeng Li
J. H. Metzen
Shin-I Cheng
Julio Borges
Anna Khoreva
DiffM
64
0
0
01 Jul 2024
FastCLIP: A Suite of Optimization Techniques to Accelerate CLIP Training with Limited Resources
Xiyuan Wei
Fanjiang Ye
Ori Yonay
Xingyu Chen
Baixi Sun
Dingwen Tao
Tianbao Yang
VLM
CLIP
79
2
0
01 Jul 2024
An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations
Weimin Bai
Yifei Wang
Wenzheng Chen
He Sun
61
9
0
01 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
57
5
0
01 Jul 2024
StyleShot: A Snapshot on Any Style
Junyao Gao
Yanchen Liu
Yanan Sun
Yinhao Tang
Yanhong Zeng
Kai Chen
Cairong Zhao
TTA
3DH
VLM
84
15
0
01 Jul 2024
Controlling Face's Frame generation in StyleGAN's latent space operations: Modifying faces to deceive our memory
Agustín Roca
Nicolás Ignacio Britos
CVBM
38
0
0
30 Jun 2024
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
Mushui Liu
Yuhang Ma
Yang Zhen
Jun Dan
Yunlong Yu
Zeng Zhao
Zhipeng Hu
Bai Liu
Changjie Fan
VLM
DiffM
73
14
0
30 Jun 2024
Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation
Yuchuan Tian
Jianhong Han
Hanting Chen
Yuanyuan Xi
Guoyang Zhang
Jie Hu
Chao Xu
Yunhe Wang
ViT
VLM
57
8
0
30 Jun 2024
Unveiling Glitches: A Deep Dive into Image Encoding Bugs within CLIP
Ayush Ranjan
Daniel Wen
Karthik Bhat
39
0
0
30 Jun 2024
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
Peng Dai
Feitong Tan
Qiangeng Xu
David Futschik
Ruofei Du
S. Fanello
Xiaojuan Qi
Yinda Zhang
VGen
43
5
0
29 Jun 2024
Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization
Taeyoung Yun
Sujin Yun
Jaewoo Lee
Jinkyoo Park
OffRL
62
5
0
29 Jun 2024
SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting
S. Sabour
Lily Goli
George Kopanas
Mark J. Matthews
Dmitry Lagun
Leonidas Guibas
Alec Jacobson
David J. Fleet
Andrea Tagliasacchi
59
18
0
28 Jun 2024
Wavelets Are All You Need for Autoregressive Image Generation
Wael Mattar
Idan Levy
Nir Sharon
S. Dekel
55
3
0
28 Jun 2024
Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs
S. Jeong
Mingwei Li
Matthew Berger
Shusen Liu
72
0
0
28 Jun 2024
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models
Nila Masrourisaadat
Nazanin Sedaghatkish
Fatemeh Sarshartehrani
Edward A. Fox
56
6
0
28 Jun 2024
PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
Shufan Li
Harkanwar Singh
Aditya Grover
EGVM
70
2
0
28 Jun 2024
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs
Xin Su
Man Luo
Kris W Pan
Tien Pei Chou
Vasudev Lal
Phillip Howard
68
4
0
28 Jun 2024