ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM
ArXivPDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,757 papers shown
Title
A new approach for encoding code and assisting code understanding
A new approach for encoding code and assisting code understanding
Mengdan Fan
Changde Du
Haiyan Zhao
Zhi Jin
51
0
0
01 Aug 2024
Detecting, Explaining, and Mitigating Memorization in Diffusion Models
Detecting, Explaining, and Mitigating Memorization in Diffusion Models
Yuxin Wen
Yuchen Liu
Chen Chen
Lingjuan Lyu
38
51
0
31 Jul 2024
Conditioned Prompt-Optimization for Continual Deepfake Detection
Conditioned Prompt-Optimization for Continual Deepfake Detection
Francesco Laiti
Benedetta Liberatori
Thomas De Min
Elisa Ricci
45
3
0
31 Jul 2024
Fine-gained Zero-shot Video Sampling
Fine-gained Zero-shot Video Sampling
Dengsheng Chen
Jie Hu
Javier Segovia-Aguas
Enhua Wu
VGen
DiffM
44
0
0
31 Jul 2024
Big Cooperative Learning
Big Cooperative Learning
Yulai Cong
AI4CE
44
0
0
31 Jul 2024
DEF-oriCORN: efficient 3D scene understanding for robust
  language-directed manipulation without demonstrations
DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations
Dongwon Son
Sanghyeon Son
Jaehyung Kim
Beomjoon Kim
LM&Ro
55
0
0
31 Jul 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
50
45
0
31 Jul 2024
Add-SD: Rational Generation without Manual Reference
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
45
0
0
30 Jul 2024
Autonomous Improvement of Instruction Following Skills via Foundation
  Models
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
37
11
0
30 Jul 2024
Learning Feature-Preserving Portrait Editing from Generated Pairs
Learning Feature-Preserving Portrait Editing from Generated Pairs
Bowei Chen
Tiancheng Zhi
Peihao Zhu
Shen Sang
Jing Liu
Linjie Luo
DiffM
35
0
0
29 Jul 2024
Contrasting Deepfakes Diffusion via Contrastive Learning and
  Global-Local Similarities
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
Lorenzo Baraldi
Federico Cocchi
Marcella Cornia
Lorenzo Baraldi
Alessandro Nicolosi
Rita Cucchiara
43
8
0
29 Jul 2024
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing
Ekaterina Iakovleva
Fabio Pizzati
Philip Torr
Stéphane Lathuiliere
DiffM
44
0
0
29 Jul 2024
Diffusion Feedback Helps CLIP See Better
Diffusion Feedback Helps CLIP See Better
Wenxuan Wang
Quan-Sen Sun
Fan Zhang
Yepeng Tang
Jing Liu
Xinlong Wang
VLM
56
14
0
29 Jul 2024
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion
  Models
DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models
Jing Yang
Runping Xi
Yingxin Lai
Xun Lin
Zitong Yu
DiffM
36
1
0
29 Jul 2024
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and
  Disentangled Multi-Modality Fusion
MambaGesture: Enhancing Co-Speech Gesture Generation with Mamba and Disentangled Multi-Modality Fusion
Chencan Fu
Yabiao Wang
Jiangning Zhang
Zhengkai Jiang
Xiaofeng Mao
Jiafu Wu
Weijian Cao
Chengjie Wang
Yanhao Ge
Yong Liu
Mamba
65
2
0
29 Jul 2024
Garment Animation NeRF with Color Editing
Garment Animation NeRF with Color Editing
Renke Wang
Meng Zhang
Jun Li
Jian Yang
AI4CE
3DH
64
0
0
29 Jul 2024
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular
  Transformer
Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer
Yang Wu
Kaihua Zhang
Jianjun Qian
Jin Xie
Jian Yang
DiffM
49
4
0
29 Jul 2024
Multi-Modal CLIP-Informed Protein Editing
Multi-Modal CLIP-Informed Protein Editing
Mingze Yin
Hanjing Zhou
Yiheng Zhu
Miao Lin
YiXuan Wu
...
Hongxia Xu
Chang-Yu Hsieh
Tingjun Hou
Jintai Chen
Jian Wu
53
7
0
27 Jul 2024
SHIC: Shape-Image Correspondences with no Keypoint Supervision
SHIC: Shape-Image Correspondences with no Keypoint Supervision
Aleksandar Shtedritski
Christian Rupprecht
Andrea Vedaldi
3DPC
3DH
3DV
35
3
0
26 Jul 2024
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments
Yu-Yun Tseng
Tanusree Sharma
Lotus Zhang
Abigale Stangl
Leah Findlater
Yang Wang
Danna Gurari
81
0
0
25 Jul 2024
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
ReCorD: Reasoning and Correcting Diffusion for HOI Generation
Jian-Yu Jiang-Lin
Kang-Yang Huang
Ling Lo
Yi-Ning Huang
Terence Lin
Jhih-Ciang Wu
Hong-Han Shuai
Wen-Huang Cheng
DiffM
31
5
0
25 Jul 2024
DragText: Rethinking Text Embedding in Point-based Image Editing
DragText: Rethinking Text Embedding in Point-based Image Editing
Gayoon Choi
Taejin Jeong
Sujung Hong
Jaehoon Joo
Seong Jae Hwang
DiffM
55
1
0
25 Jul 2024
Multi-physics Simulation Guided Generative Diffusion Models with
  Applications in Fluid and Heat Dynamics
Multi-physics Simulation Guided Generative Diffusion Models with Applications in Fluid and Heat Dynamics
Naichen Shi
Hao Yan
Shenghan Guo
Raed Al Kontar
DiffM
AI4CE
43
0
0
25 Jul 2024
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment
Shenghong Dai
Shiqi Jiang
Yifan Yang
Ting Cao
Mo Li
Suman Banerjee
Lili Qiu
49
2
0
25 Jul 2024
Diffusion Models for Multi-Task Generative Modeling
Diffusion Models for Multi-Task Generative Modeling
Changyou Chen
Han Ding
Bunyamin Sisman
Yi Tian Xu
Ouye Xie
Benjamin Z. Yao
Son Dinh Tran
Belinda Zeng
DiffM
50
4
0
24 Jul 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image
  Animation
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Zhenzhi Wang
Yixuan Li
Yanhong Zeng
Youqing Fang
Yuwei Guo
...
Jing Tan
Kai Chen
Tianfan Xue
Bo Dai
Dahua Lin
VGen
3DH
48
18
0
24 Jul 2024
PreciseControl: Enhancing Text-To-Image Diffusion Models with
  Fine-Grained Attribute Control
PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control
Rishubh Parihar
VS Sachidanand
Sabariswaran Mani
Tejan Karmali
R. V. Babu
DiffM
49
12
0
24 Jul 2024
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Lirui Zhao
Tianshuo Yang
Wenqi Shao
Yuxin Zhang
Yu Qiao
Ping Luo
Kaipeng Zhang
Rongrong Ji
DiffM
53
3
0
24 Jul 2024
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging
  Conditions
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions
Fabio Tosi
Pierluigi Zama Ramirez
Matteo Poggi
DiffM
MQ
MDE
40
9
0
23 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
41
1
0
23 Jul 2024
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion
  Models
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models
Zhenyu Xie
Haoye Dong
Yufei Gao
Zehua Ma
Xiaodan Liang
DiffM
50
3
0
23 Jul 2024
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D
  Diffusion Priors
DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors
Zizheng Yan
Jiapeng Zhou
Fanpeng Meng
Yushuang Wu
Lingteng Qiu
Zisheng Ye
Shuguang Cui
Guanying Chen
Xiaoguang Han
DiffM
43
4
0
23 Jul 2024
CloudFixer: Test-Time Adaptation for 3D Point Clouds via
  Diffusion-Guided Geometric Transformation
CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation
Hajin Shim
Changhun Kim
Eunho Yang
TTA
41
5
0
23 Jul 2024
Diffusion Models as Optimizers for Efficient Planning in Offline RL
Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang
Yunqiang Pei
Guoqing Wang
Yangming Zhang
Yang Yang
Peng Wang
H. Shen
OffRL
47
0
0
23 Jul 2024
Reconstructing Training Data From Real World Models Trained with
  Transfer Learning
Reconstructing Training Data From Real World Models Trained with Transfer Learning
Yakir Oz
Gilad Yehudai
Gal Vardi
Itai Antebi
Michal Irani
Niv Haim
43
2
0
22 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a
  Micro-Budget
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
47
9
0
22 Jul 2024
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Jiahang Tu
Wei Ji
Han Zhao
Chao Zhang
Roger Zimmermann
Hui Qian
43
5
0
22 Jul 2024
WebRPG: Automatic Web Rendering Parameters Generation for Visual
  Presentation
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
Zirui Shao
Feiyu Gao
Hangdi Xing
Zepeng Zhu
Zhi Yu
Jiajun Bu
Qi Zheng
Cong Yao
36
2
0
22 Jul 2024
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
DiffX: Guide Your Layout to Cross-Modal Generative Modeling
Zeyu Wang
Jingyu Lin
Yifei Qian
Yi Huang
Shicen Tian
...
Qu Yang
Lan Du
Cunjian Chen
Yufei Guo
Kejie Huang
DiffM
VLM
33
2
0
22 Jul 2024
Text2Place: Affordance-aware Text Guided Human Placement
Text2Place: Affordance-aware Text Guided Human Placement
Rishubh Parihar
Harsh Gupta
VS Sachidanand
R. V. Babu
DiffM
47
5
0
22 Jul 2024
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Ruoyu Feng
Tao Yu
Xin Jin
Xiaoyuan Yu
Lei Xiao
Zhibo Chen
VLM
39
1
0
21 Jul 2024
Assessing Sample Quality via the Latent Space of Generative Models
Assessing Sample Quality via the Latent Space of Generative Models
Jingyi Xu
Hieu M. Le
Dimitris Samaras
MedIm
51
2
0
21 Jul 2024
Distilling Vision-Language Foundation Models: A Data-Free Approach via
  Prompt Diversification
Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification
Yunyi Xuan
Weijie Chen
Shicai Yang
Di Xie
Luojun Lin
Yueting Zhuang
VLM
45
4
0
21 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
65
18
0
21 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current
  Status, Challenges, and Perspectives
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
50
23
0
20 Jul 2024
CoCoG-2: Controllable generation of visual stimuli for understanding
  human concept representation
CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation
Chen Wei
Jiachen Zou
Dietmar Heinke
Quanying Liu
40
0
0
20 Jul 2024
Diffusion Models as Data Mining Tools
Diffusion Models as Data Mining Tools
Ioannis Siglidis
Aleksander Holynski
Alexei A. Efros
Mathieu Aubry
Shiry Ginosar
DiffM
MedIm
49
3
0
20 Jul 2024
Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic
  Rewards via Failure Prompts
Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts
Yanting Yang
Minghao Chen
Qibo Qiu
Jiahao Wu
Wenxiao Wang
Binbin Lin
Ziyu Guan
Xiaofei He
LM&Ro
50
2
0
20 Jul 2024
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text
  Design and Generation
Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation
Yuhang Bai
Zichuan Huang
Wenshuo Gao
Shuai Yang
Jiaying Liu
49
5
0
20 Jul 2024
FedDM: Enhancing Communication Efficiency and Handling Data
  Heterogeneity in Federated Diffusion Models
FedDM: Enhancing Communication Efficiency and Handling Data Heterogeneity in Federated Diffusion Models
Jayneel Vora
Nader Bouacida
Aditya Krishnan
Prasant Mohapatra
FedML
60
2
0
20 Jul 2024
Previous
123...212223...949596
Next