2103.17249
Cited By
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
31 March 2021
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIP
VLM
Papers citing "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery"
50 / 311 papers shown
Title
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
Hongyang Zhu
Haipeng Liu
Bo Fu
Yang Wang
DiffM
35
0
0
08 May 2025
CapsFake: A Multimodal Capsule Network for Detecting Instruction-Guided Deepfakes
Tuan Nguyen
Naseem Khan
Issa Khalil
AAML
64
0
0
27 Apr 2025
Distilling Textual Priors from LLM to Efficient Image Fusion
Ran Zhang
Xuanhua He
Ke Cao
L. Liu
Li Zhang
Man Zhou
Jie Zhang
29
0
0
09 Apr 2025
Language-Depth Navigated Thermal and Visible Image Fusion
Jinchang Zhang
Zijun Li
Guoyu Lu
MDE
66
1
0
11 Mar 2025
Bayesian Optimization for Controlled Image Editing via LLMs
Chengkun Cai
Haoliang Liu
Xu Zhao
Zhongyu Jiang
Tianfang Zhang
Zongkai Wu
Lei Li
Jenq-Neng Hwang
BDL
OffRL
103
2
0
25 Feb 2025
UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting
Haoyuan Li
Yanpeng Zhou
Tao Tang
Jifei Song
Yihan Zeng
Michael C. Kampffmeyer
Hang Xu
Xiaodan Liang
3DGS
67
1
0
25 Feb 2025
Transfer Learning with Pre-trained Conditional Generative Models
Shin'ya Yamaguchi
Sekitoshi Kanai
Atsutoshi Kumagai
Daiki Chijiwa
H. Kashima
VLM
CLL
BDL
DiffM
148
5
0
21 Feb 2025
DFCon: Attention-Driven Supervised Contrastive Learning for Robust Deepfake Detection
MD Sadik Hossain Shanto
Mahir Labib Dihan
Souvik Ghosh
Riad Ahmed Anonto
Hafijul Hoque Chowdhury
...
Rakib Ahsan
Md Tanvir Hassan
MD Roqunuzzaman Sojib
Sheikh Azizul Hakim
M. Saifur Rahman
CVBM
71
0
0
28 Jan 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi
Shahar Yadin
Roni Paiss
Omer Tov
Shiran Zada
Ariel Ephrat
T. Michaeli
Inbar Mosseri
Tali Dekel
DiffM
103
2
0
21 Jan 2025
Continual Test-Time Adaptation for Single Image Defocus Deblurring via Causal Siamese Networks
Shuang Cui
Yi Li
Jiangmeng Li
Xiongxin Tang
Bing-Huang Su
Fanjiang Xu
Hui Xiong
53
0
0
15 Jan 2025
AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models
Yongjian Wu
Yang Zhou
Jiya Saiyin
Bingzheng Wei
M. Lai
Jianzhong Shou
Yan Xu
VLM
MedIm
27
1
0
22 Oct 2024
Flex: End-to-End Text-Instructed Visual Navigation from Foundation Model Features
Makram Chahine
Alex Quach
Alaa Maalouf
Tsun-Hsuan Wang
Daniela Rus
26
0
0
16 Oct 2024
Revealing Directions for Text-guided 3D Face Editing
Zhuo Chen
Yichao Yan
Sehngqi Liu
Yuhao Cheng
Weiming Zhao
Lincheng Li
Mengxiao Bi
Xiaokang Yang
DiffM
37
0
0
07 Oct 2024
Connecting Dreams with Visual Brainstorming Instruction
Yasheng Sun
Bohan Li
Mingchen Zhuge
Deng-Ping Fan
Salman Khan
F. Khan
Hideki Koike
DiffM
42
0
0
14 Aug 2024
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation
Jiwook Kim
Seonho Lee
Jaeyo Shin
Jiho Choi
Hyunjung Shim
DiffM
50
0
0
16 Jul 2024
Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs
S. Jeong
Mingwei Li
Matthew Berger
Shusen Liu
49
0
0
28 Jun 2024
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
Rotem Shalev-Arkushin
Aharon Azulay
Tavi Halperin
Eitan Richardson
Amit H. Bermano
Ohad Fried
DiffM
49
0
0
20 Jun 2024
Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning
Amandeep Kumar
Muhammad Awais
Sanath Narayan
Hisham Cholakkal
Salman Khan
Rao Muhammad Anwer
45
0
0
06 Jun 2024
Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation
Hubert Kompanowski
Binh-Son Hua
DiffM
64
3
0
05 Jun 2024
ComFace: Facial Representation Learning with Synthetic Data for Comparing Faces
Yusuke Akamatsu
Terumi Umematsu
Hitoshi Imaoka
Shizuko Gomi
Hideo Tsurushima
97
0
0
25 May 2024
Future You: A Conversation with an AI-Generated Future Self Reduces Anxiety, Negative Emotions, and Increases Future Self-Continuity
Pat Pataranutaporn
Kavin Winson
Peggy Yin
Auttasak Lapapirojn
Pichayoot Ouppaphan
Monchai Lertsutthiwong
Pattie Maes
Hal E. Hershfield
33
6
0
21 May 2024
ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image Editing
Ying Jin
Pengyang Ling
Xiao-wen Dong
Pan Zhang
Jiaqi Wang
Dahua Lin
34
2
0
18 May 2024
Generative Unlearning for Any Identity
Juwon Seo
Sung-Hoon Lee
Tae-Young Lee
Seungjun Moon
Gyeong-Moon Park
45
5
0
16 May 2024
Semantic Contextualization of Face Forgery: A New Definition, Dataset, and Detection Method
Mian Zou
Baosheng Yu
Yibing Zhan
Siwei Lyu
Kede Ma
CVBM
56
3
0
14 May 2024
SignAvatar: Sign Language 3D Motion Reconstruction and Generation
Lu Dong
Lipisha Chaudhary
Fei Xu
Xiao Wang
Mason Lary
Ifeoma Nwogu
SLR
34
3
0
13 May 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
44
1
0
21 Apr 2024
Uncovering the Text Embedding in Text-to-Image Diffusion Models
Huikang Yu
Hao Luo
Fan Wang
Feng Zhao
31
10
0
01 Apr 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
44
2
0
31 Mar 2024
DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
Stella Bounareli
Christos Tzelepis
Vasileios Argyriou
Ioannis Patras
Georgios Tzimiropoulos
DiffM
45
7
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Björn Ommer
DiffM
LM&Ro
70
11
0
25 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
34
5
0
14 Mar 2024
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Jinlu Zhang
Yiyi Zhou
Qiancheng Zheng
Xiaoxiong Du
Gen Luo
Jun Peng
Xiaoshuai Sun
Rongrong Ji
3DH
27
3
0
11 Mar 2024
Scene Depth Estimation from Traditional Oriental Landscape Paintings
Sungho Kang
Yeonghyeon Park
H. Park
Juneho Yi
52
0
0
06 Mar 2024
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
Huan Ma
Yan Zhu
Changqing Zhang
Peilin Zhao
Baoyuan Wu
Long-Kai Huang
Qinghua Hu
Bing Wu
VLM
69
1
0
01 Mar 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
EGVM
66
85
0
27 Feb 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
35
0
0
08 Feb 2024
LanDA: Language-Guided Multi-Source Domain Adaptation
Zhenbin Wang
Lei Zhang
Lituan Wang
Minjuan Zhu
35
10
0
25 Jan 2024
UniHDA: A Unified and Versatile Framework for Multi-Modal Hybrid Domain Adaptation
Hengjia Li
Yang Liu
Yuqi Lin
Zhanwei Zhang
Yibo Zhao
...
Tu Zheng
Zheng Yang
Yuchun Jiang
Boxi Wu
Deng Cai
DiffM
36
0
0
23 Jan 2024
CCA: Collaborative Competitive Agents for Image Editing
Tiankai Hang
Shuyang Gu
Dong Chen
Xin Geng
Baining Guo
33
5
0
23 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
88
57
0
22 Jan 2024
From Text to Pixels: A Context-Aware Semantic Synergy Solution for Infrared and Visible Image Fusion
Xingyuan Li
Yang Zou
Jinyuan Liu
Zhiying Jiang
Long Ma
Xin-Yue Fan
Risheng Liu
51
4
0
31 Dec 2023
Cross Initialization for Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Haoran Xie
Qiping Wang
Qing Li
Xudong Mao
DiffM
35
7
0
26 Dec 2023
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
36
8
0
22 Dec 2023
Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models
Fei Pan
Sangryul Jeon
Brian Wang
Frank Mckenna
Stella X. Yu
44
2
0
19 Dec 2023
Mask Grounding for Referring Image Segmentation
Yong Xien Chng
Henry Zheng
Yizeng Han
Xuchong Qiu
Gao Huang
ISeg
ObjD
37
15
0
19 Dec 2023
PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns
Shuliang Ning
Duomin Wang
Yipeng Qin
Zirong Jin
Baoyuan Wang
Xiaoguang Han
DiffM
32
11
0
07 Dec 2023
DemoCaricature: Democratising Caricature Generation with a Rough Sketch
Dar-Yen Chen
A. Bhunia
Subhadeep Koley
Aneeshan Sain
Pinaki Nath Chowdhury
Yi-Zhe Song
26
8
0
07 Dec 2023
AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing
Fan Yang
Tianyi Chen
Xiaosheng He
Zhongang Cai
Lei Yang
Si Wu
Guosheng Lin
30
9
0
03 Dec 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
38
0
0
30 Nov 2023
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
31
16
0
28 Nov 2023