ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.06985
  4. Cited By
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation

BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation

11 May 2025
Panwen Hu
Jiehui Huang
Qiang Sun
Xiaodan Liang
    DiffMVGen
ArXiv (abs)PDFHTML

Papers citing "BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation"

48 / 48 papers shown
Title
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers
D. She
Mushui Liu
Jingxuan Pang
Jin Wang
Zhen Yang
...
Yi Wang
Qihan Huang
Haobin Tang
YunLong Yu
Siming Fu
VGen
203
5
0
21 Feb 2025
A Reinforcement Learning-Based Automatic Video Editing Method Using
  Pre-trained Vision-Language Model
A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Panwen Hu
Nan Xiao
Feifei Li
Yongquan Chen
Rui Huang
VGenOffRL
83
3
0
07 Nov 2024
StoryAgent: Customized Storytelling Video Generation via Multi-Agent
  Collaboration
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGenDiffM
100
6
0
07 Nov 2024
On Information-Theoretic Measures of Predictive Uncertainty
On Information-Theoretic Measures of Predictive Uncertainty
Kajetan Schweighofer
L. Aichberger
Mykyta Ielanskyi
Sepp Hochreiter
UQCVUD
109
6
0
14 Oct 2024
CustomCrafter: Customized Video Generation with Preserving Motion and
  Concept Composition Abilities
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu
Yong Zhang
Xintao Wang
Xianpan Zhou
Guangcong Zheng
Zhongang Qi
Ying Shan
Xi Li
VGenDiffM
50
29
0
23 Aug 2024
EasyControl: Transfer ControlNet to Video Diffusion for Controllable
  Generation and Interpolation
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
Cong Wang
Jiaxi Gu
Panwen Hu
Haoyu Zhao
Yuanfan Guo
J. N. Han
Hang Xu
Xiaodan Liang
VGenDiffM
68
7
0
23 Aug 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffMVGen
226
512
0
12 Aug 2024
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Zhenghao Zhang
Junchao Liao
Menghao Li
Zuozhuo Dai
Bingxue Qiu
Hao Hu
Shaowei Cai
Weizhi Wang
VGen
90
52
0
31 Jul 2024
HumanVid: Demystifying Training Data for Camera-controllable Human Image
  Animation
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Zhenzhi Wang
Yixuan Li
Yanhong Zeng
Youqing Fang
Yuwei Guo
...
Jing Tan
Kai Chen
Tianfan Xue
Bo Dai
Dahua Lin
VGen3DH
138
23
0
24 Jul 2024
Still-Moving: Customized Video Generation without Customized Video Data
Still-Moving: Customized Video Generation without Customized Video Data
Hila Chefer
Shiran Zada
Roni Paiss
Ariel Ephrat
Omer Tov
Michael Rubinstein
Lior Wolf
Tali Dekel
T. Michaeli
Inbar Mosseri
DiffMVGen
80
24
0
11 Jul 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
92
52
0
11 Jun 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
80
47
0
23 Apr 2024
DreamMatcher: Appearance Matching Self-Attention for
  Semantically-Consistent Text-to-Image Personalization
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
Jisu Nam
Heesu Kim
Dongjae Lee
Siyoon Jin
Seungryong Kim
Seunggyu Chang
DiffM
76
43
0
15 Feb 2024
Magic-Me: Identity-Specific Video Customized Diffusion
Magic-Me: Identity-Specific Video Customized Diffusion
Ze Ma
Daquan Zhou
Chun-Hsiao Yeh
Xue-She Wang
Xiuyu Li
Huanrui Yang
Zhen Dong
Kurt Keutzer
Jiashi Feng
VGenDiffM
57
31
0
14 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffMVGen
118
59
0
06 Feb 2024
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Zhao Wang
Aoxue Li
Lingting Zhu
Yong Guo
Qi Dou
Zhenguo Li
VGenDiffM
75
43
0
18 Jan 2024
Vlogger: Make Your Dream A Vlog
Vlogger: Make Your Dream A Vlog
Shaobin Zhuang
Kunchang Li
Xinyuan Chen
Yaohui Wang
Ziwei Liu
Yu Qiao
Yali Wang
VGenDiffM
69
38
0
17 Jan 2024
SSR-Encoder: Encoding Selective Subject Representation for
  Subject-Driven Generation
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
77
68
0
26 Dec 2023
DreamVideo: Composing Your Dream Videos with Customized Subject and
  Motion
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Yujie Wei
Shiwei Zhang
Zhiwu Qing
Hangjie Yuan
Zhiheng Liu
Yu Liu
Yingya Zhang
Jingren Zhou
Hongming Shan
DiffMVGen
68
98
0
07 Dec 2023
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention
  and Text Guidance
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance
Cong Wang
Jiaxi Gu
Panwen Hu
Songcen Xu
Hang Xu
Xiaodan Liang
VGen
65
16
0
05 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffMVGen
94
73
0
01 Dec 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
263
1,170
0
25 Nov 2023
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning on Language-Video Foundation Models
VideoDreamer: Customized Multi-Subject Text-to-Video Generation with Disen-Mix Finetuning on Language-Video Foundation Models
Hong Chen
Xin Wang
Guanning Zeng
Yipeng Zhang
Yuwei Zhou
Feilin Han
Wenwu Zhu
Wenwu Zhu
VGenDiffM
46
1
0
02 Nov 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
103
246
0
18 Oct 2023
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion
  Models
LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
Yaohui Wang
Xinyuan Chen
Xin Ma
Shangchen Zhou
Ziqi Huang
...
Chen Change Loy
Bo Dai
Dahua Lin
Yu Qiao
Ziwei Liu
VGenDiffM
76
230
0
26 Sep 2023
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion
  Models
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Li Chen
Mengyi Zhao
Yiheng Liu
Mingxu Ding
Yangyang Song
...
Xu Wang
Hao Yang
Jing Liu
Kang Du
Min Zheng
DiffM
53
55
0
11 Sep 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
  without Specific Tuning
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
99
845
0
10 Jul 2023
Emergent Correspondence from Image Diffusion
Emergent Correspondence from Image Diffusion
Luming Tang
Menglin Jia
Qianqian Wang
Cheng Perng Phoo
Bharath Hariharan
91
266
0
06 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image
  Generation
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
51
10
0
01 Jun 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot
  Semantic Correspondence
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence
Junyi Zhang
Charles Herrmann
Junhwa Hur
Luisa Polania Cabrera
Varun Jampani
Deqing Sun
Ming-Hsuan Yang
DiffM
73
183
0
24 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized
  Attention
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGenDiffM
114
254
0
17 May 2023
Key-Locked Rank One Editing for Text-to-Image Personalization
Key-Locked Rank One Editing for Text-to-Image Personalization
Yoad Tewel
Rinon Gal
Gal Chechik
Yuval Atzmon
DiffM
202
173
0
02 May 2023
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGSVGen
196
1,092
0
18 Apr 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
90
284
0
20 Mar 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
Cones: Concept Neurons in Diffusion Models for Customized Generation
Zhiheng Liu
Ruili Feng
Kai Zhu
Yifei Zhang
Kecheng Zheng
Yu Liu
Deli Zhao
Jingren Zhou
Yang Cao
DiffM
141
127
0
09 Mar 2023
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Latent Video Diffusion Models for High-Fidelity Long Video Generation
Yin-Yin He
Tianyu Yang
Yong Zhang
Ying Shan
Qifeng Chen
DiffMVGen
90
235
0
23 Nov 2022
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image
  Translation
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation
Narek Tumanyan
Michal Geyer
Shai Bagon
Tali Dekel
126
679
0
22 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert
  Denoisers
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLMMoE
168
827
0
02 Nov 2022
Neural Matching Fields: Implicit Representation of Matching Fields for
  Visual Correspondence
Neural Matching Fields: Implicit Representation of Matching Fields for Visual Correspondence
Sung‐Jin Hong
Jisu Nam
Seokju Cho
Susung Hong
Sangryul Jeon
Dongbo Min
Seung Wook Kim
3DV
68
20
0
06 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
279
2,861
0
25 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
200
1,773
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
161
1,889
0
02 Aug 2022
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
167
1,451
0
07 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
460
15,665
0
20 Dec 2021
PDC-Net+: Enhanced Probabilistic Dense Correspondence Network
PDC-Net+: Enhanced Probabilistic Dense Correspondence Network
Prune Truong
Martin Danelljan
Radu Timofte
Luc Van Gool
72
86
0
28 Sep 2021
CATs: Cost Aggregation Transformers for Visual Correspondence
CATs: Cost Aggregation Transformers for Visual Correspondence
Seokju Cho
Sunghwan Hong
Sangryul Jeon
Yunsung Lee
Kwanghoon Sohn
Seungryong Kim
ViT
76
91
0
04 Jun 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
964
29,731
0
26 Feb 2021
Denoising Diffusion Implicit Models
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLMDiffM
283
7,384
0
06 Oct 2020
1