ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.10709
  4. Cited By
Emu Video: Factorizing Text-to-Video Generation by Explicit Image
  Conditioning

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

17 November 2023
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
    DiffM
    VGen
ArXivPDFHTML

Papers citing "Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning"

50 / 160 papers shown
Title
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I
  Diffusion Models
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGen
DiffM
48
3
0
18 Apr 2024
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
VideoGigaGAN: Towards Detail-rich Video Super-Resolution
Yiran Xu
Taesung Park
Richard Zhang
Yang Zhou
Eli Shechtman
Feng Liu
Jia-Bin Huang
Difan Liu
SupR
93
10
0
18 Apr 2024
AniClipart: Clipart Animation with Text-to-Video Priors
AniClipart: Clipart Animation with Text-to-Video Priors
Rong Wu
Wanchao Su
Kede Ma
Jing Liao
32
4
0
18 Apr 2024
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Tarasha Khurana
Deva Ramanan
AI4TS
55
0
0
17 Apr 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
48
87
0
16 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
  Controls to Any Diffusion Model
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
39
2
0
08 Apr 2024
AI-Generated Video Detection via Spatio-Temporal Anomaly Learning
AI-Generated Video Detection via Spatio-Temporal Anomaly Learning
Jianfa Bai
Man Lin
Gang Cao
29
3
0
25 Mar 2024
Opportunities and challenges in the application of large artificial
  intelligence models in radiology
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MA
MedIm
AI4CE
45
5
0
24 Mar 2024
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
DiffM
37
18
0
22 Mar 2024
Spectral Motion Alignment for Video Motion Transfer using Diffusion
  Models
Spectral Motion Alignment for Video Motion Transfer using Diffusion Models
Geon Yeong Park
Hyeonho Jeong
Sang Wan Lee
Jong Chul Ye
VGen
DiffM
37
10
0
22 Mar 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
VGen
101
77
0
21 Mar 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Zhengqing Yuan
Ruoxi Chen
Zhaoxu Li
Haolong Jia
Lifang He
Chi Wang
Lichao Sun
VGen
63
27
0
20 Mar 2024
Generative Enhancement for 3D Medical Images
Generative Enhancement for 3D Medical Images
Lingting Zhu
Noel Codella
Dongdong Chen
Zhenchao Jin
Lu Yuan
Lequan Yu
DiffM
MedIm
42
10
0
19 Mar 2024
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion
  Models
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Junlin Han
Filippos Kokkinos
Philip Torr
VGen
77
40
0
18 Mar 2024
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image
  using Latent Video Diffusion
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Vikram S. Voleti
Chun-Han Yao
Mark Boss
Adam Letts
David Pankratz
Dmitry Tochilkin
Christian Laforte
Robin Rombach
Varun Jampani
DiffM
VGen
35
170
0
18 Mar 2024
Animate Your Motion: Turning Still Images into Dynamic Videos
Animate Your Motion: Turning Still Images into Dynamic Videos
Mingxiao Li
Bo Wan
Marie-Francine Moens
Tinne Tuytelaars
VGen
DiffM
35
4
0
15 Mar 2024
Video Editing via Factorized Diffusion Distillation
Video Editing via Factorized Diffusion Distillation
Uriel Singer
Amit Zohar
Yuval Kirstain
Shelly Sheynin
Adam Polyak
Devi Parikh
Yaniv Taigman
DiffM
VGen
46
12
0
14 Mar 2024
DragAnything: Motion Control for Anything using Entity Representation
DragAnything: Motion Control for Anything using Entity Representation
Wejia Wu
Zhuang Li
Yuchao Gu
Rui Zhao
Yefei He
David Junhao Zhang
Mike Zheng Shou
Yan Li
Tingting Gao
Di Zhang
VGen
76
51
0
12 Mar 2024
V3D: Video Diffusion Models are Effective 3D Generators
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen
Yikai Wang
Feng Wang
Zhengyi Wang
Huaping Liu
VGen
40
61
0
11 Mar 2024
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video
  Diffusion Models
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models
Wenhao Wang
Yi Yang
VGen
DiffM
33
32
0
10 Mar 2024
VideoElevator: Elevating Video Generation Quality with Versatile
  Text-to-Image Diffusion Models
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang
Yuxiang Wei
Xianhui Lin
Zheng Hui
Peiran Ren
Xuansong Xie
Xiangyang Ji
Wangmeng Zuo
VGen
43
6
0
08 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video
  Generation
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
36
40
0
08 Mar 2024
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video
  Diffusion Models via Training-Free Unified Attention Control
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control
Xuweiyi Chen
Tian Xia
Sihan Xu
VGen
DiffM
37
7
0
04 Mar 2024
AtomoVideo: High Fidelity Image-to-Video Generation
AtomoVideo: High Fidelity Image-to-Video Generation
Litong Gong
Yiran Zhu
Weijie Li
Xiaoyang Kang
Biao Wang
Tiezheng Ge
Bo Zheng
DiffM
VGen
132
12
0
04 Mar 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities
  of Large Vision Models
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
75
259
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
66
85
0
27 Feb 2024
Beyond A*: Better Planning with Transformers via Search Dynamics
  Bootstrapping
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert
Sainbayar Sukhbaatar
DiJia Su
Qinqing Zheng
Paul Mcvay
Michael Rabbat
Yuandong Tian
32
53
0
21 Feb 2024
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance
  Editing
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
Jianhong Bai
Tianyu He
Yuchi Wang
Junliang Guo
Haoji Hu
Zuozhu Liu
Jiang Bian
VGen
31
26
0
20 Feb 2024
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI:
  Unpredictable Plays Never Repeating The Same
Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same
Sungjun Ahn
Hyun-Jeong Yim
Youngwan Lee
Sung-Ik Park
VGen
41
4
0
19 Feb 2024
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Classification Diffusion Models: Revitalizing Density Ratio Estimation
Shahar Yadin
Noam Elata
T. Michaeli
DiffM
43
1
0
15 Feb 2024
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality
  3D Generation
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation
Luke Melas-Kyriazi
Iro Laina
Christian Rupprecht
Natalia Neverova
Andrea Vedaldi
Oran Gafni
Filippos Kokkinos
3DGS
32
64
0
13 Feb 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren
Harry Yang
Ge Zhang
Cong Wei
Xinrun Du
Stephen W. Huang
Wenhu Chen
DiffM
VGen
90
54
0
06 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled
  Visual-Motional Tokenization
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Di Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
55
42
0
05 Feb 2024
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Boximator: Generating Rich and Controllable Motions for Video Synthesis
Jiawei Wang
Yuchen Zhang
Jiaxin Zou
Yan Zeng
Guoqiang Wei
Liping Yuan
Hang Li
DiffM
VGen
32
43
0
02 Feb 2024
Lumiere: A Space-Time Diffusion Model for Video Generation
Lumiere: A Space-Time Diffusion Model for Video Generation
Omer Bar-Tal
Hila Chefer
Omer Tov
Charles Herrmann
Roni Paiss
...
T. Michaeli
Oliver Wang
Deqing Sun
Tali Dekel
Inbar Mosseri
VGen
114
215
0
23 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
88
57
0
22 Jan 2024
WorldDreamer: Towards General World Models for Video Generation via
  Predicting Masked Tokens
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Xiaofeng Wang
Zheng Zhu
Guan Huang
Boyuan Wang
Xinze Chen
Jiwen Lu
VGen
37
32
0
18 Jan 2024
UniVG: Towards UNIfied-modal Video Generation
UniVG: Towards UNIfied-modal Video Generation
Ludan Ruan
Lei Tian
Chuanwei Huang
Xu Zhang
Xinyan Xiao
VGen
DiffM
34
3
0
17 Jan 2024
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated
  by AI
AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI
Fanda Fan
Chunjie Luo
Wanling Gao
Jianfeng Zhan
85
15
0
03 Jan 2024
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
42
113
0
21 Dec 2023
Generating Illustrated Instructions
Generating Illustrated Instructions
Sachit Menon
Ishan Misra
Rohit Girdhar
DiffM
34
4
0
07 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion
  Models
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen
DiffM
22
0
0
01 Dec 2023
Motion-Conditioned Image Animation for Video Editing
Motion-Conditioned Image Animation for Video Editing
Wilson Yan
Andrew Brown
Pieter Abbeel
Rohit Girdhar
S. Azadi
DiffM
VGen
58
12
0
30 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
77
349
0
29 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation
  and Editing
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
33
10
0
29 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
28
114
0
28 Nov 2023
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion
  Schedule Flaws and Enhancing Low-Frequency Controls
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Minghui Hu
Jianbin Zheng
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
26
3
0
27 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large
  Datasets
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
167
1,016
0
25 Nov 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Jinbo Xing
Menghan Xia
Yong Zhang
Haoxin Chen
Wangbo Yu
Hanyuan Liu
Xintao Wang
Tien-Tsin Wong
Ying Shan
VGen
47
224
0
18 Oct 2023
Previous
1234
Next