ResearchTrend.AI
© 2025 ResearchTrend.AI, All rights reserved.

Video Generation

VGen
Innovative methods and technologies for generating high-quality video content using AI and machine learning techniques.

All papers

50 / 4,191 papers shown
Title
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Wenqiang Sun
Haiyu Zhang
Haoyuan Wang
Junta Wu
Zehan Wang
Zhenwei Wang
Yunhong Wang
Jun Zhang
Tengfei Wang
Chunchao Guo
VGen
0
0
0
16 Dec 2025
AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Sisi Dai
Kai Xu
DiffM, VGen
0
0
0
16 Dec 2025
End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI Approach
Emanuele Artioli
Farzad Tashtarian
Christian Timmerer
VGen
0
0
0
16 Dec 2025
Distill Video Datasets into Images
Zhenghao Zhao
Haoxuan Wang
Kai Wang
Yuzhang Shang
Yuan Hong
Yan Yan
DD, VGen
72
0
0
16 Dec 2025
S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation
Leon Sick
Lukas Hoyer
Dominik Engel
Pedro Hermosilla
Timo Ropinski
VGen
0
0
0
16 Dec 2025
MMGR: Multi-Modal Generative Reasoning
Zefan Cai
Haoyi Qiu
Tianyi Ma
Haozhe Zhao
Gengze Zhou
...
Minjia Zhang
Xiao Wen
Jiuxiang Gu
Nanyun Peng
Junjie Hu
EGVM, VGen, LRM
84
0
0
16 Dec 2025
DRAW2ACT: Turning Depth-Encoded Trajectories into Robotic Demonstration Videos
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Mohammad Altillawi
Ziyuan Liu
Gitta Kutyniok
VGen
8
0
0
16 Dec 2025
MobileWorldBench: Towards Semantic World Modeling For Mobile Agents
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
Aditya Grover
VGen
8
0
0
16 Dec 2025
AnimaMimic: Imitating 3D Animation from Video Priors
Tianyi Xie
Yunuo Chen
Yaowei Guo
Yin Yang
Bolei Zhou
Demetri Terzopoulos
Ying Jiang
Chenfanfu Jiang
VGen
4
0
0
16 Dec 2025
ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body
Juze Zhang
Changan Chen
Xin Chen
Heng Yu
Tiange Xiang
Ali Sartaz Khan
Shrinidhi K. Lakshmikanth
Ehsan Adeli
LM&Ro, VGen
4
0
0
16 Dec 2025
Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding
Nando Metzger
Prune Truong
Goutam Bhat
Konrad Schindler
Federico Tombari
VGen
0
0
0
16 Dec 2025
Recurrent Video Masked Autoencoders
Daniel Zoran
Nikhil Parthasarathy
Yi Yang
Drew A Hudson
Joao Carreira
Andrew Zisserman
VGen
0
0
0
15 Dec 2025
Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs
Anran Qi
Changjian Li
Adrien Bousseau
Niloy J. Mitra
DiffM, VGen
0
0
0
15 Dec 2025
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
Xiaohu Huang
Hao Zhou
Qiangpeng Yang
Shilei Wen
Kai Han
VGen
12
0
0
15 Dec 2025
PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence
Ruiyan Wang
Teng Hu
Kaihui Huang
Zihan Su
Ran Yi
Lizhuang Ma
DiffM, VGen
0
0
0
15 Dec 2025
Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
Heyi Chen
Siyan Chen
Xin Chen
Yanfei Chen
Ying Chen
...
Xueqiong Qu
Yuxi Ren
Kai Shen
Guang Shi
Lei Shi
VGen
32
0
0
15 Dec 2025
KlingAvatar 2.0 Technical Report
Kling Team
Jialu Chen
Yikang Ding
Zhixue Fang
Kun Gai
...
Chao Wang
Xuebo Wang
Haoxian Zhang
Yuanxing Zhang
Yan Zhou
VGen
8
0
0
15 Dec 2025
LongVie 2: Multimodal Controllable Ultra-Long Video World Model
Jianxiong Gao
Zhaoxi Chen
Xian Liu
Junhao Zhuang
Chengming Xu
Jianfeng Feng
Yu Qiao
Yanwei Fu
Chenyang Si
Ziwei Liu
VGen, SyDa, VLM
20
0
0
15 Dec 2025
Content Adaptive based Motion Alignment Framework for Learned Video Compression
Tiange Zhang
Xiandong Meng
Siwei Ma
VGen
0
0
0
15 Dec 2025
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
Jiaqi Wang
Weijia Wu
Yi Zhan
Rui Zhao
Ming Hu
James Cheng
Wei Liu
Philip Torr
Kevin Qinghong Lin
VGen
4
0
0
15 Dec 2025
World Models Can Leverage Human Videos for Dexterous Manipulation
Raktim Gautam Goswami
Amir Bar
David Fan
Tsung-Yen Yang
Gaoyue Zhou
Prashanth Krishnamurthy
Michael Rabbat
Farshad Khorrami
Yann LeCun
VGen
16
0
0
15 Dec 2025
DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders
Susung Hong
Chongjian Ge
Zhifei Zhang
Jui-Hsien Wang
DiffM, VGen
16
0
0
15 Dec 2025
Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models
Shweta Mahajan
Shreya Kadambi
Hoang Le
Munawar Hayat
Fatih Porikli
VGen
8
0
0
15 Dec 2025
STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits
Foivos Paraperas Papantoniou
Stathis Galanakis
Rolandos Alexandros Potamias
Bernhard Kainz
Stefanos Zafeiriou
DiffM, VGen
0
0
0
15 Dec 2025
UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era
Ziqiang Zhu
Bowei Yang
VGen
0
0
0
15 Dec 2025
SneakPeek: Future-Guided Instructional Streaming Video Generation
Cheeun Hong
German Barquero
Fadime Sener
Markos Georgopoulos
Edgar Schönfeld
Stefan Popov
Yuming Du
Oscar Mañas
Albert Pumarola
DiffM, VGen, AI4TS
16
0
0
15 Dec 2025
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
Jiangning Zhang
Junwei Zhu
Zhenye Gan
Donghao Luo
Chuming Lin
...
Xu Chen
Chencan Fu
Keke He
Xiaobin Hu
Chengjie Wang
VGen
12
0
0
15 Dec 2025
GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
Zhenya Yang
Zhe Liu
Yuxiang Lu
Liping Hou
Chenxuan Miao
Siyi Peng
Bailan Feng
Xiang Bai
Hengshuang Zhao
DiffM, VGen
0
0
0
14 Dec 2025
FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning
Yue Jiang
Dingkang Yang
Minghao Han
Jinghang Han
Zizhi Chen
Yizhou Liu
Mingcheng Li
Peng Zhai
Lihua Zhang
VGen, VLM
4
0
0
14 Dec 2025
Robust Motion Generation using Part-level Reliable Data from Videos
Boyuan Li
Sipeng Zheng
Bin Cao
Ruihua Song
Zongqing Lu
VGen
0
0
0
14 Dec 2025
Animus3D: Text-driven 3D Animation via Motion Score Distillation
Qi Sun
Can Wang
Jiaxiang Shang
Wensen Feng
Jing Liao
VGen
0
0
0
14 Dec 2025
Generative Spatiotemporal Data Augmentation
Jinfan Zhou
Lixin Luo
Sungmin Eum
Heesung Kwon
Jeong Joon Park
VGen
0
0
0
14 Dec 2025
Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
Weihan Xu
Kan Jen Cheng
Koichi Saito
Muhammad Jehanzeb Mirza
Tingle Li
...
Masato Ishii
Takashi Shibuya
Yuki Mitsufuji
Gopala Anumanchipalli
Paul Pu Liang
DiffM, VGen
0
0
0
14 Dec 2025
V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping
Hyunkoo Lee
Wooseok Jang
Jini Yang
Taehwan Kim
Sangoh Kim
Sangwon Jung
Seungryong Kim
DiffM, VGen
0
0
0
13 Dec 2025
STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
Peixuan Zhang
Zijian Jia
Kaiqi Liu
Shuchen Weng
Si Li
Boxin Shi
VGen
0
0
0
13 Dec 2025
ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
Haowen Wang
Xiaoping Yuan
Fugang Zhang
Rui Jian
Yuanwei Zhu
Xiuquan Qiao
Yakun Huang
VGen
0
0
0
13 Dec 2025
AutoMV: An Automatic Multi-Agent System for Music Video Generation
Xiaoxuan Tang
Xinping Lei
Chaoran Zhu
Shiyun Chen
Ruibin Yuan
...
Wenhao Huang
Emmanouil Benetos
Yang Liu
Jiaheng Liu
Yinghao Ma
VGen
0
0
0
13 Dec 2025
SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
Xuancheng Xu
Yaning Li
Sisi You
Bing-Kun Bao
DiffM, VGen
0
0
0
13 Dec 2025
CineLOG: A Training Free Approach for Cinematic Long Video Generation
Zahra Dehghanian
Morteza Abolghasemi
Hamid Beigy
Hamid R. Rabiee
VGen
0
0
0
13 Dec 2025
Endless World: Real-Time 3D-Aware Long Video Generation
Ke Zhang
Yiqun Mei
Jiacong Xu
Vishal M. Patel
VGen
0
0
0
13 Dec 2025
Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video
Daniel Adebi
Sagnik Majumder
Kristen Grauman
VGen
88
0
0
13 Dec 2025
AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path
Zhengyang Yu
Akio Hayakawa
Masato Ishii
Qingtao Yu
Takashi Shibuya
Jing Zhang
Yuki Mitsufuji
DiffM, VGen
4
0
0
12 Dec 2025
FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint
Jiapeng Tang
Kai Li
Chengxiang Yin
Liuhao Ge
Fei Jiang
...
Matthias Nießner
Christian Häne
Timur Bagautdinov
Egor Zakharov
Peihong Guo
DiffM, VGen, CoGe
4
0
0
12 Dec 2025
Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation
Yang Fei
George Stoica
Jingyuan Liu
Qifeng Chen
Ranjay Krishna
Xiaojuan Wang
Benlin Liu
DiffM, VGen
9
0
0
12 Dec 2025
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
Ye Fang
Tong Wu
Valentin Deschaintre
Duygu Ceylan
Iliyan Georgiev
Chun-Hao Paul Huang
Yiwei Hu
Xuelin Chen
Tuanfeng Yang Wang
VGen
1
0
0
12 Dec 2025
Referring Change Detection in Remote Sensing Imagery
Yilmaz Korkmaz
Jay N. Paranjape
Celso M. de Melo
Vishal M. Patel
ObjD, VGen
24
0
0
12 Dec 2025
JoyAvatar: Real-time and Infinite Audio-Driven Avatar Generation with Autoregressive Diffusion
Chaochao Li
Ruikui Wang
Liangbo Zhou
Jinheng Feng
Huaishao Luo
Huan Zhang
Youzheng Wu
Xiaodong He
DiffM, VGen
8
0
0
12 Dec 2025
Flowception: Temporally Expansive Flow Matching for Video Generation
Tariq Berrada Ifriqi
John Nguyen
Karteek Alahari
Jakob Verbeek
Ricky T. Q. Chen
VGen
4
0
0
12 Dec 2025
FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Hongbin Lin
Yiming Yang
Yifan Zhang
Chaoda Zheng
Jie Feng
...
Boyang Wang
Yu Zhang
Xianming Liu
Shuguang Cui
Zhen Li
VGen, LRM
4
0
0
12 Dec 2025
BAgger: Backwards Aggregation for Mitigating Drift in Autoregressive Video Diffusion Models
Ryan Po
Eric Ryan Chan
Changan Chen
Gordon Wetzstein
DiffM, VGen
0
0
0
12 Dec 2025