ResearchTrend.AI
Video Generation

VGen

Innovative methods and technologies for generating high-quality video content using AI and machine learning techniques.

All papers

50 / 4,191 papers shown
MMGR: Multi-Modal Generative Reasoning
Zefan Cai
Haoyi Qiu
Tianyi Ma
Haozhe Zhao
Gengze Zhou
...
Minjia Zhang
Xiao Wen
Jiuxiang Gu
Nanyun Peng
Junjie Hu
EGVM, VGen, LRM
105
0
0
16 Dec 2025
End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI Approach
International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), 2025
Emanuele Artioli
Farzad Tashtarian
Christian Timmerer
VGen
4
0
0
16 Dec 2025
ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body
Juze Zhang
Changan Chen
Xin Chen
Heng Yu
Tiange Xiang
Ali Sartaz Khan
Shrinidhi K. Lakshmikanth
Ehsan Adeli
LM&Ro, VGen
12
0
0
16 Dec 2025
MobileWorldBench: Towards Semantic World Modeling For Mobile Agents
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
Aditya Grover
VGen
36
0
0
16 Dec 2025
S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation
Leon Sick
Lukas Hoyer
Dominik Engel
Pedro Hermosilla
Timo Ropinski
VGen
8
0
0
16 Dec 2025
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling
Wenqiang Sun
Haiyu Zhang
Haoyuan Wang
Junta Wu
Zehan Wang
Zhenwei Wang
Yunhong Wang
Jun Zhang
Tengfei Wang
Chunchao Guo
VGen
0
0
0
16 Dec 2025
Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding
Nando Metzger
Prune Truong
Goutam Bhat
Konrad Schindler
Federico Tombari
VGen
0
0
0
16 Dec 2025
DRAW2ACT: Turning Depth-Encoded Trajectories into Robotic Demonstration Videos
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Mohammad Altillawi
Ziyuan Liu
Gitta Kutyniok
VGen
8
0
0
16 Dec 2025
AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Sisi Dai
Kai Xu
DiffM, VGen
0
0
0
16 Dec 2025
AnimaMimic: Imitating 3D Animation from Video Priors
Tianyi Xie
Yunuo Chen
Yaowei Guo
Yin Yang
Bolei Zhou
Demetri Terzopoulos
Ying Jiang
Chenfanfu Jiang
VGen
4
0
0
16 Dec 2025
Distill Video Datasets into Images
Zhenghao Zhao
Haoxuan Wang
Kai Wang
Yuzhang Shang
Yuan Hong
Yan Yan
DD, VGen
95
0
0
16 Dec 2025
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?
Jiaqi Wang
Weijia Wu
Yi Zhan
Rui Zhao
Ming Hu
James Cheng
Wei Liu
Philip Torr
Kevin Qinghong Lin
VGen
4
0
0
15 Dec 2025
Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
Heyi Chen
Siyan Chen
Xin Chen
Yanfei Chen
Ying Chen
...
Xueqiong Qu
Yuxi Ren
Kai Shen
Guang Shi
Lei Shi
VGen
89
0
0
15 Dec 2025
SneakPeek: Future-Guided Instructional Streaming Video Generation
Cheeun Hong
German Barquero
Fadime Sener
Markos Georgopoulos
Edgar Schönfeld
Stefan Popov
Yuming Du
Oscar Mañas
Albert Pumarola
DiffM, VGen, AI4TS
20
0
0
15 Dec 2025
LongVie 2: Multimodal Controllable Ultra-Long Video World Model
Jianxiong Gao
Zhaoxi Chen
Xian Liu
Junhao Zhuang
Chengming Xu
Jianfeng Feng
Yu Qiao
Yanwei Fu
Chenyang Si
Ziwei Liu
VGen, SyDa, VLM
28
0
0
15 Dec 2025
Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
Jiangning Zhang
Junwei Zhu
Zhenye Gan
Donghao Luo
Chuming Lin
...
Xu Chen
Chencan Fu
Keke He
Xiaobin Hu
Chengjie Wang
VGen
12
0
0
15 Dec 2025
UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era
Ziqiang Zhu
Bowei Yang
VGen
0
0
0
15 Dec 2025
Content Adaptive based Motion Alignment Framework for Learned Video Compression
Tiange Zhang
Xiandong Meng
Siwei Ma
VGen
0
0
0
15 Dec 2025
Recurrent Video Masked Autoencoders
Daniel Zoran
Nikhil Parthasarathy
Yi Yang
Drew A Hudson
Joao Carreira
Andrew Zisserman
VGen
0
0
0
15 Dec 2025
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
Xiaohu Huang
Hao Zhou
Qiangpeng Yang
Shilei Wen
Kai Han
VGen
12
0
0
15 Dec 2025
World Models Can Leverage Human Videos for Dexterous Manipulation
Raktim Gautam Goswami
Amir Bar
David Fan
Tsung-Yen Yang
Gaoyue Zhou
Prashanth Krishnamurthy
Michael Rabbat
Farshad Khorrami
Yann LeCun
VGen
16
0
0
15 Dec 2025
KlingAvatar 2.0 Technical Report
Kling Team
Jialu Chen
Yikang Ding
Zhixue Fang
Kun Gai
...
Chao Wang
Xuebo Wang
Haoxian Zhang
Yuanxing Zhang
Yan Zhou
VGen
16
0
0
15 Dec 2025
DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders
Susung Hong
Chongjian Ge
Zhifei Zhang
Jui-Hsien Wang
DiffM, VGen
16
0
0
15 Dec 2025
Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models
Shweta Mahajan
Shreya Kadambi
Hoang Le
Munawar Hayat
Fatih Porikli
VGen
8
0
0
15 Dec 2025
PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence
Ruiyan Wang
Teng Hu
Kaihui Huang
Zihan Su
Ran Yi
Lizhuang Ma
DiffM, VGen
0
0
0
15 Dec 2025
STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits
Foivos Paraperas Papantoniou
Stathis Galanakis
Rolandos Alexandros Potamias
Bernhard Kainz
Stefanos Zafeiriou
DiffM, VGen
0
0
0
15 Dec 2025
Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs
Anran Qi
Changjian Li
Adrien Bousseau
Niloy J. Mitra
DiffM, VGen
0
0
0
15 Dec 2025
Robust Motion Generation using Part-level Reliable Data from Videos
Boyuan Li
Sipeng Zheng
Bin Cao
Ruihua Song
Zongqing Lu
VGen
0
0
0
14 Dec 2025
GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation
Zhenya Yang
Zhe Liu
Yuxiang Lu
Liping Hou
Chenxuan Miao
Siyi Peng
Bailan Feng
Xiang Bai
Hengshuang Zhao
DiffM, VGen
0
0
0
14 Dec 2025
Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal
Weihan Xu
Kan Jen Cheng
Koichi Saito
Muhammad Jehanzeb Mirza
Tingle Li
...
Masato Ishii
Takashi Shibuya
Yuki Mitsufuji
Gopala Anumanchipalli
Paul Pu Liang
DiffM, VGen
0
0
0
14 Dec 2025
FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning
Yue Jiang
Dingkang Yang
Minghao Han
Jinghang Han
Zizhi Chen
Yizhou Liu
Mingcheng Li
Peng Zhai
Lihua Zhang
VGen, VLM
4
0
0
14 Dec 2025
Generative Spatiotemporal Data Augmentation
Jinfan Zhou
Lixin Luo
Sungmin Eum
Heesung Kwon
Jeong Joon Park
VGen
0
0
0
14 Dec 2025
Animus3D: Text-driven 3D Animation via Motion Score Distillation
Qi Sun
Can Wang
Jiaxiang Shang
Wensen Feng
Jing Liao
VGen
0
0
0
14 Dec 2025
STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative
Peixuan Zhang
Zijian Jia
Kaiqi Liu
Shuchen Weng
Si Li
Boxin Shi
VGen
0
0
0
13 Dec 2025
ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States
Haowen Wang
Xiaoping Yuan
Fugang Zhang
Rui Jian
Yuanwei Zhu
Xiuquan Qiao
Yakun Huang
VGen
4
0
0
13 Dec 2025
Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video
Daniel Adebi
Sagnik Majumder
Kristen Grauman
VGen
92
0
0
13 Dec 2025
V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping
Hyunkoo Lee
Wooseok Jang
Jini Yang
Taehwan Kim
Sangoh Kim
Sangwon Jung
Seungryong Kim
DiffM, VGen
0
0
0
13 Dec 2025
CineLOG: A Training Free Approach for Cinematic Long Video Generation
Zahra Dehghanian
Morteza Abolghasemi
Hamid Beigy
Hamid R. Rabiee
VGen
0
0
0
13 Dec 2025
SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
Xuancheng Xu
Yaning Li
Sisi You
Bing-Kun Bao
DiffM, VGen
0
0
0
13 Dec 2025
Endless World: Real-Time 3D-Aware Long Video Generation
Ke Zhang
Yiqun Mei
Jiacong Xu
Vishal M. Patel
VGen
0
0
0
13 Dec 2025
AutoMV: An Automatic Multi-Agent System for Music Video Generation
Xiaoxuan Tang
Xinping Lei
Chaoran Zhu
Shiyun Chen
Ruibin Yuan
...
Wenhao Huang
Emmanouil Benetos
Yang Liu
Jiaheng Liu
Yinghao Ma
VGen
0
0
0
13 Dec 2025
Kinetic Mining in Context: Few-Shot Action Synthesis via Text-to-Motion Distillation
Luca Cazzola
Ahed Alboody
VGen
14
0
0
12 Dec 2025
Referring Change Detection in Remote Sensing Imagery
Yilmaz Korkmaz
Jay N. Paranjape
Celso M. de Melo
Vishal M. Patel
ObjD, VGen
40
0
0
12 Dec 2025
KeyframeFace: From Text to Expressive Facial Keyframes
Jingchao Wu
Zejian Kang
Haibo Liu
Yuanchen Fei
Xiangru Huang
VGen
0
0
0
12 Dec 2025
SPDMark: Selective Parameter Displacement for Robust Video Watermarking
Samar Fares
Nurbek Tastan
Karthik Nandakumar
VGen
0
0
0
12 Dec 2025
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Zhiyuan Li
Chi-Man Pun
Chen Fang
Jue Wang
Xiaodong Cun
VGen
28
0
0
12 Dec 2025
FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion
Xiangyang Luo
Qingyu Li
Xiaokun Liu
Wenyu Qin
Miao Yang
Meng Wang
Pengfei Wan
Di Zhang
Kun Gai
Shao-Lun Huang
DiffM, VGen
20
0
0
12 Dec 2025
CreativeVR: Diffusion-Prior-Guided Approach for Structure and Motion Restoration in Generative and Real Videos
Tejas Panambur
Ishan Rajendrakumar Dave
Chongjian Ge
Ersin Yumer
Xue Bai
DiffM, VGen
80
0
0
12 Dec 2025
AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis
Junjie Ye
Rong Xue
Basile Van Hoorick
Pavel Tokmakov
Muhammad Zubair Irshad
Yue Wang
Vitor Guizilini
VGen
24
0
0
12 Dec 2025
Reframing Music-Driven 2D Dance Pose Generation as Multi-Channel Image Generation
Yan Zhang
Han Zou
Lincong Feng
Cong Xie
Ruiqi Yu
Zhenpeng Zhan
DiffM, VGen
20
0
0
12 Dec 2025