ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Communities
  3. ...

Neighbor communities

0 / 0 papers shown
Title
Top Contributors
Name# Papers# Citations
Social Events
DateLocationEvent
  1. Home
  2. Communities
  3. VGen

Video Generation

VGen
More data

Innovative methods and technologies for generating high-quality video content using AI and machine learning techniques.

Neighbor communities

51015

Featured Papers

0 / 0 papers shown
Title

All papers

50 / 3,789 papers shown
Title
Emu3.5: Native Multimodal Models are World Learners
Emu3.5: Native Multimodal Models are World Learners
Yufeng Cui
Honghao Chen
Haoge Deng
Xu Huang
Xinghang Li
...
Zhuo Chen
Yulong Ao
Tiejun Huang
Zhongyuan Wang
Xinlong Wang
MLLMVGen
62
0
0
30 Oct 2025
SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
Dongyue Lu
Ao Liang
Tianxin Huang
Xiao Fu
Yuyang Zhao
...
Liang Pan
Wei Yin
Lingdong Kong
Wei Tsang Ooi
Ziwei Liu
VGen
0
0
0
30 Oct 2025
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark
Ziyu Guo
Xinyan Chen
Renrui Zhang
Ruichuan An
Yu Qi
Dongzhi Jiang
Xiangtai Li
Manyuan Zhang
Hongsheng Li
Pheng-Ann Heng
VGenLRM
4
0
0
30 Oct 2025
LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation
LoCoT2V-Bench: A Benchmark for Long-Form and Complex Text-to-Video Generation
Xiangqing Zheng
Chengyue Wu
Kehai Chen
Min Zhang
DiffMVGen
0
0
0
30 Oct 2025
Co-Evolving Latent Action World Models
Co-Evolving Latent Action World Models
Yucen Wang
Fengming Zhang
De-Chuan Zhan
Li Zhao
Kaixin Wang
Jiang Bian
VGen
0
0
0
30 Oct 2025
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes
Yukun Huang
Jiwen Yu
Yanning Zhou
Jianan Wang
Xintao Wang
Pengfei Wan
Xihui Liu
VGen
0
0
0
30 Oct 2025
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation
Jing Lin
Ruisi Wang
Junzhe Lu
Ziqi Huang
Guorui Song
...
Wanqi Yin
Qingping Sun
Zhongang Cai
Lei Yang
Ziwei Liu
DiffMVGen
0
0
0
30 Oct 2025
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
Baolu Li
Yiming Zhang
Qinghe Wang
Liqian Ma
Xiaoyu Shi
...
Pengfei Wan
Zhenfei Yin
Yunzhi Zhuge
Huchuan Lu
Xu Jia
VGen
0
0
0
29 Oct 2025
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
Qianqian Qiao
DanDan Zheng
Yihang Bo
Bao Peng
Heng Huang
Longteng Jiang
Huaye Wang
Jingdong Chen
Jun Zhou
Xin Jin
VGen
0
0
0
29 Oct 2025
StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA
StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA
Yuhang Hu
Zhenyu Yang
Shihan Wang
Shengsheng Qian
Bin Wen
Fan Yang
Tingting Gao
Changsheng Xu
VGenLRM
4
0
0
29 Oct 2025
Improving Temporal Consistency and Fidelity at Inference-time in Perceptual Video Restoration by Zero-shot Image-based Diffusion Models
Improving Temporal Consistency and Fidelity at Inference-time in Perceptual Video Restoration by Zero-shot Image-based Diffusion Models
Nasrin Rahimi
A. Murat Tekalp
DiffMVGen
0
0
0
29 Oct 2025
Uniform Discrete Diffusion with Metric Path for Video Generation
Uniform Discrete Diffusion with Metric Path for Video Generation
Haoge Deng
Ting Pan
Fan Zhang
Y. Liu
Zhuoyan Luo
...
Wenxuan Wang
Chunhua Shen
Shiguang Shan
Zhaoxiang Zhang
Xinlong Wang
VGen
0
0
0
28 Oct 2025
Generative View Stitching
Generative View Stitching
Chonghyuk Song
Michal Stary
Boyuan Chen
George Kopanas
Vincent Sitzmann
VGen
0
0
0
28 Oct 2025
VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
Qiucheng Wu
Handong Zhao
Zhixin Shu
Jing Shi
Yang Zhang
Shiyu Chang
DiffMVGen
58
0
0
28 Oct 2025
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
Kang Zhang
T. Pham
Suyeon Lee
Axi Niu
Arda Senocak
Joon Son Chung
AuLLMVGen
0
0
0
28 Oct 2025
SAGE: Structure-Aware Generative Video Transitions between Diverse Clips
SAGE: Structure-Aware Generative Video Transitions between Diverse Clips
Mia Kan
Yilin Liu
Niloy J. Mitra
DiffMVGen
8
0
0
28 Oct 2025
CoMo: Compositional Motion Customization for Text-to-Video Generation
CoMo: Compositional Motion Customization for Text-to-Video Generation
Youcan Xu
Zhen Wang
Jiaxin Shi
Kexin Li
Feifei Shao
Jun Xiao
Yi Yang
Jun Yu
Long Chen
DiffMVGen
12
0
0
27 Oct 2025
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
Shuhong Zheng
Ashkan Mirzaei
Igor Gilitschenski
DiffMVGen
12
0
0
27 Oct 2025
Revising Second Order Terms in Deep Animation Video Coding
Revising Second Order Terms in Deep Animation Video Coding
Konstantin Schmidt
Thomas Richter
VGen
12
0
0
27 Oct 2025
FARMER: Flow AutoRegressive Transformer over Pixels
FARMER: Flow AutoRegressive Transformer over Pixels
Guangting Zheng
Qinyu Zhao
Tao Yang
Fei Xiao
Zhijie Lin
Jie Wu
Jiajun Deng
Y. Zhang
Rui Zhu
VGen
65
0
0
27 Oct 2025
Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap
Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap
Elisabeth Jüttner
Leona Krath
Stefan Korfhage
Hannah Dröge
M. Hullin
Markus Plack
VGen
12
0
0
27 Oct 2025
FAME: Fairness-aware Attention-modulated Video Editing
FAME: Fairness-aware Attention-modulated Video Editing
Zhangkai Wu
Xuhui Fan
Zhongyuan Xie
Kaize Shi
Zhidong Li
Longbing Cao
VGen
12
0
0
27 Oct 2025
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation
Junyoung Seo
Rodrigo Mira
A. Haliassos
Stella Bounareli
Honglie Chen
Linh Tran
Seungryong Kim
Zoe Landgraf
Jie Shen
VGen
12
0
0
27 Oct 2025
VALA: Learning Latent Anchors for Training-Free and Temporally Consistent
VALA: Learning Latent Anchors for Training-Free and Temporally Consistent
Zhangkai Wu
Xuhui Fan
Zhongyuan Xie
Kaize Shi
Longbing Cao
DiffMVGen
12
0
0
27 Oct 2025
Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views
Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views
Anna Deichler
Jonas Beskow
VGen
12
0
0
26 Oct 2025
MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
Fatemeh Nazarieh
Zhenhua Feng
Diptesh Kanojia
Muhammad Awais
J. Kittler
DiffMVGen
12
0
0
26 Oct 2025
Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Hollywood Town: Long-Video Generation via Cross-Modal Multi-Agent Orchestration
Zheng Wei
Mingchen Li
Zeqian Zhang
Ruibin Yuan
Pan Hui
Huamin Qu
James Evans
Maneesh Agrawala
Anyi Rao
VGen
12
0
0
25 Oct 2025
LongCat-Video Technical Report
LongCat-Video Technical Report
M-A-P Team
Xunliang Cai
Qilong Huang
Zhuoliang Kang
Hongyu Li
...
Liya Ma
Siyu Ren
Xiaoming Wei
Rixu Xie
Tong Zhang
VGenVLM
12
0
0
25 Oct 2025
BachVid: Training-Free Video Generation with Consistent Background and Character
BachVid: Training-Free Video Generation with Consistent Background and Character
Han Yan
Xibin Song
Yifu Wang
Hongdong Li
Pan Ji
Chao Ma
DiffMVGen
12
0
0
24 Oct 2025
Epipolar Geometry Improves Video Generation Models
Epipolar Geometry Improves Video Generation Models
Orest Kupyn
Fabian Manhardt
F. Tombari
Christian Rupprecht
VGen
48
0
0
24 Oct 2025
VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance
VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance
Ming Xie
Junqiu Yu
Qiaole Dong
Xiangyang Xue
Yanwei Fu
VGen
12
0
0
24 Oct 2025
PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis
PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis
Yu Yang
Zhilu Zhang
Xiang Zhang
Yihan Zeng
Hui Li
W. Zuo
VGenSyDaPINN
50
0
0
24 Oct 2025
Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video
Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video
Ciara Rowles
Varun Jampani
Simon Donné
Shimon Vainer
Julian Parker
Zach Evans
VGen
12
0
0
24 Oct 2025
WorldGrow: Generating Infinite 3D World
WorldGrow: Generating Infinite 3D World
Sikuang Li
Chen-Ning Yang
Jiemin Fang
Taoran Yi
Jia Lu
Jiazhong Cen
Lingxi Xie
Wei Shen
Qi Tian
VGen
12
0
0
24 Oct 2025
Breakdance Video classification in the age of Generative AI
Breakdance Video classification in the age of Generative AI
Sauptik Dhar
Naveen Ramakrishnan
Michelle Munson
VGenVLM
12
0
0
23 Oct 2025
Evaluating Video Models as Simulators of Multi-Person Pedestrian Trajectories
Evaluating Video Models as Simulators of Multi-Person Pedestrian Trajectories
Aaron Appelle
Jerome P. Lynch
EGVMVGen
12
0
0
23 Oct 2025
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling
Bingjie Gao
Qianli Ma
Xiaoxue Wu
Shuai Yang
Guanzhou Lan
...
Qingyang Liu
Yu Qiao
Xinyuan Chen
Y. Wang
Li Niu
VGen
12
0
0
23 Oct 2025
From Masks to Worlds: A Hitchhiker's Guide to World Models
From Masks to Worlds: A Hitchhiker's Guide to World Models
Jinbin Bai
Yu Lei
H. Wu
Yuchen Zhu
Shufan Li
Yi Xin
Xiangtai Li
Molei Tao
Aditya Grover
Ming-Hsuan Yang
VGenSyDa
20
0
0
23 Oct 2025
DMC$^3$: Dual-Modal Counterfactual Contrastive Construction for Egocentric Video Question Answering
DMC3^33: Dual-Modal Counterfactual Contrastive Construction for Egocentric Video Question Answering
Jiayi Zou
Chaofan CHEN
Bing-Kun Bao
Changsheng Xu
EgoVVGen
53
0
0
23 Oct 2025
Video-As-Prompt: Unified Semantic Control for Video Generation
Video-As-Prompt: Unified Semantic Control for Video Generation
Yuxuan Bian
Xin Chen
Zenan Li
Tiancheng Zhi
S. Sang
Linjie Luo
Qiang Xu
DiffMVGen
16
0
0
23 Oct 2025
FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation
FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation
Wenhao Wang
Kehe Ye
Xinyu Zhou
Tianxing Chen
Cao Min
...
Ping Luo
Yongjian Shen
Yang Yang
Maoqing Yao
Yao Mu
VGen
56
0
0
23 Oct 2025
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence
Jiahao Meng
X. Li
Haochen Wang
Yue Tan
Tao Zhang
...
Yunhai Tong
Anran Wang
Zhiyang Teng
Y. Wang
Z. Wang
VGenLRM
50
0
0
23 Oct 2025
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Yihao Meng
Hao Ouyang
Yue Yu
Qiuyu Wang
Wen Wang
...
Yixuan Li
Cheng Chen
Yanhong Zeng
Yujun Shen
Huamin Qu
VGen
20
0
0
23 Oct 2025
Positional Encoding Field
Positional Encoding Field
Yunpeng Bai
Haoxiang Li
Qixing Huang
VGen
12
0
0
23 Oct 2025
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen
Ziyu Jiang
Mingfu Liang
Bingbing Zhuang
Jong-Chyi Su
Sparsh Garg
Ying Wu
Manmohan Chandraker
VGen
20
0
0
23 Oct 2025
Semantic World Models
Semantic World Models
Jacob Berg
Chuning Zhu
Yanda Bao
Ishan Durugkar
Abhishek Gupta
LM&RoVGen
13
0
0
22 Oct 2025
Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks
Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks
Kai Zeng
Zhanqian Wu
Kaixin Xiong
Xiaobao Wei
Xiangyu Guo
...
Haiyang Sun
Bing Wang
Guang Chen
Hangjun Ye
W. Zhang
VGen
36
0
0
22 Oct 2025
OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation
OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation
Guowei Xu
Yuxuan Bian
Ailing Zeng
Mingyi Shi
Shaoli Huang
Wen Li
Lixin Duan
Qiang Xu
VGen
12
0
0
22 Oct 2025
Improving the Physics of Video Generation with VJEPA-2 Reward Signal
Improving the Physics of Video Generation with VJEPA-2 Reward Signal
Jianhao Yuan
Xiaofeng Zhang
Felix Friedrich
Nicolas Beltran-Velez
Melissa Hall
Reyhane Askari Hemmat
Xiaochuang Han
Nicolas Ballas
M. Drozdzal
Adriana Romero-Soriano
VGenEGVM
98
0
0
22 Oct 2025
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction
Zhida Zhao
Talas Fu
Yifan Wang
Lijun Wang
Huchuan Lu
VGen
20
0
0
22 Oct 2025
Loading #Papers per Month with "VGen"
Past speakers
Name (-)
Top Contributors
Name (-)
Top Organizations at ResearchTrend.AI
Name (-)
Social Events
DateLocationEvent
No social events available