Neighbor communities
0 / 0 papers shown
Title |
|---|
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
Title |
|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Innovative methods and technologies for generating high-quality video content using AI and machine learning techniques.
Title |
|---|
Title | |||
|---|---|---|---|
![]() 3DProxyImg: Controllable 3D-Aware Animation Synthesis from Single Image via 2D-3D Aligned Proxy Embedding Yupeng Zhu Xiongzhen Zhang Ye Chen Bingbing Ni | |||
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling Yuwei Guo Ceyuan Yang Hao He Yang Zhao Meng Wei Zhenheng Yang Weilin Huang Dahua Lin | |||
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations Yuxiang Shi Zhe Li Yanwen Wang Hao Zhu Xun Cao Ligang Liu | |||
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models Bozhou Li Sihan Yang Yushuo Guan Ruichuan An Xinlong Chen Yang Shi Pengfei Wan Wentao Zhang Yuanxing zhang | |||
IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning Yuanhang Li Yiren Song Junzhe Bai Xinran Liang Hu Yang Libiao Jin Qi Mao | |||
Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning Yifei Li Wenzhao Zheng Yanran Zhang Runze Sun Yu Zheng Lei Chen Jie Zhou Jiwen Lu | |||
![]() Elastic3D: Controllable Stereo Video Conversion with Guided Latent Decoding Nando Metzger Prune Truong Goutam Bhat Konrad Schindler Federico Tombari | |||
![]() TalkVerse: Democratizing Minute-Long Audio-Driven Video Generation Zhenzhi Wang Jian Wang Ke Ma Dahua Lin Bing Zhou | |||
![]() End-to-End Learning-based Video Streaming Enhancement Pipeline: A Generative AI ApproachInternational Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV), 2025 Emanuele Artioli Farzad Tashtarian Christian Timmerer | |||
![]() ViBES: A Conversational Agent with Behaviorally-Intelligent 3D Virtual Body Juze Zhang Changan Chen Xin Chen Heng Yu Tiange Xiang Ali Sartaz Khan Shrinidhi K. Lakshmikanth Ehsan Adeli | |||
![]() MMGR: Multi-Modal Generative Reasoning Zefan Cai Haoyi Qiu Tianyi Ma Haozhe Zhao Gengze Zhou ...Minjia Zhang Wen Xiao Jiuxiang Gu Nanyun Peng Junjie Hu | |||
![]() S2D: Sparse-To-Dense Keymask Distillation for Unsupervised Video Instance Segmentation Leon Sick Lukas Hoyer Dominik Engel Pedro Hermosilla Timo Ropinski | |||
![]() DRAW2ACT: Turning Depth-Encoded Trajectories into Robotic Demonstration Videos Yang Bai Liudi Yang George Eskandar Fengyi Shen Mohammad Altillawi Ziyuan Liu Gitta Kutyniok | |||
![]() WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Wenqiang Sun Haiyu Zhang Haoyuan Wang Junta Wu Zehan Wang Zhenwei Wang Yunhong Wang Jun Zhang Tengfei Wang Chunchao Guo | |||
![]() AnimaMimic: Imitating 3D Animation from Video Priors Tianyi Xie Yunuo Chen Yaowei Guo Yin Yang Bolei Zhou Demetri Terzopoulos Ying Jiang Chenfanfu Jiang | |||
![]() Distill Video Datasets into Images Zhenghao Zhao Haoxuan Wang Kai Wang Yuzhang Shang Yuan Hong Yan Yan | |||
![]() MobileWorldBench: Towards Semantic World Modeling For Mobile Agents Shufan Li Konstantinos Kallidromitis Akash Gokul Yusuke Kato Kazuki Kozuka Aditya Grover | |||
![]() SneakPeek: Future-Guided Instructional Streaming Video Generation Cheeun Hong German Barquero Fadime Sener Markos Georgopoulos Edgar Schönfeld Stefan Popov Yuming Du Oscar Mañas Albert Pumarola | |||
![]() Content Adaptive based Motion Alignment Framework for Learned Video Compression Tiange Zhang Xiandong Meng Siwei Ma | |||
![]() Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models Shweta Mahajan Shreya Kadambi Hoang Le Munawar Hayat Fatih Porikli | |||
![]() Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation Jiangning Zhang Junwei Zhu Zhenye Gan Donghao Luo Chuming Lin ...Xu Chen Chencan Fu Keke He Xiaobin Hu Chengjie Wang | |||
![]() STARCaster: Spatio-Temporal AutoRegressive Video Diffusion for Identity- and View-Aware Talking Portraits Foivos Paraperas Papantoniou Stathis Galanakis Rolandos Alexandros Potamias Bernhard Kainz Stefanos Zafeiriou | |||
![]() LongVie 2: Multimodal Controllable Ultra-Long Video World Model Jianxiong Gao Zhaoxi Chen Xian Liu Junhao Zhuang Chengming Xu Jianfeng Feng Yu Qiao Yanwei Fu Chenyang Si Ziwei Liu | |||
![]() World Models Can Leverage Human Videos for Dexterous Manipulation Raktim Gautam Goswami Amir Bar David Fan Tsung-Yen Yang Gaoyue Zhou Prashanth Krishnamurthy Michael Rabbat Farshad Khorrami Yann LeCun | |||
![]() UniVCD: A New Method for Unsupervised Change Detection in the Open-Vocabulary Era Ziqiang Zhu Bowei Yang | |||
![]() KlingAvatar 2.0 Technical Report Kling Team Jialu Chen Yikang Ding Zhixue Fang Kun Gai ...Chao Wang Xuebo Wang Haoxian Zhang Yuanxing Zhang Yan Zhou | |||
![]() JoVA: Unified Multimodal Learning for Joint Video-Audio Generation Xiaohu Huang Hao Zhou Qiangpeng Yang Shilei Wen Kai Han | |||
![]() PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence Ruiyan Wang Teng Hu Kaihui Huang Zihan Su Ran Yi Lizhuang Ma | |||
![]() Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Jiaqi Wang Weijia Wu Yi Zhan Rui Zhao Ming Hu James Cheng Wei Liu Philip Torr Kevin Qinghong Lin | |||
![]() Recurrent Video Masked Autoencoders Daniel Zoran Nikhil Parthasarathy Yi Yang Drew A Hudson Joao Carreira Andrew Zisserman | |||
![]() DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders Susung Hong Chongjian Ge Zhifei Zhang Jui-Hsien Wang | |||
![]() Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model Heyi Chen Siyan Chen Xin Chen Yanfei Chen Ying Chen ...Xueqiong Qu Yuxi Ren Kai Shen Guang Shi Lei Shi | |||
![]() Beyond the Visible: Disocclusion-Aware Editing via Proxy Dynamic Graphs Anran Qi Changjian Li Adrien Bousseau Niloy J.Mitra | |||
![]() Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal Weihan Xu Kan Jen Cheng Koichi Saito Muhammad Jehanzeb Mirza Tingle Li ...Masato Ishii Takashi Shibuya Yuki Mitsufuji Gopala Anumanchipalli Paul Pu Liang | |||
![]() Animus3D: Text-driven 3D Animation via Motion Score Distillation Qi Sun Can Wang Jiaxiang Shang Wensen Feng Jing Liao | |||
![]() GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation Zhenya Yang Zhe Liu Yuxiang Lu Liping Hou Chenxuan Miao Siyi Peng Bailan Feng Xiang Bai Hengshuang Zhao | |||
![]() FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning Yue Jiang Dingkang Yang Minghao Han Jinghang Han Zizhi Chen Yizhou Liu Mingcheng Li Peng Zhai Lihua Zhang | |||
![]() Robust Motion Generation using Part-level Reliable Data from Videos Boyuan Li Sipeng Zheng Bin Cao Ruihua Song Zongqing Lu | |||
![]() Generative Spatiotemporal Data Augmentation Jinfan Zhou Lixin Luo Sungmin Eum Heesung Kwon Jeong Joon Park | |||
![]() Endless World: Real-Time 3D-Aware Long Video Generation Ke Zhang Yiqun Mei Jiacong Xu Vishal M. Patel | |||
![]() Audio-Visual Camera Pose Estimation with Passive Scene Sounds and In-the-Wild Video Daniel Adebi Sagnik Majumder Kristen Grauman | |||
![]() V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping Hyunkoo Lee Wooseok Jang Jini Yang Taehwan Kim Sangoh Kim Sangwon Jung Seungryong Kim | |||
![]() SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation Xuancheng Xu Yaning Li Sisi You Bing-Kun Bao | |||
![]() STAGE: Storyboard-Anchored Generation for Cinematic Multi-shot Narrative Peixuan Zhang Zijian Jia Kaiqi Liu Shuchen Weng Si Li Boxin Shi | |||
![]() CineLOG: A Training Free Approach for Cinematic Long Video Generation Zahra Dehghanian Morteza Abolghasemi Hamid Beigy Hamid R. Rabiee | |||
![]() AutoMV: An Automatic Multi-Agent System for Music Video Generation Xiaoxuan Tang Xinping Lei Chaoran Zhu Shiyun Chen Ruibin Yuan ...Wenhao Huang Emmanouil Benetos Yang Liu Jiaheng Liu Yinghao Ma | |||
![]() ArtGen: Conditional Generative Modeling of Articulated Objects in Arbitrary Part-Level States Haowen Wang Xiaoping Yuan Fugang Zhang Rui Jian Yuanwei Zhu Xiuquan Qiao Yakun Huang | |||
![]() Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation Yang Fei George Stoica Jingyuan Liu Qifeng Chen Ranjay Krishna Xiaojuan Wang Benlin Liu | |||
![]() Reframing Music-Driven 2D Dance Pose Generation as Multi-Channel Image Generation Yan Zhang Han Zou Lincong Feng Cong Xie Ruiqi Yu Zhenpeng Zhan | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||