StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification

11 November 2024

Papers citing "StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification"

2 / 2 papers shown

Title
Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark Enxin Song Wenhao Chai Weili Xu Jianwen Xie Yuxuan Liu Gaoang Wang 62 0 0 20 Apr 2025
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation Xinlong Chen Yang Zhang Chongling Rao Yushuo Guan Jiaheng Liu Fuzheng Zhang Chengru Song Qiang Liu Di Zhang Tieniu Tan 15 0 0 18 Feb 2025