Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2411.07076
Cited By
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification
11 November 2024
Yichen He
Yuan Lin
Jianchao Wu
Hanchong Zhang
Yuchen Zhang
Ruicheng Le
VGen
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification"
2 / 2 papers shown
Title
Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark
Enxin Song
Wenhao Chai
Weili Xu
Jianwen Xie
Yuxuan Liu
Gaoang Wang
62
0
0
20 Apr 2025
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
Xinlong Chen
Yang Zhang
Chongling Rao
Yushuo Guan
Jiaheng Liu
Fuzheng Zhang
Chengru Song
Qiang Liu
Di Zhang
Tieniu Tan
15
0
0
18 Feb 2025
1