2501.13468
Cited By
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
23 January 2025
Haomiao Xiong
Zhiyong Yang
Jiazuo Yu
Yunzhi Zhuge
Lu Zhang
Jiawen Zhu
Huchuan Lu
Papers citing
"Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge"
8 / 8 papers shown
Title
FindingDory: A Benchmark to Evaluate Memory in Embodied Agents
Karmesh Yadav
Yusuf Ali
Gunshi Gupta
Y. Gal
Z. Kira
LM&Ro
54
0
0
18 Jun 2025
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought
Chao Huang
Benfeng Wang
Jie Wen
Chengliang Liu
Wei Wang
Li Shen
Xiaochun Cao
LRM
73
0
0
26 May 2025
Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models
Keunwoo Peter Yu
Joyce Chai
MLLM
VLM
85
0
0
16 May 2025
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang
Bo Feng
Zhengfeng Lai
Mingze Xu
Shiyu Li
Weifeng Ge
Afshin Dehghan
Meng Cao
Ping Huang
OffRL
150
0
0
08 May 2025
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video
Shuhang Xun
Sicheng Tao
Jiajun Li
Yibo Shi
Zhixin Lin
...
Shikang Wang
Yang Liu
Hao Zhang
Ying Ma
Xuming Hu
VLM
LRM
100
1
0
04 May 2025
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook
Xu Zheng
Ziqiao Weng
Yuanhuiyi Lyu
Lutao Jiang
Haiwei Xue
Bin Ren
Danda Pani Paudel
N. Sebe
Luc Van Gool
Xuming Hu
3DV
143
10
0
23 Mar 2025
ViSpeak: Visual Instruction Feedback in Streaming Videos
Shenghao Fu
Q. Yang
Yuan-Ming Li
Yi-Xing Peng
Kun-Yu Lin
Xihan Wei
Jian-Fang Hu
Xiaohua Xie
Wei-Shi Zheng
VLM
145
1
0
17 Mar 2025
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
Xin Ding
Hao Wu
Yue Yang
Shiqi Jiang
Donglin Bai
Zhibo Chen
Ting Cao
403
1
0
08 Mar 2025