Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.11875
Cited By
M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation
19 February 2024
Hongcheng Liu
Pingjie Wang
Yu Wang
Yanfeng Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation"
6 / 6 papers shown
Title
Question-Aware Gaussian Experts for Audio-Visual Question Answering
Hongyeob Kim
Inyoung Jung
Dayoon Suh
Youjia Zhang
Sangmin Lee
Sungeun Hong
61
0
0
06 Mar 2025
HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue
Sunjae Yoon
Dahyun Kim
Eunseop Yoon
Hee Suk Yoon
Junyeong Kim
C. Yoo
39
6
0
15 Dec 2023
MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation
Hongcheng Liu
Zhe Chen
Hui Li
Pingjie Wang
Yanfeng Wang
Yu Wang
VGen
51
1
0
26 Sep 2023
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation
Xueliang Zhao
Yuxuan Wang
Chongyang Tao
Chenshuo Wang
Dongyan Zhao
43
6
0
22 Oct 2022
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
152
362
0
17 Sep 2021
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues
Hung Le
Doyen Sahoo
Nancy F. Chen
Guosheng Lin
42
30
0
20 Oct 2020
1