M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?

M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?

Papers citing "M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?"

Title
No papers