ResearchTrend.AI
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

22 December 2024
Xin Zhang
Yanzhao Zhang
Wen Xie
Mingxin Li
Ziqi Dai
Dingkun Long
Pengjun Xie
Meishan Zhang
Wenjie Li
Hao Fei
arXiv: 2412.16855

Papers citing "GME: Improving Universal Multimodal Retrieval by Multimodal LLMs"

9 / 9 papers shown
Title
FinRAGBench-V: A Benchmark for Multimodal RAG with Visual Citation in the Financial Domain
Suifeng Zhao
Zhuoran Jin
Sujian Li
Jun Gao
5
0
0
23 May 2025
MIRACL-VISION: A Large, multilingual, visual document retrieval benchmark
Radek Osmulski
Gabriel de Souza P. Moreira
Ronay Ak
Mengyao Xu
Benedikt Schifferer
Even Oldridge
VLM
20
0
0
16 May 2025
Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality
Xueguang Ma
Luyu Gao
Shengyao Zhuang
Jiaqi Samantha Zhan
Jamie Callan
Jimmy Lin
281
1
0
05 May 2025
A Multi-Granularity Retrieval Framework for Visually-Rich Documents
Mingjun Xu
Zehui Wang
Hengxing Cai
Renxin Zhong
3DV
53
0
0
01 May 2025
UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities
Woongyeong Yeo
Kangsan Kim
Soyeong Jeong
Jinheon Baek
Sung Ju Hwang
54
0
0
29 Apr 2025
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Mingkai Tian
Guorong Li
Yuankai Qi
Amin Beheshti
Javen Qinfeng Shi
Anton van den Hengel
Qingming Huang
VGen
47
0
0
31 Mar 2025
Joint Fusion and Encoding: Advancing Multimodal Retrieval from the Ground Up
Lang Huang
Qiyu Wu
Zhongtao Miao
T. Yamasaki
301
0
0
27 Feb 2025
Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval
Ze Liu
Junjie Zhou
Yueze Wang
Zheng Liu
Defu Lian
OffRL
253
0
0
17 Feb 2025
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
Mohammad Mahdi Abootorabi
Amirhosein Zobeiri
Mahdi Dehghani
Mohammadali Mohammadkhani
Bardia Mohammadi
Omid Ghahroodi
M. Baghshah
Ehsaneddin Asgari
RALM
111
5
0
12 Feb 2025