2504.10479
v3 (latest)
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
14 April 2025
Jinguo Zhu
Weiyun Wang
Zhe Chen
Ziwei Liu
Shenglong Ye
Lixin Gu
Yuchen Duan
H. Tian
Weijie Su
Jie Shao
Zhangwei Gao
Erfei Cui
Yue Cao
Yangzhou Liu
Xingguang Wei
Hongjie Zhang
Haomin Wang
Wenyuan Xu
Hao Li
Jiahao Wang
Dengnian Chen
Songze Li
Yinan He
Tan Jiang
Jiapeng Luo
Yi Wang
Conghui He
Botian Shi
Xinsong Zhang
Wenqi Shao
Junjun He
Yingtong Xiong
Wenwen Qu
Peng Sun
Penglong Jiao
Han Lv
Lijun Wu
Kai Zhang
Huipeng Deng
Jiaye Ge
Kai Chen
Limin Wang
Min Dou
Lewei Lu
X. Zhu
Tong Lu
Dahua Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
Wei Wang
MLLM
VLM
Papers citing
"InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models"
11 / 161 papers shown
Title
Scene Text Visual Question Answering
Ali Furkan Biten
Rubèn Pérez Tito
Andrés Mafla
Lluís Gómez
Marçal Rusiñol
Ernest Valveny
C. V. Jawahar
Dimosthenis Karatzas
108
360
0
31 May 2019
HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
179
2,523
0
19 May 2019
Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
62
102
0
21 Apr 2019
Towards VQA Models That Can Read
Amanpreet Singh
Vivek Natarajan
Meet Shah
Yu Jiang
Xinlei Chen
Dhruv Batra
Devi Parikh
Marcus Rohrbach
EgoV
111
1,253
0
18 Apr 2019
DVQA: Understanding Data Visualizations via Question Answering
Kushal Kafle
Brian L. Price
Scott D. Cohen
Christopher Kanan
AIMat
82
396
0
24 Jan 2018
Simple and Effective Multi-Paragraph Reading Comprehension
Christopher Clark
Matt Gardner
RALM
95
459
0
29 Oct 2017
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
222
2,686
0
09 May 2017
RACE: Large-scale ReAding Comprehension Dataset From Examinations
Guokun Lai
Qizhe Xie
Hanxiao Liu
Yiming Yang
Eduard H. Hovy
ELM
191
1,357
0
15 Apr 2017
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
347
3,270
0
02 Dec 2016
A Diagram Is Worth A Dozen Images
Aniruddha Kembhavi
M. Salvato
Eric Kolve
Minjoon Seo
Hannaneh Hajishirzi
Ali Farhadi
3DV
81
505
0
24 Mar 2016
Generation and Comprehension of Unambiguous Object Descriptions
Junhua Mao
Jonathan Huang
Alexander Toshev
Oana-Maria Camburu
Alan Yuille
Kevin Patrick Murphy
ObjD
131
1,357
0
07 Nov 2015