ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.23661
  4. Cited By
LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training
v1v2v3 (latest)

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

28 September 2025
Xiang An
Yin Xie
Kaicheng Yang
Wenkang Zhang
X. Zhao
Zheng Cheng
Y. Wang
Songcen Xu
Changrui Chen
Chunsheng Wu
Huajie Tan
Chunyuan Li
J. Yang
Jie Yu
Xiyao Wang
Bin Qin
Yumeng Wang
Zizhen Yan
Ziyong Feng
Ziwei Liu
Bo Li
Jiankang Deng
Jiankang Deng
    MLLMVLMSyDa
ArXiv (abs)PDFHTMLHuggingFace (35 upvotes)Github

Papers citing "LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training"

8 / 8 papers shown
Title
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
Tianyi Xiong
Yi Ge
Ming Li
Zuolong Zhang
Pranav Kulkarni
...
Yanshuo Chen
X. Wang
Renrui Zhang
Wenhu Chen
Heng Huang
113
0
0
26 Nov 2025
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
Yi Yang
X. Li
Yiyang Chen
Jin Song
Yihan Wang
Zipeng Xiao
Jiadi Su
You Qiaoben
Pengfei Liu
Zhijie Deng
VLM
145
0
0
20 Nov 2025
V-Thinker: Interactive Thinking with Images
V-Thinker: Interactive Thinking with Images
Runqi Qiao
Qiuna Tan
Minghan Yang
Guanting Dong
Peiqing Yang
...
Yida Xu
Lan Yang
Chong Sun
Chen Li
Honggang Zhang
MLLMLRM
301
1
0
06 Nov 2025
DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry
Changti Wu
Shijie Lian
Zihao Liu
Lei Zhang
Laurence Tianruo Yang
Kai Chen
AIMat
381
0
0
25 Oct 2025
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
Z. Chen
M. Zhang
Xinlei Yu
Xufang Luo
Mingze Sun
Zihao Pan
Yan Feng
Peng Pei
Xunliang Cai
Ruqi Huang
VGenLRM
96
7
0
21 Oct 2025
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
Jiaying Zhu
Yurui Zhu
Xin Lu
Wenrui Yan
Dong Li
Kunlin Liu
Xueyang Fu
Zheng-Jun Zha
MQVLM
195
0
0
18 Oct 2025
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
Tiancheng Gu
Kaicheng Yang
Kaichen Zhang
Xiang An
Ziyong Feng
Y. Zhang
Weidong Cai
Jiankang Deng
Lidong Bing
149
4
0
15 Oct 2025
A Survey on Agentic Multimodal Large Language Models
A Survey on Agentic Multimodal Large Language Models
Huanjin Yao
Ruifei Zhang
Jiaxing Huang
Jingyi Zhang
Yibo Wang
...
Ruolin Zhu
Yongcheng Jing
Shunyu Liu
Guanbin Li
Dacheng Tao
LM&RoAIFinAI4TSLRMAI4CE
201
4
0
13 Oct 2025
1