arXiv:2503.14694
HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding
12 March 2025
Rui Yang, Lin Song, Yicheng Xiao, Runhui Huang, Yixiao Ge, Ying Shan, Hengshuang Zhao
MLLM
Papers citing "HaploVL: A Single-Transformer Baseline for Multi-Modal Understanding"
4 / 4 papers shown
LoRA-Gen: Specializing Large Language Model via Online LoRA Generation
Yicheng Xiao, Lin Song, Rui Yang, Cheng Cheng, Yixiao Ge, Xiu Li, Y. Shan
OffRL
14
0
0
13 Jun 2025
HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation
Yicheng Xiao, Lin Song, Rui Yang, Cheng Cheng, Zunnan Xu, Zhaoyang Zhang, Yixiao Ge, Xiu Li, Ying Shan
40
2
0
03 Jun 2025
SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning
Jiaqi Huang, Zunnan Xu, Jun Zhou, Ting Liu, Yicheng Xiao, Mingwen Ou, Bowen Ji, Xiu Li, Kehong Yuan
VLM
79
0
0
28 May 2025
Chameleon: Mixed-Modal Early-Fusion Foundation Models
Chameleon Team
MLLM
207
338
0
16 May 2024