arXiv:2504.10465
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding
14 April 2025
Tao Zhang
Xuelong Li
Zilong Huang
Yuchen Li
Weixian Lei
XueQing Deng
Shihao Chen
S. Ji
Jiashi Feng
MLLM
LRM
Papers citing "Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding"
2 / 2 papers shown
Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training
Yiran Chen
Hao Peng
Tong Zhang
Heng Ji
VLM
29
0
0
13 May 2025
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
Weixian Lei
Jiacong Wang
Haochen Wang
Xuelong Li
Jun Hao Liew
Jiashi Feng
Zilong Huang
32
2
0
14 Apr 2025