Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2409.17692
Cited By
MIO: A Foundation Model on Multimodal Tokens
26 September 2024
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
Yibo Zhang
Jiashuo Wang
Ning Shi
Siyu Li
Yizhi Li
Haoran Que
Zhaoxiang Zhang
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLM
AuLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MIO: A Foundation Model on Multimodal Tokens"
3 / 103 papers shown
Title
Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
Bryan A. Plummer
Liwei Wang
Christopher M. Cervantes
Juan C. Caicedo
Julia Hockenmaier
Svetlana Lazebnik
196
2,056
0
19 May 2015
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
286
4,484
0
20 Nov 2014
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
413
43,638
0
01 May 2014
Previous
1
2
3