
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Papers citing "mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding"
26 / 26 papers shown
Title | Authors |
---|---|
MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? | Yi-Fan Zhang, Huanyu Zhang, Haochen Tian, Chaoyou Fu, Shuangqing Zhang, ..., Qingsong Wen, Zhang Zhang, Liwen Wang, Rong Jin, Tieniu Tan |
Qwen Technical Report | Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, ..., Zhenru Zhang, Chang Zhou, Jingren Zhou, Xiaohuan Zhou, Tianhang Zhu |