arXiv: 2501.15368
Baichuan-Omni-1.5 Technical Report
28 January 2025
Yadong Li
Jiaheng Liu
Tao Zhang
Tao Zhang
Tian Jin
Tianpeng Li
Zehan Li
L. Liu
Lingfeng Ming
Guosheng Dong
Zhuoran Zhang
Chong Li
Yuanbo Fang
Dongdong Kuang
Hao Wu
C. Zhu
Yuhui Zhang
Hongyu Guo
Fengyu Zhang
Yansen Wang
Bowen Ding
Wei Song
Xuelong Li
Yuqi Huo
Zheng Liang
Da Pan
Shusen Zhang
Shuai Zhao
Linchu Xiong
Yongpeng Wu
Jiahui Ye
Wenhao Lu
Bowen Li
Yan Zhang
Yaqi Zhou
Xin Chen
Lei Su
Jun Wang
F. Chen
Xuezhen Dong
Na Nie
Zhikai Wu
Bin Xiao
Ting Li
Shunya Dang
Ping Zhang
Yizhou Sun
Jincheng Wu
Jinjie Yang
X. Lin
Zhi-Ming Ma
Kegeng Wu
Jia Li
Aiyuan Yang
Hui Liu
J. Zhang
Xiaoxi Chen
Guangwei Ai
Feiyu Xiong
Yushen Chen
Xiaoqin Huang
Kun Li
Wenjing Luo
Yifei Duan
Lingling Zhu
Ran Xiao
Zhe Su
Jiani Pu
Dian Wang
X. Jia
Tianze Zhang
Mengyu Ai
Mang Wang
Yujing Qiao
L. Zhang
Yanjun Shen
Fan Yang
Miao Zhen
Yijie Zhou
Mingyang Chen
Fei Li
Chenzheng Zhu
Keer Lu
Yaqi Zhao
Hao Liang
Heng Chang
Yanzhao Qin
Linzhuang Sun
Jianhua Xu
Haoze Sun
Mingan Lin
Zenan Zhou
Xin Wu
AuLLM
Papers citing "Baichuan-Omni-1.5 Technical Report"
10 / 10 papers shown
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix
Ziyang Ma
Yinghao Ma
Yanqiao Zhu
Chen Yang
Yi-Wen Chao
...
Wei Xue
Emmanouil Benetos
Kai Yu
Eng Siong Chng
Xie Chen
AuLLM
LRM
12
0
0
19 May 2025
CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction
Jing Zou
Qingqiu Li
Chenyu Lian
Lihao Liu
Xiaohan Yan
Shujun Wang
Jing Qin
VLM
2
0
0
17 May 2025
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
Yi-Fan Zhang
Xingyu Lu
X. Hu
Chaoyou Fu
Bin Wen
...
Jianfei Chen
Fan Yang
Z. Zhang
Tingting Gao
Liang Wang
OffRL
LRM
48
0
0
05 May 2025
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
Yucheng Li
Huiqiang Jiang
Chengruidong Zhang
Qianhui Wu
Xufang Luo
...
Amir H. Abdi
Dongsheng Li
Jianfeng Gao
Yuqing Yang
Lili Qiu
35
1
0
22 Apr 2025
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models
Wei Chen
Xin Yan
Bin Wen
Fan Yang
Tingting Gao
Di Zhang
Long Chen
MLLM
97
0
0
09 Apr 2025
VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation
Yuhao Wang
Heyang Liu
Ziyang Cheng
Ronghua Wu
Qunshan Gu
Yanfeng Wang
Yu Wang
184
0
0
05 Apr 2025
Qwen2.5-Omni Technical Report
Jin Xu
Zhifang Guo
Jinzheng He
Hangrui Hu
Ting He
...
K. Dang
Bin Zhang
Xinyu Wang
Yunfei Chu
Junyang Lin
VGen
AuLLM
96
16
0
26 Mar 2025
DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies
Wei Song
Yansen Wang
Zijia Song
Yadong Li
Haoze Sun
Xin Wu
Zenan Zhou
Jianhua Xu
Jiaqi Wang
Kaicheng Yu
60
2
0
18 Mar 2025
ViSpeak: Visual Instruction Feedback in Streaming Videos
Shenghao Fu
Q. Yang
Yuan-Ming Li
Yi-Xing Peng
Kun-Yu Lin
Xihan Wei
Jian-Fang Hu
Xiaohua Xie
Wei-Shi Zheng
VLM
67
1
0
17 Mar 2025
OmniBench: Towards The Future of Universal Omni-Language Models
Yizhi Li
Ge Zhang
Yinghao Ma
Ruibin Yuan
Kang Zhu
...
Zhaoxiang Zhang
Zachary Liu
Emmanouil Benetos
Wenhao Huang
Chenghua Lin
LRM
54
11
0
23 Sep 2024