Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.01743
Cited By
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
3 March 2025
Abdelrahman Abouelenin
Atabak Ashfaq
Adam Atkinson
Hany Awadalla
Nguyen Bach
Jianmin Bao
Alon Benhaim
Martin Cai
Vishrav Chaudhary
C. L. Philip Chen
Dong Chen
Dongdong Chen
J. Chen
Weizhu Chen
Yen-Chun Chen
Yi-Ling Chen
Qi Dai
Xiyang Dai
Ruchao Fan
Mei Gao
Min Gao
Amit Garg
Abhishek Goswami
Junheng Hao
Amr Hendy
Yuxuan Hu
Xin Jin
Mahmoud Khademi
Dongwoo Kim
Young Jin Kim
Gina Lee
Jiajian Li
Yongbin Li
Chen Liang
Xihui Lin
Zeqi Lin
M. Liu
Yang Liu
Gilsinia Lopez
Chong Luo
Piyush Madan
Piyush Madan
V. Mazalov
Anh Nguyen
Ali Mousavi
Daniel Perez-Becker
J. Pan
Thomas Portet
Jacob Platin
Bo Ren
Kai Qiu
Sambuddha Roy
Liliang Ren
Yelong Shen
Ning Shang
Subhojit Som
Saksham Singhal
Tetyana Sych
Xia Song
Shuohang Wang
Praneetha Vaddamanu
Zehua Wang
Yiming Wang
Haoran Xu
Haibin Wu
Yifan Yang
Weijian Xu
Donghan Yu
Ziyi Yang
Jianwen Zhang
Ishmam Zabir
Yunan Zhang
Li Zhang
Wenjie Qu
Xiren Zhou
MoE
SyDa
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs"
19 / 19 papers shown
Title
PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation
Liuji Chen
Xiaofang Yang
Yuanzhuo Lu
Jinghao Zhang
Xin Sun
Qiang Liu
Shu Wu
Jing Dong
Liang Wang
AAML
2
0
0
18 May 2025
Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput
Bo Zhang
Shuo Li
Runhe Tian
Yang Yang
Jixin Tang
Jinhao Zhou
Lin Ma
VLM
30
0
0
14 May 2025
Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?
Andrew Rouditchenko
Saurabhchand Bhati
Edson Araujo
Samuel Thomas
Hilde Kuehne
Rogerio Feris
James R. Glass
AuLLM
VLM
44
0
0
14 May 2025
Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People
Haoshuai Zhou
Boxuan Cao
Changgeng Mo
Linkai Li
Shan Xiang Wang
AI4CE
31
0
0
13 May 2025
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning
Zhehao Zhang
Weijie Xu
Fanyou Wu
Chandan K. Reddy
29
0
0
12 May 2025
BLAB: Brutally Long Audio Bench
Orevaoghene Ahia
Martijn Bartelds
Kabir Ahuja
Hila Gonen
Valentin Hofmann
...
Noah Bennett
Shinji Watanabe
Noah A. Smith
Yulia Tsvetkov
Sachin Kumar
AuLLM
LM&MA
VLM
63
0
0
05 May 2025
Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models
Paloma Piot
Patricia Martín-Rodilla
Javier Parapar
50
0
0
04 May 2025
SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation
Quang P.M. Pham
Khoi T.N. Nguyen
Nhi H. Doan
Cuong Pham
Kentaro Inui
Dezhen Song
65
0
0
01 May 2025
Empowering Agentic Video Analytics Systems with Video Language Models
Yuxuan Yan
Shiqi Jiang
Ting Cao
Yifan Yang
Qianqian Yang
Yuanchao Shu
Yuqing Yang
Lili Qiu
VLM
70
0
0
01 May 2025
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math
Haoran Xu
Baolin Peng
Hany Awadalla
Dongdong Chen
Yen-Chun Chen
...
Yelong Shen
S. Wang
Weijian Xu
Jianfeng Gao
Weizhu Chen
ReLM
LRM
75
1
0
30 Apr 2025
Improving Phishing Email Detection Performance of Small Large Language Models
Zijie Lin
Zikang Liu
Hanbo Fan
31
0
0
29 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Shixuan Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
35
0
0
15 Apr 2025
On The Landscape of Spoken Language Models: A Comprehensive Survey
Siddhant Arora
Kai-Wei Chang
Chung-Ming Chien
Yifan Peng
Haibin Wu
Yossi Adi
Emmanuel Dupoux
Hung-yi Lee
Karen Livescu
Shinji Watanabe
52
2
0
11 Apr 2025
From Speech to Summary: A Comprehensive Survey of Speech Summarization
Fabian Retkowski
Maike Züfle
Andreas Sudmann
Dinah Pfau
Jan Niehues
Alexander Waibel
46
0
0
10 Apr 2025
StarFlow: Generating Structured Workflow Outputs From Sketch Images
Patrice Bechard
Chao Wang
Amirhossein Abaskohi
Juan A. Rodriguez
Christopher Pal
David Vazquez
Spandana Gella
Sai Rajeswar
Perouz Taslakian
33
0
0
27 Mar 2025
OmniVox: Zero-Shot Emotion Recognition with Omni-LLMs
John Murzaku
Owen Rambow
AuLLM
46
0
0
27 Mar 2025
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Weiming Ren
Wentao Ma
Huan Yang
Cong Wei
Ge Zhang
Wenhu Chen
Mamba
59
3
0
14 Mar 2025
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search
Yiming Jia
Jiashi Li
Xiang Yue
Bo Li
Ping Nie
Kai Zou
Wenhu Chen
LRM
79
2
0
13 Mar 2025
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions
Zhihao He
Hang Yu
Zi Gong
Shizhan Liu
J. Li
Weiyao Lin
VLM
38
1
0
09 Oct 2024
1