Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.05795
Cited By
Large Multimodal Model Compression via Efficient Pruning and Distillation at AntGroup
10 December 2023
Maolin Wang
Yao-Min Zhao
Jiajia Liu
Jingdong Chen
Chenyi Zhuang
Jinjie Gu
Ruocheng Guo
Xiangyu Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Large Multimodal Model Compression via Efficient Pruning and Distillation at AntGroup"
7 / 7 papers shown
Title
LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration
Wenjie Qu
Pengyue Jia
Xin Li
Derong Xu
Maolin Wang
...
Zhaocheng Du
Huifeng Guo
Y. Liu
Ruiming Tang
Xiangyu Zhao
49
0
0
08 May 2025
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
Mohsen Gholami
Mohammad Akbari
Kevin Cannons
Yong Zhang
65
0
0
07 Mar 2025
ToFu: Visual Tokens Reduction via Fusion for Multi-modal, Multi-patch, Multi-image Task
Vittorio Pippi
Matthieu Guillaumin
S. Cascianelli
Rita Cucchiara
M. Jaritz
Loris Bazzani
64
0
0
06 Mar 2025
Large Language Models for Generative Information Extraction: A Survey
Derong Xu
Wei-neng Chen
Wenjun Peng
Chao Zhang
Tong Xu
Xiangyu Zhao
Xian Wu
Yefeng Zheng
Yang Wang
Enhong Chen
51
146
0
29 Dec 2023
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qiang Qi
Ji Zhang
Feiyan Huang
Jingren Zhou
VLM
MLLM
210
906
0
27 Apr 2023
Multimodal Recommender Systems: A Survey
Qidong Liu
Jiaxi Hu
Yutian Xiao
Jingtong Gao
Xiang Zhao
33
32
0
08 Feb 2023
Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT
Sheng Shen
Zhen Dong
Jiayu Ye
Linjian Ma
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
236
576
0
12 Sep 2019
1