Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.12926
Cited By
Task-Oriented Feature Compression for Multimodal Understanding via Device-Edge Co-Inference
17 March 2025
Cheng Yuan
Ziqiang Liu
Jiashu Lv
Jiawei Shao
Yufei Jiang
Jing Zhang
Xuelong Li
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Task-Oriented Feature Compression for Multimodal Understanding via Device-Edge Co-Inference"
24 / 24 papers shown
Title
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
327
685
0
20 Feb 2025
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
Shaolei Zhang
Qingkai Fang
Zhe Yang
Yang Feng
MLLM
VLM
142
42
0
07 Jan 2025
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
Haodong Duan
Junming Yang
Junming Yang
Xinyu Fang
Lin Chen
...
Yuhang Zang
Pan Zhang
Jiaqi Wang
Dahua Lin
Kai Chen
LM&MA
VLM
142
177
0
16 Jul 2024
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Shengqiong Wu
Hao Fei
Xiangtai Li
Jiayi Ji
Hanwang Zhang
Tat-Seng Chua
Shuicheng Yan
MLLM
121
37
0
07 Jun 2024
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
Yuzhang Shang
Mu Cai
Bingxin Xu
Yong Jae Lee
Yan Yan
VLM
110
126
0
22 Mar 2024
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Liang Chen
Haozhe Zhao
Tianyu Liu
Shuai Bai
Junyang Lin
Chang Zhou
Baobao Chang
MLLM
VLM
98
149
0
11 Mar 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Taicheng Guo
Preslav Nakov
Yaqi Wang
Ruidi Chang
Shichao Pei
Nitesh Chawla
Olaf Wiest
Xiangliang Zhang
LLMAG
LM&Ro
AI4CE
LRM
147
309
0
21 Jan 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Xiang Yue
Yuansheng Ni
Kai Zhang
Tianyu Zheng
Ruoqi Liu
...
Yibo Liu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
OSLM
ELM
VLM
239
943
0
27 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
117
288
0
21 Nov 2023
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Yang Jin
Kun Xu
Kun Xu
Liwei Chen
Chao Liao
...
Xiaoqiang Lei
Di Zhang
Wenwu Ou
Kun Gai
Yadong Mu
MLLM
VLM
54
48
0
09 Sep 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
560
4,861
0
17 Apr 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
226
1,150
0
27 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
429
4,641
0
30 Jan 2023
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering
Pan Lu
Swaroop Mishra
Tony Xia
Liang Qiu
Kai-Wei Chang
Song-Chun Zhu
Oyvind Tafjord
Peter Clark
Ashwin Kalyan
ELM
ReLM
LRM
283
1,286
0
20 Sep 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications
Deniz Gunduz
Zhijin Qin
Iñaki Estella Aguerri
Harpreet S. Dhillon
Zhaohui Yang
Aylin Yener
Kai‐Kit Wong
C. Chae
86
452
0
19 Jul 2022
Task-Oriented Communication for Multi-Device Cooperative Edge Inference
Jiawei Shao
Yuyi Mao
Jun Zhang
60
133
0
01 Sep 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
477
10,496
0
17 Jun 2021
Checkerboard Context Model for Efficient Learned Image Compression
Dailan He
Yaoyan Zheng
Baochen Sun
Yan Wang
Hongwei Qin
74
283
0
29 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
967
29,731
0
26 Feb 2021
CompressAI: a PyTorch library and evaluation platform for end-to-end compression research
Jean Bégaint
Fabien Racapé
Simon Feltman
Akshay Pushparaja
VLM
63
426
0
05 Nov 2020
6G Networks: Beyond Shannon Towards Semantic and Goal-Oriented Communications
Emilio Calvanese Strinati
Sergio Barbarossa
80
395
0
04 Nov 2020
Channel-wise Autoregressive Entropy Models for Learned Image Compression
David C. Minnen
Saurabh Singh
69
412
0
17 Jul 2020
Communication-Computation Trade-Off in Resource-Constrained Edge Inference
Jiawei Shao
Jun Zhang
44
115
0
03 Jun 2020
BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems
Jiawei Shao
Jun Zhang
73
164
0
31 Oct 2019
1