ResearchTrend.AI

Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model

16 November 2024
Ting Liu
Liangtao Shi
Richang Hong
Yue Hu
Quanjun Yin
Linfeng Zhang
    MLLM
    VLM

Papers citing "Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model"

1 / 1 papers shown
Title
Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning
Bonan Li
Zicheng Zhang
Songhua Liu
Weihao Yu
Xinchao Wang
VLM
17 May 2025