ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.18610
  4. Cited By
PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

24 May 2025
Tengxuan Liu
Shiyao Li
Jiayi Yang
Tianchen Zhao
Feng Zhou
Xiaohui Song
Guohao Dai
Shengen Yan
Huazhong Yang
Yu Wang
    MQ
ArXivPDFHTML

Papers citing "PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs"

1 / 1 papers shown
Title
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing
Tianyu Fu
Yi Ge
Yichen You
Enshu Liu
Zhihang Yuan
Guohao Dai
Shengen Yan
Huazhong Yang
Yu Wang
MoE
LRM
62
0
0
27 May 2025
1