arXiv preprint arXiv:2509.10798, 2025.

Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction

arXiv preprint arXiv:2509.10798, 2025.

Liu, Yijun and Wang, Yixuan and Xu, Yuzhuang and Ji, Shiyu and Xu, Yang and Zhu, Qingfu and Che, Wanxiang

Judge Q: Trainable Queries for Optimized Information Retention in KV Cache Eviction

arXiv preprint arXiv:2509.10798, 2025.

Liu, Yijun and Wang, Yixuan and Xu, Yuzhuang and Ji, Shiyu and Xu, Yang and Zhu, Qingfu and Che, Wanxiang