CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs
Transactions of the Association for Computational Linguistics
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
TACL
Transactions of the Association for Computational Linguistics
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
Transactions of the Association for Computational Linguistics
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang