TACL

CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs

Transactions of the Association for Computational Linguistics

Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
