CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
Ji Shiyu
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang