CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
Xu Yuzhuang
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
arXiv preprint arXiv:2412.09282, 2024.
Xu, Yuzhuang and Ji, Shiyu and Zhu, Qingfu and Che, Wanxiang
Advances in Neural Information Processing Systems, 66357--66382, 2024.
Xu, Yuzhuang and Han, Xu and Yang, Zonghan and Wang, Shuo and Zhu, Qingfu and Liu, Zhiyuan and Liu, Weidong and Che, Wanxiang
Advances in Neural Information Processing Systems, 66357--66382, 2024.
Xu, Yuzhuang and Han, Xu and Yang, Zonghan and Wang, Shuo and Zhu, Qingfu and Liu, Zhiyuan and Liu, Weidong and Che, Wanxiang