OneBit: Towards Extremely Low-bit Large Language Models
Advances in Neural Information Processing Systems, 66357--66382, 2024.
Xu, Yuzhuang and Han, Xu and Yang, Zonghan and Wang, Shuo and Zhu, Qingfu and Liu, Zhiyuan and Liu, Weidong and Che, Wanxiang