Xu Dongliang

Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 6816--6831, 2025.

Luo, Xianzhen and Wang, Yixuan and Zhu, Qingfu and Zhang, Zhiming and Zhang, Xuanyu and Yang, Qing and Xu, Dongliang

Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 6816--6831, 2025.

Luo, Xianzhen and Wang, Yixuan and Zhu, Qingfu and Zhang, Zhiming and Zhang, Xuanyu and Yang, Qing and Xu, Dongliang

Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 12914--12926, 2024.

Wang, Yixuan and Luo, Xianzhen and Wei, Fuxuan and Liu, Yijun and Zhu, Qingfu and Zhang, Xuanyu and Yang, Qing and Xu, Dongliang and Che, Wanxiang

Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 12914--12926, 2024.

Wang, Yixuan and Luo, Xianzhen and Wei, Fuxuan and Liu, Yijun and Zhu, Qingfu and Zhang, Xuanyu and Yang, Qing and Xu, Dongliang and Che, Wanxiang