Publications

Conference Papers

CCF-A OSDI’26 Wei Gao, Yuheng Zhao, Tianyuan Wu, Shaopan Xiong, Weixun Wang, Dakai An, Lunxi Cao, Dilxat Muhtar, Zichen Liu, Haizhou Zhao, Ju Huang, Siran Yang, Yongbin Li, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng, Wei Wang, “RollArt: Disaggregated Multi-Task Agentic RL Training at Scale”, conditionally accepted to the 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI’26), Seattle, WA, USA, July 2026. (Equal contribution).

CCF-A NSDI’26 Wei Gao, Yuheng Zhao, Dakai An, Tianyuan Wu, Lunxi Cao, Shaopan Xiong, Ju Huang, Weixun Wang, Siran Yang, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng, Wei Wang, “RollPacker: Taming Long-Tail Rollouts for RL Post-Training with Tail Batching”, in the Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation (NSDI’26), Renton, WA, USA, May 2026. (Equal contribution).

CCF-A OSDI’26 Tianyuan Wu, Lunxi Cao, Yining Wei, Wei Gao, Yuheng Zhao, Dakai An, Shaopan Xiong, Zhiqiang Lv, Ju Huang, Siran Yang, Yinghao Yu, Jiamang Wang, Lin Qu, Wei Wang, “Weave: Efficient Co-Scheduling for Disaggregated RL Post-Training”, in the Proceedings of the 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI’26), Seattle, WA, USA, July 2026.

CCF-B DSN’23 Xiaoting Qin, Minghua Ma, Yuheng Zhao, Jue Zhang, Chao Du, Yudong Liu, Anjaly Parayil, Chetan Bansal, Saravan Rajmohan, Íñigo Goiri, Eli Cortez, Si Qin, Qingwei Lin, Dongmei Zhang, “How Different are the Cloud Workloads? Characterizing Large-Scale Private and Public Cloud Workloads”, in the Proceedings of the 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN’23), Porto, Portugal, June 2023.

Technical Reports

arXiv ROLL Team, “Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library”, June 2025.

Preprints

arXiv Wei Gao, Yuheng Zhao, Dilxat Muhtar, Dakai An, Xuchun Shang, Tianyuan Wu, Lunxi Cao, Shaopan Xiong, Weixun Wang, Ju Huang, Teng Ma, Siran Yang, Jiamang Wang, Lin Qu, Bo Zheng, Wei Wang, “ROSE: Rollout On Serving GPUs via Cooperative Elasticity for Agentic RL”, May 2026. (Equal contribution).

Preprint Yuheng Zhao, Suyi Li, Tianyuan Wu, Wei Gao, Guangzhen Chen, Jun Yang, Daohe Lu, Si Luo, Chengbo Li, Minchen Yu, Ruichuan Chen, Wei Wang, “ARTOS: Buffer-Centric Data Management for Elastic Reinforcement Learning”, June 2025. (Equal contribution).