Penghui Qi

According to our database1, Penghui Qi authored at least 11 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Defeating the Training-Inference Mismatch via FP16.
CoRR, October, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.
CoRR, June, 2025

Optimizing Anytime Reasoning via Budget Relative Policy Optimization.
CoRR, May, 2025

Understanding R1-Zero-Like Training: A Critical Perspective.
CoRR, March, 2025

Balancing Pipeline Parallelism with Vocabulary Parallelism.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Zero Bubble Pipeline Parallelism.
CoRR, 2024

Pipeline Parallelism with Controllable Memory.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Zero Bubble (Almost) Pipeline Parallelism.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2021
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II.
Proceedings of the 38th International Conference on Machine Learning, 2021

2019
Artificial Intelligence for Prosthetics - challenge solutions.
CoRR, 2019


  Loading...