The School of Computing and Data Science (https://www.cds.hku.hk/) was established by the University of Hong Kong on 1 July 2024, comprising the Department of Computer Science and Department of Statistics and Actuarial Science and Department of AI and Data Science.

Abstract

Large language models (LLMs) are profoundly reshaping the global economies and technology. Efficient systems for LLM pre-training are essential because they directly impact model quality, operational costs, and environmental sustainability. In this talk, Zhuang will present two system research projects designed to tackle fundamental communication challenges within LLM pre-training. ZEN (OSDI ‘25) addresses data plane challenges by optimizing synchronization strategies for sparse tensor communications. GEMINI (SOSP ’23) focuses on the management plane by redesigning the checkpoint storage system engineered to minimize failure recovery overheads.

About the speaker

Zhuang Wang is a Senior Applied Scientist at Amazon Annapurna Labs. He earned his Ph.D. in Computer Science from Rice University in 2023. His current research interests focus on efficient training and inference systems for large language models. He has published papers as the first author in prestigious venues including OSDI, SOSP, SIGCOMM, and EuroSys. Zhuang has served on the Program Committee for OSDI, ATC, and MLSys.

 

 

Division of Computer Science,
School of Computing and Data Science

Rm 207 Chow Yei Ching Building
The University of Hong Kong
Pokfulam Road, Hong Kong
香港大學計算與數據科學學院, 計算機科學系
香港薄扶林道香港大學周亦卿樓207室

Email: csenq@hku.hk
Telephone: 3917 3146

Copyright © School of Computing and Data Science, The University of Hong Kong. All rights reserved.
Don't have an account yet? Register Now!

Sign in to your account