Design and Operation of Shared Machine Learning Clusters on Campus (ASPLOS 2025)
Towards Domain-Specific Network Transport for Distributed DNN Training (NSDI 2024)
Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)
An Efficient Multi-Level Inference System for Large Language Models (EuroSys 2023)