About Me
Hi I am Haley Li
I currently work on large language model inference optimization at Huawei Canada. Previously, I completed my M.Sc in computer science at the University of British Columbia applying reinforcement learning to load balancing and scheduling for serverless and search engines.
Experience
Senior Research Engineer, Huawei Canada
2025/02 — Present
Intelligent Cloud Infrastructure Lab. Large language model inference optimization. MoE-Attention disaggregation. Fault tolerant inference serving. Asynchronous reinforcement fine-tuning for large-scale MoE.
Research Assistant, University of British Columbia
2021/09 — 2023/12
Systopia Lab. Performed research in the intersection of systems and machine learning.
AI Researcher Intern, Huawei Canada
2022/05 — 2022/09
Big Data and Intelligence Platform Lab. Fine-tuning language models for linear programming solver workflows.
Undergraduate Researcher, University of British Columbia
2020/05 — 2021/08
Systopia Lab. Performed research in cloud schedulers.
Developer, Hypatia Systems
2019/01 — 2020/01
Primarily worked on frontend development on an Electron-based LaTeX editing application.
Education
M.Sc Computer Science, University of British Columbia
2021/09 — 2024/08
Machine learning and systems
B.Sc Computer Science and Mathematics, University of British Columbia
2016/09 — 2021/04