About Me

Hi I am Haley Li

I currently work on large language model inference optimization at Huawei Canada. Previously, I completed my M.Sc in computer science at the University of British Columbia applying reinforcement learning to load balancing and scheduling for serverless and search engines.

Experience

Senior Research Engineer, Huawei Canada

2025/02 — Present

Intelligent Cloud Infrastructure Lab. Large language model inference optimization. MoE-Attention disaggregation. Fault tolerant inference serving. Asynchronous reinforcement fine-tuning for large-scale MoE.

Research Assistant, University of British Columbia

2021/09 — 2023/12

Systopia Lab. Performed research in the intersection of systems and machine learning.

AI Researcher Intern, Huawei Canada

2022/05 — 2022/09

Big Data and Intelligence Platform Lab. Fine-tuning language models for linear programming solver workflows.

Undergraduate Researcher, University of British Columbia

2020/05 — 2021/08

Systopia Lab. Performed research in cloud schedulers.

Developer, Hypatia Systems

2019/01 — 2020/01

Primarily worked on frontend development on an Electron-based LaTeX editing application.

Education

M.Sc Computer Science, University of British Columbia

2021/09 — 2024/08

Machine learning and systems

B.Sc Computer Science and Mathematics, University of British Columbia

2016/09 — 2021/04