Junyi (Kyle) Shu
404 Westwood Plaza
Engineering VI, Room 496
Los Angeles, CA 90095
I am currently a postdoc researcher at UCLA working with Prof. Harry Xu. My mission is to build highly efficient, scalable infrastructure for next-generation AI. My technical foundation stems from rigorous systems research, having earned my Ph.D. from Peking University advised by Prof. Xin Jin and Prof. Xuanzhe Liu. Before that, I graduated from UC Berkeley with a Bachelor’s degree in Applied Math and Computer Science.
Beyond academic research, my career so far is defined by bridging deep technical execution with cross-functional leadership. Having engineered systems at AWS and collaborated closely with diverse cloud infrastructure teams—spanning network, storage, and operating systems—at Alibaba Cloud, I deeply understand how to align complex technologies and coordinate with experts across multiple domains. Complementing this technical expertise, I have hands-on leadership experience directing engineering teams and managing various functions of early-stage startups.
news
| Mar 26, 2026 | Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning is accepted by OSDI ’26. |
|---|---|
| Sep 28, 2025 | Serverless Replication of Object Storage across Multi-Vendor Clouds and Regions is accepted by EuroSys ’26. |
| May 22, 2025 | I’ve successfully defended my PhD dissertation. I’m so grateful to everyone who has supported me on this journey. |
selected publications
* = co-first author, # = corresponding author