Junyi (Kyle) Shu

Computer Science Department, UCLA

404 Westwood Plaza

Engineering VI, Room 496

Los Angeles, CA 90095

I am currently a postdoctoral researcher at UCLA working with Prof. Harry Xu. My mission is to build highly efficient, scalable infrastructure for next-generation AI. My technical foundation stems from rigorous systems research: I earned my Ph.D. from Peking University, where I was advised by Prof. Xin Jin and Prof. Xuanzhe Liu. Before that, I graduated from UC Berkeley with a Bachelor’s degree in Applied Math and Computer Science.

Beyond academic research, my career so far has been defined by bridging deep technical execution with cross-functional leadership. I have engineered systems at AWS and collaborated closely with diverse cloud infrastructure teams at Alibaba Cloud, spanning network, storage, and operating systems, so I understand how to align complex technologies and coordinate with experts across multiple domains. Complementing this technical expertise, I have hands-on leadership experience directing engineering teams and managing various functions at early-stage startups.

news

Mar 26, 2026 Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning has been accepted to OSDI ’26.
Sep 28, 2025 Serverless Replication of Object Storage across Multi-Vendor Clouds and Regions has been accepted to EuroSys ’26.
May 22, 2025 I’ve successfully defended my PhD dissertation. I’m so grateful to everyone who has supported me on this journey.

selected publications

* = co-first author, # = corresponding author

  1. OSDI
    Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning
    Shan Yu, Yifan Qiao, Mingyuan Ma, Yangmin Li, Shuo Yang, Xinyuan Tong, Yang Wang, Zhiqiang Xie, Yuwei An, Shiyi Cao, Ke Bao, Deepak Vij, Xiaoning Ding, Yichen Wang, Qingda Lu, Zhong Wang, Gao Gao, Harry Xu, Junyi Shu#, Jiarong Xing#, and Ying Sheng#
    In 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI 26) 2026
  2. OSDI
    Burstable Cloud Block Storage with Data Processing Units
    Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, and Xin Jin#
    In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24) 2024
  3. ASPLOS
    Disaggregated RAID Storage in Modern Datacenters
    Junyi Shu, Ruidong Zhu, Yun Ma, Gang Huang#, Hong Mei, Xuanzhe Liu, and Xin Jin#
    In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3 2023