Junyi (Kyle) Shu

Computer Science Department. UCLA.

404 Westwood Plaza

Engineering VI, Room 496

Los Angeles, CA 90095

I am a postdoc researcher at UCLA, working with Prof. Harry Xu. My research interests lie in building efficient data management systems in the cloud for evolving use cases (e.g., LLM serving). I obtained my Ph.D. degree from Peking University, where I was advised by Prof. Xin Jin and Prof. Xuanzhe Liu. Before that, I graduated from UC Berkeley with a Bachelor’s degree in Applied Math and Computer Science.

I am a foodie and enjoy making new dishes. Japanese, Italian, Vietnamese and Mediterranean are among my favorites. I am also a wine lover. I passed WSET Level 2 and may pursue Level 3 someday. I like Gewürztraminer, Sauvignon Blanc, Riesling, and Pinot Noir.

If you have questions regarding my research or want to discuss potential collaboration, please drop me an email.

news

Mar 26, 2026 Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning is accepted by OSDI ‘26.
Sep 28, 2025 Serverless Replication of Object Storage across Multi-Vendor Clouds and Regions is accepted by EuroSys ‘26.
May 22, 2025 I’ve successfully defended my PhD dissertation. I’m so grateful to everyone who has supported me on this journey.

selected publications

* = co-first author, # = corresponding author

  1. OSDI
    Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning
    Shan Yu, Yifan Qiao, Mingyuan Ma, Yangmin Li, Shuo Yang, Xinyuan Tong, Yang Wang, Zhiqiang Xie, Yuwei An, Shiyi Cao, Ke Bao, Deepak Vij, Xiaoning Ding, Yichen Wang, Qingda Lu, Zhong Wang, Gao Gao, Harry Xu, Junyi Shu#, Jiarong Xing#, and Ying Sheng#
    In 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI 26) 2026
  2. OSDI
    Burstable Cloud Block Storage with Data Processing Units
    Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, and Xin Jin#
    In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24) 2024
  3. ASPLOS
    Disaggregated RAID Storage in Modern Datacenters
    Junyi Shu, Ruidong Zhu, Yun Ma, Gang Huang#, Hong Mei, Xuanzhe Liu, and Xin Jin#
    In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3 2023