publications

* = co-first author, # = corresponding author

2026

  1. OSDI
    Prism: Cost-Efficient Multi-LLM Serving via GPU Memory Ballooning
    Shan Yu, Yifan Qiao, Mingyuan Ma, Yangmin Li, Shuo Yang, Xinyuan Tong, Yang Wang, Zhiqiang Xie, Yuwei An, Shiyi Cao, Ke Bao, Deepak Vij, Xiaoning Ding, Yichen Wang, Qingda Lu, Zhong Wang, Gao Gao, Harry Xu, Junyi Shu#, Jiarong Xing#, and Ying Sheng#
    In 20th USENIX Symposium on Operating Systems Design and Implementation (OSDI 26) 2026
  2. EuroSys
    Serverless Replication of Object Storage across Multi-Vendor Clouds and Regions
    Junyi Shu, Xiaolong Huang, Gang Huang, Hong Mei, Xuanzhe Liu, and Xin Jin#
    In Proceedings of the 21st European Conference on Computer Systems 2026

2024

  1. OSDI
    Burstable Cloud Block Storage with Data Processing Units
    Junyi Shu, Kun Qian, Ennan Zhai, Xuanzhe Liu, and Xin Jin#
    In 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24) 2024

2023

  1. ASPLOS
    Disaggregated RAID Storage in Modern Datacenters
    Junyi Shu, Ruidong Zhu, Yun Ma, Gang Huang#, Hong Mei, Xuanzhe Liu, and Xin Jin#
    In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3 2023

2021

  1. SIGCOMM
    Cost-Effective Data Analytics across Multiple Cloud Regions
    Junyi Shu, Xin Jin, Yun Ma, Xuanzhe Liu, and Gang Huang
    In Proceedings of the SIGCOMM ’21 Poster and Demo Sessions 2021