Follow
Yizhou Shan
Yizhou Shan
Huawei Cloud
Verified email at ucsd.edu - Homepage
Title
Cited by
Cited by
Year
LegoOS: A disseminated, distributed OS for hardware resource disaggregation
Y Shan, Y Huang, Y Chen, Y Zhang
Best Paper at OSDI 2018 (USENIX Symposium on Operating Systems Design and …, 2018
4152018
Distributed Shared Persistent Memory
Y Shan, SY Tsai, Y Zhang
Proceedings of the 2017 Symposium on Cloud Computing (SoCC 2017), 2017
1632017
Disaggregating Persistent Memory and Controlling Them Remotely: An Exploration of Passive Disaggregated Key-Value Stores
SY Tsai, Y Shan, Y Zhang
2020 USENIX Annual Technical Conference (ATC 2020), 2020
1372020
Clio: A hardware-software co-designed disaggregated memory system
Z Guo, Y Shan, X Luo, Y Huang, Y Zhang
Proceedings of the 27th ACM International Conference on Architectural …, 2022
1242022
Storm: a fast transactional dataplane for remote data structures
S Novakovic, Y Shan, A Kolli, M Cui, Y Zhang, H Eran, B Pismenny, L Liss, ...
Best Paper at SYSTOR 2019 (Proceedings of the 12th ACM International …, 2019
772019
Towards a fully disaggregated and programmable data center
Y Shan, W Lin, Z Guo, Y Zhang
Proceedings of the 13th ACM SIGOPS Asia-Pacific Workshop on Systems, 18-28, 2022
172022
Inference without interference: Disaggregate llm inference for mixed downstream workloads
C Hu, H Huang, L Xu, X Chen, J Xu, S Chen, H Feng, C Wang, S Wang, ...
arXiv preprint arXiv:2401.11181, 2024
162024
HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory
H Li, K Liu, T Liang, Z Li, T Lu, H Yuan, Y Xia, Y Bao, M Chen, Y Shan
2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023
112023
Core slicing: closing the gap between leaky confidential {VMs} and bare-metal cloud
Z Zhou, Y Shan, W Cui, X Ge, M Peinado, A Baumann
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023
82023
Disaggregating and consolidating network functionalities with supernic
Y Shan, W Lin, R Kosta, A Krishnamurthy, Y Zhang
arXiv preprint arXiv:2109.07744, 2021
82021
The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving
P Zeng, Z Ning, J Zhao, W Cui, M Xu, L Guo, X Chen, Y Shan
arXiv preprint arXiv:2405.11299, 2024
42024
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference
S Li, H Lu, T Wu, M Yu, Q Weng, X Chen, Y Shan, B Yuan, W Wang
arXiv preprint arXiv:2401.11240, 2024
32024
Skadi: Building a distributed runtime for data systems in disaggregated data centers
C Hu, C Wang, S Wang, N Sun, Y Bao, J Zhao, S Kashyap, P Zuo, X Chen, ...
Proceedings of the 19th Workshop on Hot Topics in Operating Systems, 94-102, 2023
32023
Reinvent Cloud Computing Systems for Resource Disaggregation
C Wang, Y Shan, P Zuo, H Cui
Journal of Computer Science and Technology, 2023
2*2023
Disaggregated Operating System
Y Shan, S Hallymysore, Y Huang, Y Chen, Y Zhang
Poster at SoCC 2017, 2017
22017
MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool
C Hu, H Huang, J Hu, J Xu, X Chen, T Xie, C Wang, S Wang, Y Bao, ...
arXiv preprint arXiv:2406.17565, 2024
12024
SuperNIC: An FPGA-Based, Cloud-Oriented SmartNIC
W Lin, Y Shan, R Kosta, A Krishnamurthy, Y Zhang
Proceedings of the 2024 ACM/SIGDA International Symposium on Field …, 2024
12024
Distributing and Disaggregating Hardware Resources in Data Centers
Y Shan
University of California, San Diego, 2022
12022
Lego: A Distributed, Decomposed OS for Resource Disaggregation
Y Shan, Y Chen, Y Huang, S Hallymysore, Y Zhang
Poster at SOSP 2017, 2017
12017
InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
X Pan, E Li, Q Li, S Liang, Y Shan, K Zhou, Y Luo, X Wang, J Zhang
arXiv preprint arXiv:2409.04992, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–20