Research Summary
I have broad interests in computer architecture and systems. My research focuses on more efficient and resilient system architecture design. My current focused topics include:
- Large-scale Datacenter Optimization: How to enhance the shared datacenters in …
- LLM serving efficiency on scalable CPU with matrix acceleration units. [In submission]
- resource visibility in shared-state scheduler architecture. [SoCC'23]
- request visibility of microservice-level parallelism. [IPDPS'22]
- power-aware management of serverless computing systems. [IPDPS'23, CCGRID'24, SCIS'25]
- Efficient Management Facilities: How to eliminate the additional costs of …
- intra-service tracing facility for cloud datacenters. [In submission]
- power management facility for autonomous embedded systems with shadow cycles. [ICCD'24, TACO'24]
- resource management facility for autonomous embedded systems. [Ongoing, RTSS'24]
- Resilient Architecture Design: How to design low-cost hardware fault tolerance architecture for complex LLM training and serving workloads in future AI infrastructure? [Ongoing]