Kris Young

👋 Kexin Chu

Ph.D. Student in Computer Science

University of Connecticut | Focusing on Machine Learning Systems, LLM Infrastructure, System Security, and Disaggregated Memory

📍 Connecticut, USA

💡 Open to Collaboration! If you're interested in MLSys, Multi-Agent Systems, Disaggregated Memory, or Security, feel free to reach out. Let's build something exciting together!

8+ Publications
4+ Years Industry Experience
3 Top Conferences

🧪 Research Interests

🤖 LLM Systems

Efficient inference, KV-cache optimization, and serving systems for large language models

🔒 System Security

Security and privacy in ML systems, timing side-channel mitigation

💾 Memory Systems

RDMA, disaggregated memory, CXL, and memory-tiered architectures


📄 Selected Publications

MCaM: Efficient LLM Inference with Multi-tier KV Cache Management

ICDCS 2025 | Conference Paper

Multi-tier KV cache management system for efficient large language model inference.

📄 Read Paper →

ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference

arXiv | MoE Systems

Runtime system for MoE inference that combines adaptive expert prefetching and cache-aware routing to optimize inference under memory constraints.

📄 Read Paper →

Selective KV-Cache Sharing to Mitigate Timing Side-Channels in LLM Inference

Security | arXiv

Security-focused approach to prevent timing attacks in LLM serving systems.

📄 Read Paper →

eInfer: Unlocking Fine-Grained Tracing for Distributed LLM Inference with eBPF

eBPF 2025 Workshop

eBPF-based tracing framework for distributed LLM inference systems.

📄 Read Paper →

SafeKV: Safe KV-Cache Sharing in LLM Serving

MLArchSys 2025 | ISCA 2025 | Security

Privacy-preserving KV-cache sharing mechanism for multi-tenant LLM serving.

📄 Paper | 🎥 Presentation

🎓 Education

Ph.D. in Computer Science

University of Connecticut, USA | 2024 - Present

Research Focus: ML Systems, KV-cache optimization, RDMA-backed storage, and disaggregated memory architectures.
💰 Predoctoral Fellowship Recipient

M.S. in Integrated Circuit Engineering

Hefei University of Technology, China

Co-supervised by A.P. Ying Wang and A.P. Cheng Liu
Specialized in computer architecture and AI acceleration

B.S. in Integrated Circuit Design & Integrated System

Hefei University of Technology, China

Foundation in digital circuit design, computer organization, and system integration
🏆 National Scholarship Recipient (2018, 2019)


💼 Industry Experience

Software Architect & Backend Engineer

Baidu Inc., Beijing, China

2020 - 2024

Department: Search R&D Platform - Focus on large-scale backend systems

🚀 Career Progression: T3 → T4 (2021) → T5 (2023)

Key Contributions:

C++, Golang, brpc, Kafka, Redis, MySQL, Distributed Systems

🏆 Honors & Awards

🏅 Predoctoral Fellowship

University of Connecticut

2025

🏅 Baidu Pride Special Award

Baidu Inc.

2022

🎓 National Scholarship

China Ministry of Education

2018, 2019


🛠️ Technical Skills

Programming Languages

C++, Python, Golang, C, Rust, CUDA

ML/AI Frameworks & Tools

PyTorch, vLLM, TensorFlow, Triton, DeepSpeed, Ray, LangChain

Systems & Infrastructure

RDMA, Linux Kernel, eBPF, Distributed Systems, Kubernetes, Docker, CXL

Databases & Message Queues

Redis, MySQL, Kafka, PostgreSQL, MongoDB

Research Areas

LLM Serving, KV-Cache Optimization, System Security, Memory Systems, Computer Architecture, Performance Optimization

📬 Get In Touch

Interested in collaboration or have questions about my research?

Send me an email →