University of Connecticut | Focusing on Machine Learning Systems, LLM Infrastructure, System Security, and Disaggregated Memory
Connecticut, USA
Open to Collaboration! If you're interested in MLSys, Multi-Agent Systems, Disaggregated Memory, or Security, feel free to reach out; let's build something exciting together!
Efficient inference, KV-cache optimization, and serving systems for large language models
Security and privacy in ML systems, timing side-channel mitigation
RDMA, disaggregated memory, CXL, and memory-tiered architectures
Multi-tier KV cache management system for efficient large language model inference.
Read Paper →
Runtime system for MoE inference that combines adaptive expert prefetching and cache-aware routing to optimize inference under memory constraints.
Read Paper →
Security-focused approach to prevent timing attacks in LLM serving systems.
Read Paper →
eBPF-based tracing framework for distributed LLM inference systems.
Read Paper →
Privacy-preserving KV-cache sharing mechanism for multi-tenant LLM serving.
Paper | Presentation
University of Connecticut, USA | 2024 - Present
Research Focus: ML Systems, KV-cache optimization, RDMA-backed storage, and disaggregated memory architectures.
Predoctoral Fellowship Recipient
Hefei University of Technology, China
Co-supervised by Assoc. Prof. Ying Wang and Assoc. Prof. Cheng Liu
Specialized in computer architecture and AI acceleration
Hefei University of Technology, China
Foundation in digital circuit design, computer organization, and system integration
National Scholarship Recipient (2018, 2019)
Baidu Inc., Beijing, China
Department: Search R&D Platform - Focus on large-scale backend systems
Career Progression: T3 → T4 (2021) → T5 (2023)
Key Contributions:
University of Connecticut | 2025
Baidu Inc. | 2022
China Ministry of Education | 2018, 2019
Interested in collaboration or have questions about my research?
Send me an email →