Ph.D. student ยท University of Connecticut
Systems for AI that hold up under real workloads.
I work on machine learning systems, with interests in LLM serving, hybrid retrieval, secure inference, and trustworthy AI infrastructure.
My goal is to build systems that are not only fast, but also robust, measurable, and useful in realistic deployment settings.
LLM serving
Efficient inference, KV-cache management, MoE serving, and verification-aware runtime systems.
Hybrid retrieval
Coordinating SQL, vector search, filtering, and execution policy under real workload constraints.
Trustworthy AI systems
Secure inference, attestation, and recoverability-aware infrastructure for dependable AI deployment.