I design and run production ML infrastructure at scale—
from training pipelines to real-time inference.
MLOps, SRE, and cloud-native systems.
Deep experience in Kubernetes and GPUs.
Making AI faster, cheaper, and boringly reliable.