A company is looking for a Kubernetes Sr. Technical Lead.
Key Responsibilities
Develop and maintain an inference platform for serving large language models optimized for various GPU platforms
Work on complex AI and cloud engineering projects through the entire product development lifecycle
Build tooling and observability to monitor system health and create benchmarking frameworks for model serving performance
Required Qualifications
Deep experience building services in modern cloud environments on distributed systems
Experience working with Large Language Models (LLMs), particularly in hosting them for inference
Experience with benchmarking tools for evaluating LLM inference performance
Familiarity with distributed inference serving frameworks and performance metrics
Experience with AMD and NVIDIA GPUs and related optimization techniques
Licensed Engineer • Oceanside, California, United States