About Baseten
At Baseten, we provide the essential infrastructure, tools, and expertise needed to quickly bring exceptional AI products to market. Backed by prominent investors such as IVP, Spark Capital, Greylock, and Conviction, we are trusted by leading AI innovators like Writer, Abridge, Bland, Patreon, Descript, Retool, and Zed to ensure industry-leading performance, security, and reliability for their critical workloads. With our recent $75M Series C funding, we are rapidly growing to make AI accessible across all products.
The Role
As a Senior Software Engineer focused on ML Infrastructure at Baseten, you will take the lead in architecting and developing our machine learning inference platform that powers production AI applications. You will be instrumental in making key technical decisions related to the infrastructure that enables developers to effectively deploy, scale, and monitor ML models with optimal performance and reliability.
Example Initiatives
- Managing multi-cloud capacity effectively
- Implementing inference on B200 GPUs
- Facilitating multi-node inference
- Utilizing fractional H100 GPUs for efficient model serving
Responsibilities
Design and build scalable infrastructure systems for our ML inference platformOptimize Kubernetes deployments for cost-effective and efficient model servingEnhance our inference orchestration layer for complex model deploymentsEstablish monitoring strategies for model performance, latency, and resource utilizationDevelop advanced solutions for GPU capacity management and throughput optimizationSet infrastructure automation standards to streamline ML deployment processesCollaborate with other engineers to convert complex inference requirements into practical technical solutionsMake critical architectural decisions while balancing performance and system reliabilityLead technical discussions and mentor junior engineers on best practices in infrastructureContribute to the long-term technical strategy and infrastructure roadmapRequirements
Bachelor's degree or higher in Computer Science or a related field5+ years of experience in building production infrastructure systemsExpert-level proficiency in Go, with Python experience being a plusDeep expertise in Kubernetes within production environmentsExtensive experience with major cloud providers (AWS, GCP) and neo-cloud providers (Crusoe, DigitalOcean, Nebius) is a plusAdvanced understanding of distributed systems concepts and performance tuningProven experience in designing observability systemsA track record of leading technical initiatives and mentoring engineersExperience with ML / AI workloads and MLOps platforms is highly valuedBenefits
Competitive compensation package including flexible PTO, 401k, and covered healthcare premiumsAn extraordinary opportunity to be part of a rapidly growing startup in one of the most exciting engineering fields todayAn inclusive and supportive work culture that encourages learning and growthExposure to a variety of ML startups, facilitating unparalleled learning and networking opportunitiesApply now to embark on a rewarding journey to shape the future of AI! If you are a motivated individual with a passion for machine learning and a desire to join a collaborative and forward-thinking team, we would love to hear from you.
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.