U.S. GenAI startup, Cambridge Office
Full Time Employment with Blitzy. We are committed to building a transformative AI platform that revolutionizes software development. Our goal is to enable you to have a long, impactful career at Blitzy with opportunity for advancement. If you want a role where you can shape the future of AI-powered infrastructure, read on!
About Blitzy
Blitzy is a Boston, MA based Generative AI Start-up on a mission to automate custom software creation to unlock the next industrial revolution. We're building an AI-powered platform capable of autonomously generating enterprise-grade software, powered by thousands of cooperative AI agents working in concert.
We're backed by multiple tier 1 investors, have success as founders at our previous start-up, and hold dozens of Generative AI patents.
Compensation : $140,000 - $180,000 / year
Location : 1 Kendall Square, Cambridge, MA (In-person role)
About the Role
We're looking for an exceptional DevOps Engineer to architect and maintain the infrastructure that powers our revolutionary AI agent ecosystem. You'll be instrumental in building scalable, resilient systems that support both our cutting-edge AI platform and modern applications. This role offers the unique opportunity to work at the intersection of traditional DevOps and emerging AI infrastructure, creating systems that enable thousands of AI agents to collaborate seamlessly.
As our DevOps Engineer, you'll take ownership of our entire infrastructure stack, from Kubernetes orchestration to AI agent deployment pipelines. You'll work directly with our engineering teams to ensure our platform can scale to support enterprise customers while maintaining the performance and reliability they demand.
What Success Looks Like
- You architect and implement robust Kubernetes infrastructure that scales effortlessly to support our growing AI agent ecosystem
- You create sophisticated CI / CD pipelines that enable rapid, reliable deployment of both traditional services and AI agents
- You develop Python-based automation that eliminates manual tasks and accelerates our development velocity
- You design monitoring and observability systems that provide deep insights into both infrastructure and AI agent performance
- You optimize our cloud infrastructure for cost-efficiency while maintaining enterprise-grade reliability
- You collaborate effectively with development teams to improve developer experience and productivity
- You proactively identify and resolve infrastructure bottlenecks before they impact customers
- You establish infrastructure best practices that support our rapid growth
- You build systems that can handle the unique challenges of AI workloads at scale
- You maintain 99.9%+ uptime for critical production services
Areas of Ownership
Core Infrastructure :
Kubernetes cluster design, deployment, and management for AI and application workloadsInfrastructure as Code using Terraform for multi-cloud environmentsContainer orchestration and optimization for AI agent deploymentNetwork architecture and security for distributed systemsAutomation & Tooling :
Python-based automation scripts for infrastructure managementHelm chart development and maintenance for application deploymentCI / CD pipeline design using modern DevOps toolsDeveloper productivity tooling and automationMonitoring & Reliability :
Comprehensive monitoring, alerting, and tracing systemsPerformance optimization for AI workloadsIncident response and disaster recovery planningCost optimization and resource managementAI Infrastructure (Unique to Blitzy) :
Infrastructure for AI agent orchestration and managementMLOps pipeline integrationScalable systems for handling AI model deploymentResource optimization for GPU / compute-intensive workloadsRequired Technical Experience
5-8 years of DevOps / Infrastructure experienceExpert-level Python proficiency for automation and scriptingDeep Kubernetes expertise : deployment, scaling, troubleshooting, and optimizationStrong experience with Helm for application package managementProven track record designing and implementing CI / CD pipelinesHands-on experience with major cloud platforms (AWS, Azure, or GCP)Terraform expertise for Infrastructure as CodeStrong Linux administration and containerization (Docker) skillsExperience with monitoring tools (Prometheus, Grafana, ELK stack)Understanding of microservices architecture and distributed systemsWays to Stand Out
CKA (Certified Kubernetes Administrator) or CKAD certificationExperience with MLOps tools (MLflow, Kubeflow, Ray, etc.)Knowledge of AI / ML infrastructure requirements and optimizationExperience with GPU orchestration and managementAPI gateway and service mesh implementation (Istio, Linkerd)GitOps experience (ArgoCD, Flux)Experience scaling infrastructure for high-growth startupsContributions to open-source infrastructure projectsExperience with multi-region, highly available deploymentsBackground in security and compliance (SOC2, HIPAA)You'll Get...
Competitive SalaryComprehensive health, dental, and vision insurance401(k) with company matchFlexible PTO policy$5,000 annual professional development budgetLatest hardware and software toolsThe opportunity to shape infrastructure for the future of software developmentWork with cutting-edge AI technology and world-class engineersModern office in Cambridge's innovation hubRegular team events and activitiesThe chance to solve novel infrastructure challenges at the intersection of DevOps and AICulture
Who we are : Our founding team consists of a Serial Gen AI Inventor and a successful Serial Entrepreneur. We work hard, maintain a curious mindset, and believe in a low-ego, high-output approach.
We move Blitzy Fast. Time is our most precious asset. We make decisions quickly and iterate rapidly, believing that a good decision today beats a perfect decision next week.
We have a Championship Mindset. We operate like a professional team - winning together by maintaining high standards, supporting each other, and staying laser-focused on our mission.
We have a Passion for Invention. As technologists pushing the boundaries of what's possible with AI, we thrive on solving problems that haven't been solved before.
What We Ask of You
This role requires someone who thrives in ambiguity and loves tackling unprecedented challenges. You'll be building infrastructure for a type of platform that's never existed before - one where thousands of AI agents collaborate to write software. This means being comfortable with rapid change, continuous learning, and creative problem-solving.
You should be excited about working independently while collaborating in-person with our team at our Cambridge headquarters. The ability to communicate complex technical concepts clearly and work effectively with both technical and non-technical stakeholders is essential.
To Apply
Apply with your resume and a brief note about :
Your most challenging infrastructure project and how you solved itWhy you're excited about building infrastructure for AI-powered software developmentInterview Process
Here's what you can expect :
Initial screening call (30 minutes)Technical discussion with Hao (45 minutes)Deep dive system design with Chaitanya (60 minutes)Final conversation with leadership (45 minutes)Offer discussionBlitzy is an equal opportunity employer committed to building a diverse and inclusive team.
PIe2a99f5dbc39-30511-39048140