Talent.com
Software Engineer Lead - Cloud Engineering
Software Engineer Lead - Cloud EngineeringKumo • Mountain View, CA, United States
Software Engineer Lead - Cloud Engineering

Software Engineer Lead - Cloud Engineering

Kumo • Mountain View, CA, United States
2 days ago
Job type
  • Full-time
Job description

About Kumo.ai

Kumo is building a next-generation AI platform that empowers organizations to make predictive decisions faster—without the overhead of traditional ML pipelines. Backed by Sequoia and led by ex-Airbnb, Pinterest, and LinkedIn leaders, we’re scaling rapidly and looking for a multi-cloud infrastructure leader to architect and run the backbone of our AI platform.

This is one of our most critical hires — your work will directly power the models and applications our customers rely on every day. If you’re passionate about multi-cloud infrastructure , Kubernetes at scale , and building the infrastructure that powers the next generation of AI applications — we’d love to talk.

Why Kumo.ai?

  • Work alongside world‑class engineers & scientists (ex-Airbnb, Pinterest, LinkedIn, Stanford).
  • Be a foundational voice in designing a platform powering enterprise‑scale AI.
  • Competitive Series B compensation package (salary + meaningful equity).

The Opportunity - The Cloud Infrastructure team is responsible for managing and scaling our Kubernetes‑based, multi‑cloud AI platform across AWS, Azure, and GCP.

  • You will own the architecture, scalability, security, and operational excellence of this platform, building the foundation that supports massive multi‑tenant clusters running Big Data and AI / ML workloads.
  • Lead our multi‑cloud expansion beyond AWS into Azure and GCP.
  • Drive the design and implementation of Kubernetes controllers, operators , and automation for scaling and reliability.
  • Implement Infrastructure as Code (Terraform, Pulumi, Crossplane) and GitOps practices to deliver commit‑to‑production automation at scale.
  • Partner closely with ML scientists, product engineers, and leadership to deliver self‑service tooling and optimize infrastructure for machine learning workloads.
  • You will be joining early enough to shape the architecture, culture, and processes that define our platform reliability and engineering velocity.
  • What You’ll Do

  • Architect & operate multi‑cloud infrastructure (AWS, Azure, GCP) to support large‑scale AI workloads.
  • Design, build and scale Kubernetes clusters (EKS, AKS, GKE, Open Source) for high availability, performance, and cost efficiency.
  • Build and maintain Kubernetes controllers, operators , and automation for cluster lifecycle management, scaling, and workload scheduling.
  • Implement observability at scale — metrics, logging, tracing — using tools like Prometheus, Grafana, and OpenTelemetry.
  • Lead IaC and GitOps automation, ensuring consistent, repeatable provisioning and deployment workflows.
  • Drive security and compliance policies (RBAC, tenant isolation, SOC2 / GDPR readiness) into platform design.
  • Partner with internal teams to enable self‑service cloud resources and smooth commit‑to‑production pipelines.
  • What You Bring

  • 8+ years building and operating cloud‑native infrastructure in production.
  • Proven multi‑cloud experience — designing and running workloads across AWS, Azure, and GCP.
  • Kubernetes expertise — 5+ years managing production clusters, with strong understanding of internals (schedulers, controllers, operators, CNI networking, security).
  • Infrastructure‑as‑Code mastery — Terraform, Pulumi, Crossplane, or similar.
  • GitOps and workflow automation experience (ArgoCD, Flux, Argo Workflows, or similar).
  • Strong skills in monitoring and performance tuning for distributed systems.
  • Proficiency in Go, Python, or Rust for automation tooling.
  • Nice to Have

  • Experience in optimizing, scaling, and maintaining multi‑tenanted AI / ML clusters across multiple cloud environments, ensuring high availability and performance.
  • Familiarity with compliance standards (SOC2, ISO27001, GDPR).
  • Contributions to open‑source cloud‑native projects .
  • Experience building customer‑facing APIs or developer tooling.
  • $175,000 - $250,000 a year

    We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

    #J-18808-Ljbffr

    Create a job alert for this search

    Software Engineer Cloud • Mountain View, CA, United States

    Related jobs
    Infrastructure Platform Engineering Lead for Software Development

    Infrastructure Platform Engineering Lead for Software Development

    OSI Engineering • Mountain View, CA, US
    Full-time
    We are seeking an experienced Infrastructure Engineer Lead to drive the design and implementation of robust security and identity management systems. In this role, you will architect and deploy Iden...Show more
    Last updated: 9 days ago • Promoted
    Cloud Engineer (AWS)

    Cloud Engineer (AWS)

    Medium • San Francisco, CA, United States
    Full-time
    Employment Type : Full-Time, Experienced.Department : Information technology.We are seeking a Cloud Engineer (AWS) who will be responsible for supporting the development of all required documentation...Show more
    Last updated: 20 days ago • Promoted
    Cloud and Storage Engineer

    Cloud and Storage Engineer

    Medium • San Francisco, CA, United States
    Full-time
    Employment Type : Full-Time, Experienced.Department : Information technology.CGS is seeking a Cloud and Storage Engineer to develop and implement full-scale Storage Area Network (SAN) architecture fo...Show more
    Last updated: 20 days ago • Promoted
    Cloud Engineer

    Cloud Engineer

    Global Modern Services, Inc. (USA) • San Francisco, CA, United States
    Full-time
    It's fun to work in a company where people truly.We're committed to bringing passion and customer focus to the business.If you like wild growth and working with happy, enthusiastic over-achievers, ...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineering Manager, Google Cloud Dataproc, Open Source

    Software Engineering Manager, Google Cloud Dataproc, Open Source

    Google Inc. • Sunnyvale, CA, United States
    Full-time
    Google place Sunnyvale, CA, USA.Like Google's own ambitions, the work of a Software Engineer goes beyond just Search.Software Engineering Managers have not only the technical expertise to take on a...Show more
    Last updated: 24 days ago • Promoted
    Backend Software Engineer, Cloud Control Plane

    Backend Software Engineer, Cloud Control Plane

    Crusoe • Sunnyvale, CA, US
    Full-time
    Crusoe's mission is to accelerate the abundance of energy and intelligence.We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrif...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Cloud Platform

    Senior Software Engineer, Cloud Platform

    Chef Robotics, Inc. • San Francisco, CA, United States
    Full-time
    Chef Robotics is on a mission to accelerate the advent of intelligent machines in the physical world.As the rise of LLMs like ChatGPT has shown, AI has the potential to drive immense change.However...Show more
    Last updated: 13 days ago • Promoted
    Software Engineer - Cloud Engineering

    Software Engineer - Cloud Engineering

    Kumo • Mountain View, CA, US
    Full-time
    Kumo is building a next-generation AI platform that empowers organizations to make predictive decisions faster—without the overhead of traditional ML pipelines. Backed by Sequoia and led by ex...Show more
    Last updated: 14 days ago • Promoted
    Lead Cloud Engineer

    Lead Cloud Engineer

    Mill • San Bruno, CA, US
    Full-time
    Mill is all about answering a simple question : how can we prevent waste? Less waste can save time, money, energy, maybe even our planet. And there's no better place to start than food.Food waste...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Platform Engineer

    Cloud Platform Engineer

    ClassDojo • San Francisco, CA, United States
    Full-time
    ClassDojo's goal is to give every child on Earth an education they love.We started by building a powerful network for communication. ClassDojo’s flagship app is the #1 communication app connecting K...Show more
    Last updated: 23 days ago • Promoted
    Software Engineer, Cloud Infrastructure

    Software Engineer, Cloud Infrastructure

    Monograph • San Francisco, CA, United States
    Full-time
    San Francisco, California, New York, New York.We're on a mission to make it possible for every person, team, and company to be able to tailor their software to solve any problem and take on any cha...Show more
    Last updated: less than 1 hour ago • Promoted • New!
    Staff Software Engineer, Site Reliability Engineering, Google Cloud

    Staff Software Engineer, Site Reliability Engineering, Google Cloud

    Google Inc. • San Francisco, CA, United States
    Full-time
    Staff Software Engineer, Site Reliability Engineering, Google Cloud.X Applicants in San Francisco : Qualified applications with arrest or conviction records will be considered for employment in acco...Show more
    Last updated: 13 days ago • Promoted
    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Epoch Biodesign • San Francisco, CA, United States
    Full-time
    We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe’s next-generation observability ...Show more
    Last updated: 26 days ago • Promoted
    Lead Platform / Cloud Engineer

    Lead Platform / Cloud Engineer

    salesforce.com, inc. • San Francisco, CA, United States
    Full-time
    Salesforce is the #1 AI CRM, where humans with agents drive customer success together.And innovation isn't a buzzword - it's a way of life. The world of work as we know it is changing and we're look...Show more
    Last updated: 10 days ago • Promoted
    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Crusoe Energy Systems LLC • San Francisco, CA, United States
    Full-time
    We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe’s next-generation observability ...Show more
    Last updated: 28 days ago • Promoted
    Senior Cloud Engineer

    Senior Cloud Engineer

    Nuon Inc. • San Francisco, CA, United States
    Full-time
    As a Senior Software Engineer, Cloud at Nuon, you will be responsible for building and maintaining features to manage cloud infrastructure across multiple platforms. You should have extensive backen...Show more
    Last updated: 1 day ago • Promoted
    Lead Platform / Cloud Engineer

    Lead Platform / Cloud Engineer

    Salesforce • San Francisco, CA, United States
    Full-time
    To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Salesforce is the #1 AI CRM, where humans with age...Show more
    Last updated: 13 days ago • Promoted
    Amazon Dedicated Cloud Engineer II, Platform Engineering & Emerging Technology

    Amazon Dedicated Cloud Engineer II, Platform Engineering & Emerging Technology

    Amazon • San Francisco, CA, United States
    Full-time
    Job ID : 3115003 | Amazon Development Center U.Would you like to implement innovative cloud computing solutions and solve the world's most complex technical problems? Do you have a deep passion and ...Show more
    Last updated: 1 day ago • Promoted