Talent.com
Director, Site Reliability Engineering - Infrastructure Platform

Director, Site Reliability Engineering - Infrastructure Platform

Okta for DevelopersSan Francisco, CA, United States
5 hours ago
Job type
  • Permanent
Job description

Director, Site Reliability Engineering - Infrastructure Platform

Join as the Director of Infrastructure Platform and Shared Services at Okta for Developers. Oversee multiple teams focused on Edge networking, Kubernetes platform, CI / CD, observability, and automation platform & tooling.

What Youll Be Doing

  • Lead the Infra platform and shared services organization and various initiatives across the SRE & Infrastructure organization.
  • Lead the DevOps transformation, microservice journey, and next-generation Infra platform capabilities in partnership with architects and product engineering.
  • Build a world-class observability platform and monitoring capabilities enabled with self-service.
  • Accelerate the velocity of SRE and product engineering by developing robust platforms, powerful tooling, and intuitive self-service capabilities.
  • Own the design and operation of scalable, self-service Cloud infrastructure platforms (e.g., Kubernetes, service mesh, CI / CD pipelines, IaC & Edge Infrastructure).
  • Lead, mentor, and grow a high-performing team of engineers and managers across platform, infrastructure, and shared services domains.
  • Perform engineering design evaluations and ensure the completion of projects within resource, budget, and scheduling constraints.
  • Improve SDLC processes for Cloud infrastructure as code, including the maturity of CI / CD pipelines, change and release management.
  • Manage service and business expectations and prioritize resource allocation.
  • Maintain a deep knowledge of industry best practices, evolving trends, and technologies.

What Youll Bring To The Role

  • 8+ years of experience in technical leadership & people management.
  • Extensive experience using Agile and DevOps methodologies to build product infrastructure and shared service at scale.
  • 3+ years of experience running large-scale infrastructure platforms supporting a SaaS / Cloud service in a public Cloud, preferably AWS; experience supporting a multi-Cloud environment is a plus.
  • Strong expertise in cloud-native architectures, containerization (Kubernetes), IaC (Terraform), and CI / CD pipelines.
  • Strong background and hands-on experience in SW development, PaaS and automation.
  • Deep experience with building and operating observability platforms and monitoring tools (Grafana, Splunk, APM, etc.) in a large scale environment.
  • Demonstrated ability to lead cross-functional teams and manage large-scale programs.
  • Effective verbal, written communication and interpersonal skills.
  • Computer Science degree or related degree or equivalent experience.
  • Additional Requirements

  • This position requires the ability to access federal environments and / or have access to protected federal data. As a condition of employment, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g., a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee) upon hire.
  • Below is the annual base salary range for candidates located in California. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit : .

    The annual base salary range for this position for candidates located in the San Francisco Bay area is between : $266,000$398,000 USD.

    Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

    #J-18808-Ljbffr

    Create a job alert for this search

    Director Site Engineering • San Francisco, CA, United States

    Related jobs
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    GenentechSouth San Francisco, CA, United States
    Full-time
    It's what drives us to innovate.To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more ...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantumPalo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer, Recommendation Infrastructure - USDS

    Site Reliability Engineer, Recommendation Infrastructure - USDS

    Tik TokSan Jose, CA, United States
    Full-time
    About the Team The USDS TikTok Recommendations Infra SRE team works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems.SREs on...Show moreLast updated: 6 days ago
    • Promoted
    Sr. Director of Solutions Engineering

    Sr. Director of Solutions Engineering

    Epoch BiodesignSan Francisco, CA, United States
    Full-time
    Denver, CO - US, San Francisco, CA - US, Seattle, WA - US, Sunnyvale, CA - US.Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a worl...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Manager, Site Reliability Engineering

    Sr Manager, Site Reliability Engineering

    PayPal, Inc.San Jose, CA, United States
    Full-time
    PayPal has been revolutionizing commerce globally for more than 25 years.Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empow...Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer - Infrastructure

    Site Reliability Engineer - Infrastructure

    VerkadaSan Mateo, CA, United States
    Full-time
    Designed with simplicity in mind, Verkada's six product lines - video security cameras, access control, environmental sensors, alarms, workplace, and intercoms - provide unparalleled building secur...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer, Frontier Systems Infrastructure

    Site Reliability Engineer, Frontier Systems Infrastructure

    OpenAISan Francisco, CA, United States
    Full-time
    The Frontier Systems team at OpenAI builds, launches, and supports the largest supercomputers in the world that OpenAI uses for its most cutting edge model training. We take data center designs, tur...Show moreLast updated: 2 days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Hewlett Packard Enterprise Development LPSan Jose, CA, United States
    Full-time
    Principal Site Reliability Engineer.This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office. Hewlett Packard Enterprise is the gl...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineering Tech Lead

    Site Reliability Engineering Tech Lead

    DataHub IncPalo Alto, CA, United States
    Full-time
    DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises, including Apple, CVS Health, Netflix, and Visa. Innovated jointly with a thriving open-source community of 13,000+ members...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ReplitFoster City, CA, United States
    Full-time
    Replit is the agentic software creation platform that enables anyone to build applications using natural language.With millions of users worldwide and over 500,000 business users, Replit is democra...Show moreLast updated: 30+ days ago
    • Promoted
    Director of Site Reliability Engineering

    Director of Site Reliability Engineering

    Crypto Pro NetworkSan Francisco, California, US
    Full-time
    Director of Site Reliability Engineering.Interested in working on cutting-edge blockchain technology and creating equitable access to the global financial system? Since 2014, the mission-driven tea...Show moreLast updated: 1 day ago
    • Promoted
    Director, Reliability Engineering

    Director, Reliability Engineering

    F5 NetworksSan Jose, CA, United States
    Full-time
    At F5, we strive to bring a better digital world to life.Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital...Show moreLast updated: 28 days ago
    • Promoted
    Site Reliability Engineer (SRE)for Infrastructure

    Site Reliability Engineer (SRE)for Infrastructure

    Staffing the UniverseSan Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE) For Infrastructure.Location : San Francisco Bay Area, CA / Seattle, WA / Los Angeles, CA. Job Description : Responsibilities : .Managing infrastructure services, responsibl...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer - Recommendation Infrastructure

    Site Reliability Engineer - Recommendation Infrastructure

    Tik TokSan Jose, CA, United States
    Full-time
    Our Recommendation Infrastructure Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok use...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Rockwoods IncPleasanton, CA, US
    Full-time
    Note : Candidates must have relevant experience in Medical / Healthcare domains, this is mandatory.Senior SRE Engineer - Pleasanton, 5 days office. Primary work : 24x7 On-call support and setting up mo...Show moreLast updated: 24 days ago
    • Promoted
    Director, Cloud Ops / Site Reliability

    Director, Cloud Ops / Site Reliability

    Decision Engines, Inc.Palo Alto, CA, United States
    Full-time
    We are looking for an experienced Cloud Ops leader who will be responsible for operating what will be the world’s largest enterprise-grade intelligent business process automation platform.We are pi...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    HPESan Jose, CA, United States
    Full-time
    Principal Site Reliability Engineer.This role has been designed as ‘Hybrid’ with an expectation that you will work on average 2 days per week from an HPE office. Hewlett Packard Enterprise is the gl...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PrimerSan Francisco, California, US
    Full-time
    Primer helps B2B products break out of the B2C-centric marketing box.Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...Show moreLast updated: 1 day ago