Talent.com
Principal Site Reliability Engineer, ML Platform
Principal Site Reliability Engineer, ML PlatformZscaler • Short Hills, New Jersey, United States
Principal Site Reliability Engineer, ML Platform

Principal Site Reliability Engineer, ML Platform

Zscaler • Short Hills, New Jersey, United States
30+ days ago
Job type
  • Full-time
Job description

About Zscaler

Serving thousands of enterprise customers around the world including 45% of Fortune 500 companies, Zscaler (NASDAQ : ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world’s largest security cloud, Zscaler accelerates digital transformation so enterprises can be more agile, efficient, resilient, and secure. The pioneering, AI-powered Zscaler Zero Trust Exchange™ platform, which is found in our SASE and SSE offerings, protects thousands of enterprise customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location.

Named a Best Workplace in Technology by Fortune and others, Zscaler fosters an inclusive and supportive culture that is home to some of the brightest minds in the industry. If you thrive in an environment that is fast-paced and collaborative, and you are passionate about building and innovating for the greater good, come make your next move with Zscaler.

Our Engineering team built the world’s largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your vision and passion to our team of cloud architects, software engineers, security experts, and more who are enabling organizations worldwide to harness speed and agility with a cloud-first strategy.

Processing billions of transactions and generating trillions of data points daily, we believe data is the key to disrupting the cybersecurity market through AI. Reporting to the EVP of AI Innovations, who leads this team directly under our CEO, this is a career-defining opportunity to influence Zscaler’s AI strategy and drive innovation. This position is hybrid, based in our New Jersey office three days a week. Exceptional remote candidates will also be considered. As a Principal Site Reliability Engineer - ML Platform, you will :

  • Architect, build, and maintain large-scale distributed systems to support end-to-end AI pipelines, including data collection, feature engineering, model training, evaluation, deployment, and real-time serving
  • Act as the owner of Site Reliability Engineering (SRE) for AI-driven applications deployed on AWS, ensuring performance, availability, observability, and scalability
  • Collaborate with the engineering team to design and implement CI / CD pipelines, infrastructure provisioning, scripting automation for deployment and customer-facing services, robust monitoring frameworks using tools and techniques for real-time statistics and performance tracking across production systems
  • Drive innovation and best practices in integrating Kubernetes, ArgoCD, and similar tools into cloud environments, with a focus on AI / ML pipelines and GPU-based cloud structures (e.g., SkyPilot)
  • Serve as the group's FinOps expert and AWS admin, taking ownership of hosting cost optimization and all administrative aspects of the AWS account for ZAIRe

What We're Looking for (Minimum Qualifications) :

  • 10+ years of experience in Site Reliability Engineering, cloud infrastructure, and / or applications architecture, with a strong foundation in Kubernetes and Docker
  • Proven programming expertise in Python, SQL, and distributed processing technologies such as Spark, BigQuery, or Apache Beam
  • Hands-on experience building and maintaining CI / CD pipelines, leveraging infrastructure-as-code tools like ArgoCD, Terraform, or similar
  • Strong knowledge of cloud platforms (AWS preferred, GCP acceptable), including certification or equivalent skills specific to cloud-native system management
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • What Will Make You Stand Out (Preferred Qualifications) :

  • Working knowledge of AI / ML pipelines and frameworks (e.g., SkyPilot, mobile ML training) and experience with GPU-optimized cloud infrastructure
  • Experience with SQL / NoSQL databases, ML automation platforms, and tools for full production lifecycle of AI-based products
  • Advanced degree (Master’s or Ph.D.) in Computer Science, Machine Learning, or related field, with a demonstrated ability to lead projects and innovate quickly in a fast-paced environment
  • #LI-Hybrid

    #LI-KM9

    Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

    The base salary range listed for this full-time position excludes commission / bonus / equity (if applicable) + benefits.

    Base Pay Range

    $164,500 - $235,000 USD

    At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure.

    Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including :

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!
  • Learn more about Zscaler’s Future of Work strategy, hybrid working model, and benefits here .

    By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.

    Zscaler is committed to providing equal employment opportunities to all individuals. We strive to create a workplace where employees are treated with respect and have the chance to succeed. All qualified applicants will be considered for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status, or any other characteristic protected by federal, state, or local laws. See more information by clicking on the Know Your Rights : Workplace Discrimination is Illegal link.

    Pay Transparency

    Zscaler complies with all applicable federal, state, and local pay transparency rules.

    Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.

    Create a job alert for this search

    Site Reliability Engineer • Short Hills, New Jersey, United States

    Related jobs
    Site Reliability Engineer (Hybrid)

    Site Reliability Engineer (Hybrid)

    Selective Insurance • Millburn, NJ, United States
    Temporary
    At Selective, we don't just insure uniquely, we employ uniqueness.Selective's unique position as both a leading insurance group and an employer of choice is recognized in a wide variety of awards a...Show more
    Last updated: 30+ days ago • Promoted
    Civil Engineer PM / Dept Head W / WW

    Civil Engineer PM / Dept Head W / WW

    The WorkPlace Group • Holmdel, NJ, US
    Full-time
    Dept Head / Project Manager -Wastewater Treatment.Relocation Assistance Provided • • •.Are you ready to elevate your career with an employer that values growth, innovation, and.Our client’s cultur...Show more
    Last updated: 8 days ago • Promoted
    Sr. Platform Engineer

    Sr. Platform Engineer

    Apptad Inc • Piscataway, New Jersey, USA
    Full-time
    We are seeking an experienced Platform Engineer specializing in Apache Solr and Lucidworks Fusion to design deploy optimize and maintain our enterprise search infrastructure.The ideal candida...Show more
    Last updated: 13 days ago • Promoted
    Sr. Business Systems Engineer - Direct Procurement & MRP

    Sr. Business Systems Engineer - Direct Procurement & MRP

    CoreWeave • Livingston, NJ, United States
    Permanent
    CoreWeave is The Essential Cloud for AI™.Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence....Show more
    Last updated: 17 days ago • Promoted
    Software Development Engineer

    Software Development Engineer

    Amazon • Matawan, NJ, USA
    Full-time
    Join Amazon's engineering team and help us build innovative solutions to complex problems.As a Software Development Engineer, you will design, develop, and test software applications and services.W...Show more
    Last updated: 22 days ago • Promoted
    Chief Engineer

    Chief Engineer

    ABM.Com • Newark, NJ, US
    Full-time
    ABM (NYSE : ABM) is a leading provider of facility solutions with revenues of approximately $6.United States and various international locations. ABM's comprehensive capabilities include electrical &...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Soho Dragon • Morristown, New Jersey, United States
    Full-time
    SoHo Dragon represents a Fortune 500 Financial Technology firm with offices in Northern, NJ, that is looking to hire a Site Reliability Engineer. Contract duration : 6 months (right to hire).Financia...Show more
    Last updated: 19 days ago • Promoted
    Principal Engineer (AI & Carrier Integration, P&C Insurance Domain experience)

    Principal Engineer (AI & Carrier Integration, P&C Insurance Domain experience)

    TMS LLC • Parsippany, New Jersey, USA
    Full-time
    We are seeking a highly experienced Principal Engineer to lead the design and development of next-generation AI-driven solutions for insurance carrier integration. The ideal candidate will have deep...Show more
    Last updated: 17 days ago • Promoted
    Software Engineer, GPS Knowledge Systems

    Software Engineer, GPS Knowledge Systems

    Bristol Myers Squibb • New Brunswick, New Jersey, USA
    Full-time
    Those arent words that are usually associated with a job.But working at Bristol Myers Squibb is anything but usual.Here uniquely interesting work happens every day in every department.From optimizi...Show more
    Last updated: 21 days ago • Promoted
    Vice President, Advanced Manufacturing Engineering

    Vice President, Advanced Manufacturing Engineering

    Telescope Recruitment • Somerset, NJ, United States
    Full-time
    Our employer, founded in 2000, is a leading provider of premium metal payment cards and secure authentication solutions.Headquartered in Somerset, New Jersey, the company serves major financial ins...Show more
    Last updated: 30+ days ago • Promoted
    Principal Software Engineer

    Principal Software Engineer

    Index Engines • Holmdel, New Jersey, United States
    Full-time
    Index Engines is the world’s leading AI powered analytics engine to detect data corruption due to ransomware.The company’s CyberSense® product empowers organizations to detect ransomware and data c...Show more
    Last updated: 30+ days ago • Promoted
    Civil Engineer PM / Dept Head W / WW (Holmdel)

    Civil Engineer PM / Dept Head W / WW (Holmdel)

    The WorkPlace Group • Holmdel, New Jersey, US
    Part-time
    Dept Head / Project Manager -Wastewater Treatment.Relocation Assistance Provided • • •.Are you ready to elevate your career with an employer that values growth, innovation, and.Our client's culture enab...Show more
    Last updated: 2 hours ago • Promoted • New!
    LIMS Implementation Specialist

    LIMS Implementation Specialist

    Clover Solutions • Parsippany, NJ, United States
    Full-time
    Job Title : LIMS Implementation Specialist.Job Type : Long Term Contract Opportunity.Onsite with occasional travel to Texas). You will conduct code reviews of others against the Labware configuration ...Show more
    Last updated: 15 days ago • Promoted
    Reliability Engineer

    Reliability Engineer

    GE Vernova • Parsippany, New Jersey, USA
    Full-time +1
    As the Reliability Engineer for Metem a GE Vernova business you will be an active contributor to the success of the organization by improving the reliability availability and performance of our equ...Show more
    Last updated: 1 day ago • Promoted
    Manufacturing Site Lean Experts | Techno-functional | Remote

    Manufacturing Site Lean Experts | Techno-functional | Remote

    Samprasoft • New Brunswick, NJ, US
    Remote
    Full-time
    Location : Would like them to be in CST but open to anywhere in US or Mexico.There may be some travel expected, expenses paid.Show more
    Last updated: 30+ days ago • Promoted
    Multifamily Lead Engineer

    Multifamily Lead Engineer

    Resource Innovations • Newark, NJ, US
    Full-time
    Quick Apply
    We are seeking a highly skilled and motivated Lead Engineer with a strong background in multifamily and C&I custom and energy management DSM program engineering to join our dynamic team.As a Le...Show more
    Last updated: 30+ days ago
    LIMS implementation specialist

    LIMS implementation specialist

    Veracity • Parsippany, NJ, United States
    Full-time
    LIMS implementation specialist.Parsippany, NJ (Onsite- with occasional travel to Texas).You will conduct code reviews of others against the LabWare configuration guidelines.You will create and desi...Show more
    Last updated: 30+ days ago • Promoted
    MRO Engineer

    MRO Engineer

    DHD Consulting • Monroe Township, New Jersey, United States
    Full-time
    Quick Apply
    The MRO Engineer is responsible for leading new equipment development, installation, and.This role serves as the technical point of contact for all engineering-related issues, ensuring.Initiate and...Show more
    Last updated: 30+ days ago