Talent.com
Senior Staff DevOps Engineer - Machine Learning

Senior Staff DevOps Engineer - Machine Learning

ServicenowSanta Clara, California, United States
30+ days ago
Job type
  • Full-time
Job description

Company Description

It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500®. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But this is just the beginning of our journey. Join us as we pursue our purpose to make the world work better for everyone.

Job Description

Please note that this role requires you to be in our Santa Clara office for two days per week.

PLATO (Platform Engineering and AI Technology Organization) at ServiceNow is a customer-focused innovative group building intelligent software using a variety of technology stacks to enable end-to-end, industry-leading work experiences for our customers. We are a group of people deeply invested in the success of our customers that happen to have expertise and knowledge in advanced technologies and software engineering best practices. We are data driven, structured, committed and we enjoy what we are doing. We prioritize robustness, performance and user experience over the technology stack and tools.

We are a group of technology professionals and platform engineers with a dual mission. We build and evolve the AI platform, and partner with teams to build products and end-to-end AI-powered work experiences. In equal measure, we lay the foundations, research, experiment, and de-risk AI technologies that unlock new work experiences in the future.

As a Senior Staff Machine Learning Engineer - Site  Reliability Engineer you will :

  • Contribute to the design, development and implementation of infrastructure, platform, deployment and observability features that power AI workloads.
  • Collaborate with researchers, AI engineers, and infrastructure teams to ensure our GPU clusters perform efficiently, scale well, and remain reliable.
  • Contribute to the continuous improvement of the SRE practice by turning operational use cases into requirements for software tooling.
  • Contribute to the execution of deployment and support activities for AI / ML developers;
  • Build high-quality, clean, scalable and reusable code by enforcing best practices around software engineering architecture and processes (Code Reviews, Unit testing, etc.);
  • Work with the product owners to understand detailed requirements and own your code from design, implementation, test automation and delivery of high-quality product to our users;
  • Experience with operating LLMs on NVIDIA GPUs.
  • Be a mentor for colleagues and help promote knowledge-sharing.

Qualifications

To be successful in this role you have :

  • Experience in leveraging or critically thinking about how to integrate AI into work processes, decision-making, or problem-solving. This may include using AI-powered tools, automating workflows, analyzing AI-driven insights, or exploring AI's potential impact on the function or industry.
  • Proficient in prompt engineering and developing LLM based features
  • Experience with methods of training and fine tuning large language models, such as distilation, supervised fine-tunning and policy optimization
  • Experience in using AI productivity tools such as Cursor, Windsurf, etc

  • 8+ years of experience with infrastructure and platform operations, deployments, SRE, and DevOps with a continued focus on improving Platform health;
  • 6+ years of experience operating highly-available distributed workloads on Kubernetes following a DevOps approach.
  • 6+ years of development experience with Python, GoLang, Java or similar languages;
  • Experience with DevOps tooling  (e.g. Helm / Ansible / Kubernetes / Prometheus / Splunk / GitLab CI);
  • Strong working experience operating distributed systems built on Linux and J2EE;
  • Experience with software-defined networking, infrastructure as code and configuration management;
  • Experience building software for compliance and security in regulated environments
  • Ability to drive outcome in projects with material technical risk.
  • Additional Information

    Work Personas

    We approach our distributed world of work with flexibility and trust. Work personas (flexible, remote, or required in office) are categories that are assigned to ServiceNow employees depending on the nature of their work and their assigned work location. Learn more here . To determine eligibility for a work persona, ServiceNow may confirm the distance between your primary residence and the closest ServiceNow office using a third-party service.

    Equal Opportunity Employer

    ServiceNow is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, sex, sexual orientation, national origin or nationality, ancestry, age, disability, gender identity or expression, marital status, veteran status, or any other category protected by law. In addition, all qualified applicants with arrest or conviction records will be considered for employment in accordance with legal requirements.

    Accommodations

    We strive to create an accessible and inclusive experience for all candidates. If you require a reasonable accommodation to complete any part of the application process, or are unable to use this online application and need an alternative method to apply, please contact [email protected] for assistance.

    Export Control Regulations

    For positions requiring access to controlled technology subject to export control regulations, including the U.S. Export Administration Regulations (EAR), ServiceNow may be required to obtain export control approval from government authorities for certain individuals. All employment is contingent upon ServiceNow obtaining any export license or other approval that may be required by relevant export control authorities.

    From Fortune. ©2025 Fortune Media IP Limited. All rights reserved. Used under license.

    Create a job alert for this search

    Staff Machine Learning Engineer • Santa Clara, California, United States

    Related jobs
    • Promoted
    Principal DevOps Engineer

    Principal DevOps Engineer

    Informatica LLCRedwood City, CA, United States
    Full-time
    Build Your Career at Informatica.We seek innovative thinkers who believe in the power of data to drive meaningful change. At Informatica, we welcome adventurous minds eager to solve the world's most...Show moreLast updated: 23 days ago
    • Promoted
    Staff Machine Learning Engineer, Level 6

    Staff Machine Learning Engineer, Level 6

    Snap Inc.Palo Alto, CA, United States
    Full-time
    Staff Machine Learning Engineer, Level 6 page is loaded.Staff Machine Learning Engineer, Level 6.Apply locations Palo Alto, California Seattle, Washington Santa Monica - 2850 Ocean Park Blvd San Fr...Show moreLast updated: 14 days ago
    • Promoted
    Senior / Staff Machine Learning Engineer, Perception

    Senior / Staff Machine Learning Engineer, Perception

    AgtonomySan Francisco, CA, United States
    Full-time
    At Agtonomy, we’re not just building tech—we’re transforming how vital industries get work done.Our Physical AI and fleet services turn heavy machinery into intelligent, autonomous systems that tac...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    QuantcastSan Francisco, CA, United States
    Full-time
    At Quantcast, we're redefining what's possible in digital advertising.As a global Demand Side Platform (DSP) powered by AI, we help marketers connect with the right audiences and deliver measurable...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Staff Software Engineer, Machine Learning San Francisco (USA) Remote (USA) Discord USD 3[...]

    Senior Staff Software Engineer, Machine Learning San Francisco (USA) Remote (USA) Discord USD 3[...]

    GamecompaniesSan Francisco, CA, United States
    Remote
    Full-time
    Senior Staff Software Engineer, Machine Learning - removed.Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    HiveSan Francisco, CA, United States
    Full-time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    SukiRedwood City, CA, United States
    Full-time
    Get AI-powered advice on this job and more exclusive features.The Future of Healthcare Needs You.At Suki, were building technology that listens, understands, and gets out of the way so clinicians c...Show moreLast updated: 30+ days ago
    Senior DevOps Engineer

    Senior DevOps Engineer

    IntelliPro Group Inc.Palo Alto, CA, CA, US
    Full-time
    Quick Apply
    Senior DevOps Engineer Position Type : FTE Location : Palo Alto, CA Salary Range / Rate (Currency) : $100,000 - $300,000 Job ID# : 158174 Job Summary (Responsibilities and Requirements) : Responsibiliti...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Staff ML Platform Engineer, Scalable Infra & Search

    Staff ML Platform Engineer, Scalable Infra & Search

    Apple Inc.San Francisco, CA, US
    Full-time
    A leading technology company seeks a Staff ML Engineer for their Machine Learning Platform Technologies team in San Francisco. The role involves building scalable infrastructures for machine learnin...Show moreLast updated: less than 1 hour ago
    • Promoted
    Staff / Principal DevOps Engineer (FortiAppSec)

    Staff / Principal DevOps Engineer (FortiAppSec)

    FortinetSunnyvale, CA, US
    Full-time
    We are seeking a highly skilled DevOps Engineer to join our team.In this role, you will design, implement, and maintain scalable, resilient, and secure infrastructure. You will work closely with Dev...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Staff Engineer, Virtual Machines, Google Distributed Cloud

    Senior Staff Engineer, Virtual Machines, Google Distributed Cloud

    Google Inc.Sunnyvale, CA, United States
    Full-time
    Google place Sunnyvale, CA, USA.Bachelor's degree or equivalent practical experience.Experience either developing or deploying / maintaining virtual machine environments on bare metal (VMWare, etc.Ex...Show moreLast updated: 7 hours ago
    • Promoted
    Senior Machine Learning / MLOps Engineer

    Senior Machine Learning / MLOps Engineer

    GamecompaniesSan Francisco, CA, United States
    Full-time
    We’re seeking a Senior Machine Learning Engineer to join our Data Services and Moderation team within Unity Ads! In this role, you’ll focus on transforming ML workflows and ETL pipelines into scala...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    CoinbaseSan Francisco, CA, United States
    Full-time
    Coinbase to help build the first natively onchain advertising ecosystem.As the first Machine Learning Engineer on the team, you’ll lead the development of ranking systems and infrastructure from th...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Software Engineer, Machine Learning

    Staff Software Engineer, Machine Learning

    DiscordSan Francisco, CA, United States
    Full-time
    Discord is used by over 200 million people every month for many different reasons, but there’s one thing that nearly everyone does on our platform : . Over 90% of our users play games, spending a comb...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Staff Machine Learning Engineer

    Senior Staff Machine Learning Engineer

    RipplingSan Francisco, CA, United States
    Full-time
    Rippling gives businesses one place to run HR, IT, and Finance.It brings together all of the workforce systems that are normally scattered across a company, like payroll, expenses, benefits, and co...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Staff Machine Learning Engineer

    Senior Staff Machine Learning Engineer

    PatreonSan Francisco, CA, United States
    Full-time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Engineer, Relevance

    Senior Machine Learning Engineer, Relevance

    PatreonSan Francisco, CA, United States
    Full-time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior / Staff Machine Learning Engineer, Planning

    Senior / Staff Machine Learning Engineer, Planning

    PlusSanta Clara, CA, United States
    Full-time
    Plus is a global provider of highly automated driving and fully autonomous driving solutions with headquarters in Silicon Valley, California. Named by Forbes as one of Americas Best Startup Employer...Show moreLast updated: 13 hours ago