Talent.com
Machine Learning Systems Platform Engineer

Machine Learning Systems Platform Engineer

Blue SignalSan Francisco, CA, United States
8 days ago
Job type
  • Full-time
Job description

Confidential Opening : Machine Learning Systems Platform Engineer

Location : San Francisco, CA (Hybrid Preferred)

Overview

A stealth-mode innovator at the forefront of AI infrastructure is seeking a dynamic Machine Learning Systems Platform Engineer to build the backbone of their next-generation ML ecosystem. This team is leading the charge in developing tools and platforms that empower world-class ML teams to experiment, scale, and deploy faster than ever before.

In this key engineering role, you will architect and optimize the systems that make high-performance AI development possible. From training and tuning to inference and monitoring, your work will enable cutting-edge ML initiatives across the organization. You will work closely with ML scientists and engineers to ensure seamless integration of models into production environments.

Key Responsibilities

  • Build and maintain robust infrastructure to support machine learning workloads at scale, including training pipelines, tuning environments, and deployment frameworks.
  • Develop and automate MLOps pipelines for reproducibility, experiment tracking, model versioning, and validation.
  • Optimize cloud and on-prem GPU compute utilization across orchestration platforms.
  • Lead the implementation of tools for model rollback, observability, and system health monitoring.
  • Collaborate with cross-functional teams to ensure reliability, scalability, and maintainability of ML systems.

Qualifications

  • 3+ years of experience in designing and deploying ML infrastructure or production-grade MLOps tools.
  • Fluency in backend development and infrastructure engineering, especially with Python, Go, Bash, Terraform, or Helm.
  • Experience with ML orchestration tools such as Kubeflow, Airflow, MLflow, Ray, or Metaflow.
  • Proficient in containerization and cloud-native technologies, including Docker, Kubernetes, Argo, or managed ML platforms like SageMaker.
  • Deep understanding of cloud environments (AWS, GCP, or Azure) and GPU-accelerated workloads.
  • Preferred Skills

  • Exposure to distributed training techniques (FSDP, DeepSpeed, Horovod).
  • Knowledge of CI / CD strategies for ML and data drift detection methods.
  • Awareness of privacy, compliance, and security practices in ML systems.
  • Prior experience in infrastructure-first or developer-oriented AI organizations.
  • Compensation and Benefits

  • Base salary range : $160,000 to $230,000 DOE
  • Significant equity package and comprehensive benefits
  • Opportunity to work at the core of transformative AI innovation
  • Why Apply?

    This is a rare opportunity to own and shape the ML platform behind AI that will define the next era. If you thrive in system-level problem solving and want to leave your mark on how machine learning is built at scale, this role is for you.

    Apply today to learn more about this confidential opportunity and how you can play a part in the future of AI engineering.

    #J-18808-Ljbffr

    Create a job alert for this search

    Machine Learning Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    Principal Systems Engineer

    Principal Systems Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Principal Systems Engineer (MBSE).Key Responsibilities Perform systems engineering tasks in support of software development for a Space Ground mission-focused software ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for a Machine Learning Engineer specializing in Personalization for a remote position.Key Responsibilities Build and deploy robust ML systems / algorithms for personalized prod...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Sales Systems Engineer

    Principal Sales Systems Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Principal Sales Systems Engineer - RoIP.Key Responsibilities Support Radio over IP (RoIP) deployments during system lifecycle, including pre-sales and post-sales Provi...Show moreLast updated: 2 days ago
    • Promoted
    Machine Learning Systems Engineer, RL Engineering

    Machine Learning Systems Engineer, RL Engineering

    Menlo VenturesSan Francisco, CA, United States
    Full-time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Engineer, Relevance

    Machine Learning Engineer, Relevance

    PatreonSan Francisco, CA, United States
    Full-time
    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans ...Show moreLast updated: 1 day ago
    • Promoted
    Senior / Staff Machine Learning Systems Engineer

    Senior / Staff Machine Learning Systems Engineer

    Abridge AI Inc.San Francisco, CA, United States
    Full-time
    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare.Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation eff...Show moreLast updated: 21 days ago
    • Promoted
    Machine Learning Engineer - Intelligent Agents & Systems

    Machine Learning Engineer - Intelligent Agents & Systems

    Zyphra Technologies Inc.Palo Alto, CA, United States
    Full-time
    Agentic Systems and Interaction projects.You will be at the forefront of building a next-generation desktop and browser-based agent that can autonomously navigate the web, interact with filesystems...Show moreLast updated: 30+ days ago
    • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago
    • Promoted
    Generative Machine Learning Engineer

    Generative Machine Learning Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Generative Machine Learning Engineer.Key Responsibilities Develop and implement Deep Learning Pipelines using cutting-edge models Sustain the infrastructure for Deep L...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Senior Machine Learning Engineer to join their Data Science team.Key Responsibilities Design data architecture for real-time model endpoints and batch solutions Engine...Show moreLast updated: 30+ days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior MLOps Engineer to design and scale infrastructure for AI research and product development. Key Responsibilities Identify and resolve infrastructure and software b...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Deep Learning Engineer

    Senior Deep Learning Engineer

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Senior Deep Learning Software Engineer - Autonomous Vehicles.Key Responsibilities Train, fine-tune, optimize, and customize perception DNNs in low precision (FP16 / INT8)...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Systems Engineer, Tooling & Infrastructure, Optimus

    Machine Learning Systems Engineer, Tooling & Infrastructure, Optimus

    TeslaPalo Alto, CA, United States
    Full-time
    Machine Learning Systems Engineer, Tooling & Infrastructure, Optimus.Join to apply for the Machine Learning Systems Engineer, Tooling & Infrastructure, Optimus role at Tesla.Get AI-powered advice o...Show moreLast updated: 4 days ago
    • Promoted
    Machine Learning Data Engineer - Systems & Retrieval

    Machine Learning Data Engineer - Systems & Retrieval

    Zyphra Technologies Inc.Palo Alto, CA, United States
    Full-time
    Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from raw ...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Machine Learning Engineer (Recommendation Systems)

    Sr. Machine Learning Engineer (Recommendation Systems)

    ZipRecruiterSan Francisco, CA, United States
    Full-time
    At Philo, we\'re a group of technology and product people who set out to build the future of television, marrying the best in modern technology with the most compelling medium ever invented — in sh...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    Azure Machine Learning Engineer

    Azure Machine Learning Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for an Azure Machine Learning Customer Engineer.Key Responsibilities Lead AI strategy and implementation engagements for Azure Machine Learning and related services Guide cl...Show moreLast updated: 9 hours ago
    • Promoted
    Machine Learning Systems Engineer (1 Year Fixed Term)

    Machine Learning Systems Engineer (1 Year Fixed Term)

    Stanford UniversityStanford, California, US
    Temporary
    The Department of Ophthalmology in the School of Medicine at Stanford University is launching an interdisciplinary Neuro-AI project dedicated to building a foundation model of the brain.Check out t...Show moreLast updated: 23 hours ago
    • Promoted
    Research Engineer - Machine Learning & Systems

    Research Engineer - Machine Learning & Systems

    World LabsSan Francisco, CA, United States
    Full-time
    We are looking for a versatile Research Engineer with a strong background in machine learning or 3D, software development, and systems design. This role is ideal for someone excited about bridging c...Show moreLast updated: 14 days ago