Senior ML Ops Engineer
Axiado • San Jose, California, USA
12 days ago
Job type
  • Full-time
Job description

We are looking for a Senior MLOps Engineer to own and build the end-to-end machine learning lifecycle, with a special focus on secure, reliable deployment to edge devices.

You are a systems thinker and a hands-on engineer. You will be responsible for everything from the initial data pipeline to the final on-device model verification. You will design our data-labeling feedback loops, build the CI/CD pipelines that convert and deploy models, and implement the monitoring systems that tell us how those models are actually performing in the wild, both in terms of speed and quality.

This role is a unique blend of data engineering, DevOps, ML security, and performance optimization. You will be the engineer who ensures our models are not only fast but also trusted, secure, and continuously improving.

Key Responsibilities

1. Data & Labeling Lifecycle Management:

Architect and implement scalable data processing pipelines for ingesting, validating, and versioning massive datasets (e.g., using DVC, Pachyderm, or custom S3/Artifactory solutions).

Design and build the infrastructure for our Human-in-the-Loop (HITL) and AI-in-the-Loop (Active Learning) data labeling systems. This includes creating the feedback loops that identify high-value data for re-labeling.

Conduct deep data analysis to identify data drift, dataset bias, and feature drift, ensuring the statistical integrity of our training and validation sets.

2. On-Device Model Monitoring:

Design and deploy lightweight on-device telemetry agents to monitor inference quality and concept drift, not just operational metrics.

Implement statistical monitoring on model outputs (e.g., confidence distributions, output ranges) and create automated alerting systems to flag model degradation.

Build the backend dashboards (e.g., Grafana, custom dashboards) to aggregate and visualize on-device performance and quality metrics from a fleet of edge devices.
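
As a rough illustration of the statistical monitoring described above, the sketch below compares a live window of model confidence scores against a baseline window using the Population Stability Index (PSI). It is a minimal example and not Axiado's implementation: the function names, the bin count, and the 0.2 alert threshold (a common rule of thumb) are all assumptions made for illustration.

```python
import math
from typing import List, Sequence


def histogram(values: Sequence[float], bins: int = 10) -> List[float]:
    """Return the fraction of values falling in each equal-width bin over [0, 1]."""
    if not values:
        raise ValueError("values must not be empty")
    counts = [0] * bins
    for v in values:
        # Clamp so that out-of-range scores and v == 1.0 land in a valid bin.
        idx = max(0, min(int(v * bins), bins - 1))
        counts[idx] += 1
    return [c / len(values) for c in counts]


def psi(baseline: Sequence[float], live: Sequence[float],
        bins: int = 10, eps: float = 1e-6) -> float:
    """Population Stability Index between two confidence-score samples.

    eps smooths empty bins so the logarithm stays defined.
    """
    total = 0.0
    for b, l in zip(histogram(baseline, bins), histogram(live, bins)):
        p, q = b + eps, l + eps
        total += (q - p) * math.log(q / p)
    return total


def should_alert(baseline: Sequence[float], live: Sequence[float],
                 threshold: float = 0.2) -> bool:
    """Flag possible degradation when PSI exceeds the (assumed) threshold."""
    return psi(baseline, live) > threshold
```

In a setup like this, a telemetry agent might ship only the binned histogram rather than raw scores, which keeps the on-device footprint small.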

3. Model Conversion & Deployment (CI/CD for ML):

Build and maintain a robust CI/CD pipeline (e.g., GitLab CI, Jenkins, GitHub Actions) that automates model training, conversion, quantization (PTQ/QAT), and packaging.

Manage the model conversion process, translating models from PyTorch/TensorFlow into optimized formats (e.g., ONNX, TFLite) for our AI inference engine.

Orchestrate model deployment to edge devices, managing model versioning and enabling reliable Over-the-Air (OTA) updates.

4. On-Device Model Security & Verification:

Implement a robust model verification framework using cryptographic signatures to ensure entity verification (i.e., that the model running on-device is the one we deployed).

Design and apply security protocols (e.g., secure boot, model encryption) to prevent model injection attacks and unauthorized model tampering on the device.

Collaborate with firmware and hardware security teams to ensure our MLOps pipeline adheres to a hardware root of trust.
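
To make the verification idea above concrete, here is a minimal sketch of checking that a model artifact matches what was deployed. It uses HMAC-SHA256 from the Python standard library as a stand-in; this is a swapped-in technique chosen to keep the example self-contained. A real deployment of the kind described would more likely use asymmetric signatures with keys anchored in a hardware root of trust. The function names and key handling are hypothetical.

```python
import hashlib
import hmac


def sign_model(model_bytes: bytes, key: bytes) -> bytes:
    """Produce an authentication tag for a model artifact (build/CI side).

    Assumption: a shared secret key stands in for a real signing key.
    """
    return hmac.new(key, model_bytes, hashlib.sha256).digest()


def verify_model(model_bytes: bytes, tag: bytes, key: bytes) -> bool:
    """Check on-device that the artifact matches the tag issued at deploy time."""
    expected = sign_model(model_bytes, key)
    # compare_digest runs in constant time to avoid leaking tag bytes via timing.
    return hmac.compare_digest(expected, tag)
```

A device would refuse to load any model for which `verify_model` returns `False`, for example after a tampered OTA update.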

5. Performance Optimization:

Analyze and optimize ML model performance for our specific AI inference engine.

Apply graph-level optimizations (e.g., operator fusion, pruning) and OP-level optimizations (e.g., rewriting custom ops, leveraging hardware-specific data types) to maximize throughput and minimize latency.

Qualifications:

5 years of experience in MLOps, DevOps, or Software Engineering with a focus on ML systems.

Proven experience building and managing the full MLOps lifecycle, from data ingestion to production monitoring.

Strong programming skills in Python and deep experience with ML frameworks (e.g., PyTorch, TensorFlow).

Demonstrable experience with model conversion and optimization for edge devices (e.g., using ONNX, TFLite, TensorRT, or Apache TVM).

Strong understanding of data engineering principles and experience with data-labeling strategies (HITL/Active Learning).

Excellent understanding of CI/CD principles and tools (e.g., Git, Docker, GitLab CI).

Preferred Qualifications (The Plus Factors)

Hands-on experience with Kubernetes (K8s) for MLOps orchestration (e.g., Kubeflow, Argo Workflows).

Familiarity with GPU scheduling and virtualization platforms such as Run:AI.

Proficiency in managing MLOps infrastructure on at least one major cloud platform (AWS, GCP, Azure).

Experience with embedded systems security, cryptographic signing, or hardware security modules (HSMs).

Experience in C for deploying high-performance inference code.

Additional Information:

Axiado is committed to attracting, developing, and retaining the highest caliber talent in a diverse and multifaceted environment. We are headquartered in the heart of Silicon Valley, with access to the world's leading research, technology, and talent.

We are building an exceptional team to secure every node on the internet. For us, solving real-world problems takes precedence over purely theoretical problems. As a result, we prefer individuals with persistence, intelligence, and high curiosity over pedigree alone. Working hard and smart, continuous learning, and mutual support are all part of who we are.

Axiado is an Equal Opportunity Employer. Axiado does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.

Remote Work: No

Employment Type: Full-time

Key Skills

APIs, C/C++, Computer Graphics, Go, React, Redux, Node.js, AWS, Library Services, Assembly, GraphQL, High Voltage

Experience : years

Vacancy : 1
