Talent.com
Machine Learning Operations Engineer
Machine Learning Operations EngineerRobotics And Ai Institute • Cambridge, Massachusetts, United States
Machine Learning Operations Engineer

Machine Learning Operations Engineer

Robotics And Ai Institute • Cambridge, Massachusetts, United States
30+ days ago
Job type
  • Full-time
Job description

Our Mission

Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future generations of intelligent machines that will help us all live better lives.

Machine Learning Operations (ML-Ops) Engineers build infrastructure that supports the entire lifecycle of Machine Learning (ML) projects from development to scaling and to deployment. If you have a passion for building the foundation that enables robotics research and engineering, you will want to join us!

What You Will Do

  • Design, develop, and maintain company-wide platforms and tooling that utilize Kubernetes infrastructure to enable machine learning and data processing applications
  • Enable self-service access to ML-compute for our on-prem and cloud compute clusters, including support for job scheduling, workload scalability and workload fault tolerance
  • Enhance observability across ML applications through integrations with tools and services such as FluentD, Prometheus, Grafana and DataDog
  • Integrate ML applications with experiment tracking and management services like Weights and Biases
  • Elevate code quality and champion best practices in our engineering processes
  • Collaborate with Machine Learning Engineers, Data Engineers, DEVOPs engineers and researchers to build scalable solutions that improve engineering and research velocity.

What You Will Bring

  • BS or MS in Computer Science, Engineering, or equivalent
  • 3+ years of experience in an MLOPs, DevOps, ML Engineering or software engineering role
  • Strong hands-on experience deploying and managing applications running on Kubernetes
  • Experience developing MLOPS platforms to manage the lifecycle of ML experiments; including one or more of data and artifact management, reproducibility, fault-tolerance, experiment tracking and model serving
  • Experience with Docker and Python environment management tools such as pip, poetry, uv or similar
  • Proficient in software practices such as version control (Git), CI / CD (Github Actions, ArgoCD), Infrastructure as Code(Terraform).
  • Extra Skills We Value

  • Experience with Kueue, or similar job scheduling mechanisms
  • Experience with workflow orchestration tools such as Airflow, Metaflow, Argo Workflows or similar
  • Hands-on experience deploying and managing cloud infra on platforms like GCP and AWS
  • Experience with hybrid-cloud compute and data environments
  • Experience with Ray, Pytorch Lightning or similar scalable AI / ML platforms
  • Experience with application and system, logging with tools and services like FluentD, Prometheus, Grafana and DataDog or similar
  • Experience with Bazel build tool or similar
  • Experience with ML model serving frameworks such as Torchserve, ONNX runtime or similar
  • Experience working with research teams in an academic or industrial environment.
  • We provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

    Create a job alert for this search

    Machine Learning Engineer • Cambridge, Massachusetts, United States

    Related jobs
    Machine Learning Operations Engineer

    Machine Learning Operations Engineer

    CYVL • Boston, MA, United States
    Full-time
    This range is provided by CYVL.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Cyvl is a Boston-based tech startup revolutionizing how civil eng...Show more
    Last updated: 3 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Givzey • Newton, Massachusetts, United States
    Full-time
    We're looking for a talented and motivated Machine Learning Engineer to join our team and help develop cutting-edge AI solutions. In this role, you'll have the opportunity to shape and create our ma...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Gentuity, LLC • Sudbury, MA, United States
    Full-time
    Gentuity is an exciting and highly innovative medical technology firm, active in the research and development, clinical translation, and commercialization of vascular imaging devices.This opportuni...Show more
    Last updated: 9 days ago • Promoted
    Machine Learning Engineer LLM Engineering

    Machine Learning Engineer LLM Engineering

    Liberate • Boston, Massachusetts, United States
    Full-time
    By starting with voice, the most valuable and complex channel in insurance, the company proved that even the hardest problems can be automated. Now expanding into full workflow automation across sal...Show more
    Last updated: 30+ days ago • Promoted
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    Klaviyo • Boston, Massachusetts, United States
    Full-time
    At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair sh...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer - Generative AI

    Machine Learning Engineer - Generative AI

    Zoox • Boston, Massachusetts, United States
    Full-time
    The Perception team at Zoox is at the forefront of leveraging GenAI to create synthetic data, unlocking scalable training and evaluation for our autonomous system's perception and entire stack.As a...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer Boston, MA ( )

    Machine Learning Engineer Boston, MA ( )

    PetsApp • Boston, MA, United States
    Full-time
    Remark is reshaping the e‑commerce landscape by harnessing the power of AI and personalized support to solve a critical problem : the low conversion rates plaguing online shopping.Our platform conne...Show more
    Last updated: 2 days ago • Promoted
    Senior Machine Learning Engineer

    Senior Machine Learning Engineer

    Flagship Pioneering • Boston, Massachusetts, United States
    Full-time
    ProFound Therapeutics is pioneering the discovery of the expanded human proteome to unlock a new universe of potential therapeutics. By integrating multi-omics, advanced computation, and translation...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Operations Engineer

    Machine Learning Operations Engineer

    Cyvl • Boston, Massachusetts, United States
    Full-time
    Cyvl is a Boston-based tech startup revolutionizing the way civil engineering firms and governments map and manage transportation infrastructure. Our enterprise-grade hardware and software solutions...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Leidos Inc • Tewksbury, MA, United States
    Full-time
    Leidos' Security Enterprise Solutions (SES).You will help build and maintain ML models and pipelines, collaborate with cross-functional teams, and contribute to deploying scalable machine learning ...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Acceler8 Talent • Boston, MA, United States
    Full-time
    A fast‑growing startup building the infrastructure that lets enterprises deploy, scale, and operate AI reliably and efficiently. Optimise and accelerate training and inference pipelines.Build and ma...Show more
    Last updated: 4 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Bracebridge Capital • Boston, Massachusetts, United States
    Full-time
    Bracebridge Capital, LLC is a leading hedge fund manager with approximately $12 billion of net assets under management.The firm pursues investment strategies primarily within the global fixed incom...Show more
    Last updated: 30+ days ago • Promoted
    Lead Machine Learning Engineer

    Lead Machine Learning Engineer

    Capital One • Boston, MA, United States
    Full-time +1
    Lead Machine Learning Engineer.As a Capital One Machine Learning Engineer (MLE), you'll be part of an Agile team dedicated to productionizing machine learning applications and systems at scale.You'...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer (I, II, Senior)

    Machine Learning Engineer (I, II, Senior)

    Ikigai Labs • Cambridge, Massachusetts, United States
    Full-time
    The Ikigai platform unlocks the power of generative AI for tabular data.We enable business users to connect disparate data, leverage no-code AI / ML, and build enterprise-wide AI apps in just a few c...Show more
    Last updated: 30+ days ago • Promoted
    Ai & Machine Learning Engineer - Manager - Consulting - Open Location

    Ai & Machine Learning Engineer - Manager - Consulting - Open Location

    Ernst & Young Oman • Boston, MA, United States
    Full-time
    At EY, we’re all in to shape your future with confidence.We’ll help you succeed in a globally connected powerhouse of diverse teams and take your career wherever you want it to go.Join EY and help ...Show more
    Last updated: 3 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Leidos • Tewksbury, MA, US
    Full-time
    Leidos’ Security Enterprise Solutions (SES).You will help build and maintain ML models and pipelines, collaborate with cross-functional teams, and contribute to deploying scalable machine lea...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Air Space Intelligence • Boston, Massachusetts, United States
    Full-time
    ASI enables success for the world's most complex operations.From critical infrastructure to defense, we serve major airlines and U. Backed by top-tier investors—including Andreessen Horowitz, Spark ...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning Operations Engineer

    Machine Learning Operations Engineer

    Cyvl, Inc. • Boston, MA, United States
    Full-time
    Cyvl is a Boston-based tech startup revolutionizing the way civil engineering firms and governments map and manage transportation infrastructure. Our enterprise-grade hardware and software solutions...Show more
    Last updated: 30+ days ago • Promoted