Talent.com
Machine Learning Operations Engineer
Robotics and AI Institute • Cambridge, MA, US
1 day ago
Job type
  • Full-time
Job description

Our Mission

Our mission is to solve the most important and fundamental challenges in AI and Robotics to enable future generations of intelligent machines that will help us all live better lives.

Machine Learning Operations (ML-Ops) Engineers build infrastructure that supports the entire lifecycle of Machine Learning (ML) projects, from development through scaling to deployment. If you have a passion for building the foundation that enables robotics research and engineering, you will want to join us!

What You Will Do

  • Design, develop, and maintain company-wide platforms and tooling that utilize Kubernetes infrastructure to enable machine learning and data processing applications
  • Enable self-service access to ML-compute for our on-prem and cloud compute clusters, including support for job scheduling, workload scalability and workload fault tolerance
  • Enhance observability across ML applications through integrations with tools and services such as Fluentd, Prometheus, Grafana and Datadog
  • Integrate ML applications with experiment tracking and management services like Weights and Biases
  • Elevate code quality and champion best practices in our engineering processes
  • Collaborate with Machine Learning Engineers, Data Engineers, DevOps engineers and researchers to build scalable solutions that improve engineering and research velocity

What You Will Bring

  • BS or MS in Computer Science, Engineering, or equivalent
  • 3+ years of experience in an MLOps, DevOps, ML Engineering or software engineering role
  • Strong hands-on experience deploying and managing applications running on Kubernetes
  • Experience developing MLOps platforms to manage the lifecycle of ML experiments, including one or more of data and artifact management, reproducibility, fault tolerance, experiment tracking and model serving
  • Experience with Docker and Python environment management tools such as pip, poetry, uv or similar
  • Proficient in software practices such as version control (Git), CI/CD (GitHub Actions, ArgoCD) and Infrastructure as Code (Terraform)

Extra Skills We Value

  • Experience with Kueue, or similar job scheduling mechanisms
  • Experience with workflow orchestration tools such as Airflow, Metaflow, Argo Workflows or similar
  • Hands-on experience deploying and managing cloud infra on platforms like GCP and AWS
  • Experience with hybrid-cloud compute and data environments
  • Experience with Ray, PyTorch Lightning or similar scalable AI/ML platforms
  • Experience with application and system logging using tools and services like Fluentd, Prometheus, Grafana, Datadog or similar
  • Experience with the Bazel build tool or similar
  • Experience with ML model serving frameworks such as TorchServe, ONNX Runtime or similar
  • Experience working with research teams in an academic or industrial environment

We provide equal employment opportunities to all employees and applicants for employment and prohibit discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
