Talent.com
MLops Engineer

MLops Engineer

ArrayoBoston, MA, United States
Hace 3 días
Tipo de contrato
  • A tiempo completo
Descripción del trabajo

MLops Engineer (Training Scalability & Workflow Optimization)

We are seeking an MLops Engineer to lead the scaling of machine learning training pipelines and ensure the robustness and efficiency of our end-to-end ML workflows. This role focuses on leveraging Flyte , Kubernetes (GPU optimization), Docker , and distributed training frameworks such as Ray to optimize and streamline our ML infrastructure.

Overview

This role focuses on leveraging Flyte , Kubernetes (GPU optimization), Docker , and distributed training frameworks such as Ray to optimize and streamline our ML infrastructure.

Responsibilities

  • Workflow Orchestration : Develop and maintain ML workflows using Flyte to manage complex ML pipelines for training, testing, and deployment.
  • Training Scalability : Architect and scale large-scale ML training systems on GPU-backed Kubernetes clusters , including auto-scaling and performance tuning for multi-node / multi-GPU workloads.
  • Distributed Computing : Implement distributed model training pipelines using frameworks like Ray for parallelization and resource efficiency.
  • Containerization : Design, build, and optimize Docker images for ML workloads with a focus on reproducibility and security.
  • Resource Optimization : Debug and optimize GPU utilization, memory, and compute bottlenecks during training and inference phases.
  • Monitoring & Maintenance : Integrate monitoring for ML jobs, track resource consumption, and enforce cost-efficient resource utilization.
  • Collaboration : Work closely with data scientists and ML engineers to productize and scale ML experiments.

Qualifications

  • Strong proficiency with Kubernetes (GPU scheduling, Helm, cluster autoscaling).
  • Hands-on experience with Flyte or similar workflow orchestration tools (Airflow, Prefect).
  • Deep knowledge of distributed ML training (e.g., PyTorch DDP, Ray, Horovod).
  • Expertise in Docker and container lifecycle management.
  • Solid understanding of GPU hardware / software stack (CUDA, NCCL).
  • Familiarity with CI / CD for ML (MLops pipelines using tools like GitHub Actions, ArgoCD).
  • Bonus : Familiarity with observability tools for ML systems (Prometheus, Grafana).
  • #J-18808-Ljbffr

    Crear una alerta de empleo para esta búsqueda

    Mlops Engineer • Boston, MA, United States

    Ofertas relacionadas
    • Oferta promocionada
    Solution Deployment Engineer

    Solution Deployment Engineer

    Flock SafetyBoston, MA, US
    A tiempo completo
    Flock Safety is the leading safety technology platform, helping communities thrive by taking a proactive approach to crime prevention and security. Our hardware and software suite connects cities, l...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Model-Based Systems Engineer (MBSE)

    Model-Based Systems Engineer (MBSE)

    GCR Professional ServicesWoburn, MA, US
    A tiempo completo
    MBSE Systems Engineer Contract 40 hours weekly 6+ months Role can be Hybrid, TBDThe Systems Engineer should have experience architecting systems, working across the Systems Engineering Lifecycle, a...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior MLOps Engineer

    Senior MLOps Engineer

    EBSCO Information ServicesBoston, MA, United States
    A tiempo completo
    EBSCO Information Services (EBSCO) delivers a fully optimized research experience, seamlessly integrated with a powerful discovery platform to support the information needs and maximize the researc...Mostrar másÚltima actualización: hace 3 días
    • Oferta promocionada
    Senior Software Engineer (ML Operations)

    Senior Software Engineer (ML Operations)

    WhoopBoston, MA, US
    A tiempo completo
    At WHOOP, we're on a mission to unlock human performance and healthspan.WHOOP empowers members to perform at a higher level through a deeper understanding of their bodies and daily lives.We are...Mostrar másÚltima actualización: hace 21 días
    • Oferta promocionada
    Principal Fuel Systems Engineer (R3300) (Remote)

    Principal Fuel Systems Engineer (R3300) (Remote)

    Shield AIBoston, MA, United States
    Teletrabajo
    A tiempo completo +1
    Founded in 2015, Shield AI is a venture-backed deep-tech company with the mission of protecting service members and civilians with intelligent systems. Its products include the V-BAT aircraft, Hivem...Mostrar másÚltima actualización: hace 19 días
    • Oferta promocionada
    Telemedicine Physician

    Telemedicine Physician

    QuickMDScituate, MA, US
    A tiempo completo
    QuickMD is a leading telemedicine provider, delivering high-quality virtual care across 44 states.Since our founding in 2019, we have helped more than 100,000 patients access essential medical trea...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior ML Platform Engineer

    Senior ML Platform Engineer

    WhoopBoston, MA, US
    A tiempo completo
    At WHOOP, we're on a mission to unlock human performance and healthspan.WHOOP empowers members to perform at a higher level through a deeper understanding of their bodies and daily lives.We are...Mostrar másÚltima actualización: hace 16 días
    • Oferta promocionada
    Free CDL Training and Job Placement for a Fresh Start

    Free CDL Training and Job Placement for a Fresh Start

    EmergeCohasset, MA, US
    A tiempo completo
    We are a government-funded job training program for people who have been impacted by the justice system (arrest, probation, parole, or incarceration), and we help them become CDL truck drivers.Free...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Senior Systems Applications Engineer (GMSL)

    Senior Systems Applications Engineer (GMSL)

    1010 Analog Devices Inc.Wilmington, MA, United States
    A tiempo completo +1
    NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...Mostrar másÚltima actualización: hace 19 días
    • Oferta promocionada
    Senior Software Engineer, ML Infrastructure

    Senior Software Engineer, ML Infrastructure

    MotionalBoston, MA, US
    A tiempo completo
    Our team builds the foundational infrastructure that empowers Machine Learning Engineers to develop the next generation of self-driving technology. We design and operate the high-performance, large-...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior Structural Engineer

    Senior Structural Engineer

    Rivermoor Engineering, LLCScituate, Massachusetts, US
    A tiempo completo
    Senior Structural Engineer Position requires 10 plus years experience in the design of concrete, steel and wood framed structures with a knowledge of RAM, TEDDS and / or RISA design software.Success...Mostrar másÚltima actualización: hace 1 día
    • Oferta promocionada
    Technical Sales Engineer

    Technical Sales Engineer

    D.W. ClarkTaunton, MA, US
    Indefinido
    Manufacturing Engineering, Mechanical Engineering, Metallurgy.Clark casts engineered components for critical infrastructure and defense. Our mission is to advance how castings are made and the syste...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Principal Software Engineer, ML Infrastructure

    Principal Software Engineer, ML Infrastructure

    MotionalBoston, MA, US
    A tiempo completo
    Our team builds the foundational infrastructure that empowers Machine Learning Engineers to develop the next generation of self-driving technology. We design and operate the high-performance, large-...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Maintenance Reliability Engineer

    Maintenance Reliability Engineer

    CyberCodersGloucester, MA, US
    A tiempo completo
    Maintenance Reliability Engineer.Maintenance Reliability Engineer - Food Manufacturing .Gloucester, MA - On Site .Year + Benefits & Bonus . This candidate must have 3+ Years of Ex...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Engineer, Senior

    Engineer, Senior

    Constellation EnergyCanton, MA, US
    A tiempo completo
    As the nation's largest producer of clean, carbon-free energy, Constellation is focused on our purpose : accelerating the transition to a carbon-free future. We have been the leader in clean ener...Mostrar másÚltima actualización: hace 3 días
    • Oferta promocionada
    Machine Learning Operations Engineer

    Machine Learning Operations Engineer

    Cyvl, Inc.Boston, MA, United States
    A tiempo completo
    Cyvl is a Boston-based tech startup revolutionizing the way civil engineering firms and governments map and manage transportation infrastructure. Our enterprise-grade hardware and software solutions...Mostrar másÚltima actualización: hace más de 30 días
    • Oferta promocionada
    Senior MLOps Engineer, vLLM Inference

    Senior MLOps Engineer, vLLM Inference

    Red HatBoston, MA, United States
    A tiempo completo +1
    Senior MLOps Engineer, vLLM Inference page is loaded## Senior MLOps Engineer, vLLM Inferenceremote type : Hybridlocations : Boston : Dublin - MSO : Remote Ireland : Waterford Citytime type : Full timepos...Mostrar másÚltima actualización: hace 3 días
    • Oferta promocionada
    MuleSoft QA Engineer

    MuleSoft QA Engineer

    UniFirstWilmington, MA, US
    A tiempo completo
    This is a hybrid role with 50% on-site requirement in Wilmington, MA.The ideal candidate will validate.Functional Solution Documents (FSDs). The role requires proficiency in.API test automation tool...Mostrar másÚltima actualización: hace 1 día