Talent.com
DevOps Engineer
DevOps EngineerAddison Group • Arlington, TX, US
No longer accepting applications
DevOps Engineer

DevOps Engineer

Addison Group • Arlington, TX, US
16 hours ago
Job type
  • Temporary
Job description

Get AI-powered advice on this job and more exclusive features.

This range is provided by Addison Group. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$100.00 / hr - $115.00 / hr

Location : Irving, Tx

Assignment Type : 6-Month Contract-to-Hire

Compensation : $100 / hr-$115 / hr

Work Schedule : Monday-Friday, 9 : 00am-5 : 00pm CST, Hybrid (3 days on-site)

We are seeking a highly skilled Senior Kubernetes Engineer to join our Platform Engineering function in Dallas. In this role, you will design, implement, and optimize GPU-accelerated container platforms at scale, enabling high-performance workloads (AI / ML, HPC, LLM training) across hybrid or on-prem environments. You will have deep expertise with both NVIDIA and Kubernetes ecosystems, including GPU scheduling, device plugins and custom operators.

Responsibilities

  • Architecting and operating Kubernetes clusters optimized for GPU workloads, leveraging NVIDIA GPU Operator, Network Operator and DCGM
  • Developing, deploying and maintaining custom Kubernetes operators and controllers to automate infrastructure services
  • Integrating NVIDIA device plugins, Multi-Instance GPU (MIG) and GPU sharing features into the scheduling layer
  • Optimizing GPU utilization and job placement through scheduler extensions, such as kube-scheduler plugins, Slurm and Volcano
  • Collaborating with HPC, ML and DevOps teams to ensure multi-tenant, high-throughput cluster performance
  • Driving observability and telemetry integrations using Prometheus, Grafana, DCGM Exporter and OpenTelemetry
  • Implementing secure multi-user and multi-namespace GPU isolation, with RBAC and policy enforcement, such as OPA or Gatekeeper
  • Maintaining CI / CD pipelines for Kubernetes infrastructure using GitOps, ArgoCD and FluxCD
  • Contributing to infrastructure-as-code, using Terraform, Helm, and Kustomize
  • Participating in performance tuning, incident response and production readiness reviews

Seniority level

  • Mid-Senior level
  • Employment type

  • Contract
  • Job function

  • Engineering and Information Technology
  • Industries

  • Investment Banking
  • J-18808-Ljbffr

    Create a job alert for this search

    Engineer • Arlington, TX, US