Talent.com
GPU Infrastructure Engineer

VirtualVocations • Oakland, California, United States
3 days ago
Job type
  • Full-time
Job description

A company is looking for a GPU and HPC Infrastructure Engineer - New College Grad 2025.

Key Responsibilities

Contribute to the automation of datacenter operations and lifecycle management for large-scale Machine Learning systems

Implement monitoring and health management capabilities for GPU assets to ensure reliability and scalability

Develop software for NVLink topology management and build automated test infrastructure for distributed systems

Required Qualifications

Pursuing or recently completed a BS or MS in Computer Science, Engineering, Physics, Mathematics, or a comparable degree

Software engineering experience on large-scale production systems

Strong knowledge of a systems programming language (Go, Python) and understanding of Data Structures and Algorithms

High-level knowledge of Linux system administration and cluster management systems (Kubernetes, SLURM)

Understanding of performance, security, and reliability in complex distributed systems

