Talent.com
Senior HPC Engineer, Infrastructure Specialist Team
Senior HPC Engineer, Infrastructure Specialist TeamNVIDIA • Remote, CA, US
Senior HPC Engineer, Infrastructure Specialist Team

Senior HPC Engineer, Infrastructure Specialist Team

NVIDIA • Remote, CA, US
30+ days ago
Job type
  • Full-time
  • Remote
Job description

NVIDIA is looking for a Senior HPC Engineer to join its Professional Services team. Academi c, c ommercial and government groups around the world are using NVIDIA products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI / HPC systems in the world! NVIDIA is looking for someone with the ability to work on a dynamic customer-focused team that requires excellent interpersonal skills. This role will be interacting with customers, partners and internal teams to analyze, define , and implement large - scale AI / HPC projects. These efforts include a combination of networking, system design and automation and validation.

What you will be doing :

Primary responsibilities will include deploying, managing, and validating AI / HPC infrastructure in l inux -based environments for new and existing customers.

Be the domain expert with customers during planning calls through implementation.

Handover-related documentation and perform knowledge transfers required to support customers as they begin rolling out some of the most sophisticated systems in the world!

Provide feedback to internal teams such as opening bugs, documenting workarounds, and suggesting improvements.

What we need to see :

5+ years providing in-depth support and deployment services ; solving problems for hardware and software products.

Knowledge and experience with l inux s ystem a dministration, process management, package management, task scheduling, kernel management, boot procedures / troubleshooting, performance reporting / optimization / logging, network-routing / advanced networking (tuning and monitoring).

Cluster management technologies ( bonus credit for BCM (Base Command Manager) ).

Minimum of a four-year degree from an accredited university or college in Computer Science, or Electrical or Computer Engineering or equivalent experience.

Scripting proficiency (Bash, Python, Ansible, etc . ).

Excellent interpersonal skills and the ability to deliver resolutions for customer issues as they arise.

Strong organizational skills and ability to prioritize / multi-task easily with limited supervision.

Experience with s chedulers such as SLURM, LSF, UGE, etc.

A willingness to travel to customer sites within the United States.

Automation tooling background (Ansible, Puppet, etc.).

Experience with benchmarking tools such as HPL, NCCL tests, MLPERF .

Kubernetes experience .

Ways to stand out from crowd :

InfiniBand experience.

Experience with GPU (Graphics Processing Unit) focused hardware / software.

Experience with MPI (Message Passing Interface) .

Storage technologies such as Lustre or GPFS.

Familiarity with Dell and Supermicro GPU platforms

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 116,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Create a job alert for this search

Senior Infrastructure Engineer • Remote, CA, US

Related jobs
Security and Infra Engineer

Security and Infra Engineer

Quality Choice Solutions • CA, United States
Full-time
Quick Apply
Must Haves : 4-yr Technical Degree 8+ yrs IT Infrastructure and Security Engineering 5+ yrs scripting proficiency using Python and PowerShell 3+ yrs o...Show more
Last updated: 3 days ago
OB / GYN Hospitalist – Kaweah Health Medical Center | Visalia, CA

OB / GYN Hospitalist – Kaweah Health Medical Center | Visalia, CA

Ob Hospitalist Group • Visalia, US
Full-time +1
Looking to practice meaningful medicine in a flexible, lifestyle-forward role? OB Hospitalist Group (OBHG) is seeking dedicated OB / GYN physicians to join our team at.California’s busiest and ...Show more
Last updated: 30+ days ago • Promoted
Principal Engineer, Standards and Design

Principal Engineer, Standards and Design

Metrolink • California, CA, US
Full-time
PURPOSE OF POSITION The Southern California Regional Rail Authority, the operator of the METROLINK Commuter Rail System, is seeking a Principal Engineer, Design and Standards, who will support the ...Show more
Last updated: 30+ days ago • Promoted
Multi-Disciplined Engineer IV

Multi-Disciplined Engineer IV

Elevait Solutions • CA, United States
Full-time
Quick Apply
Hlk206149683"> Job Title : Multi-Disciplined Technician III Location : Sunnyvale CA< / ...Show more
Last updated: 4 days ago
Implementation & Integrations Engineer (Adtech)

Implementation & Integrations Engineer (Adtech)

Thanks • CA, US
Remote
Full-time
Quick Apply
The future of advertising is beautiful.Implementation & Integrations Engineer to join our US team and play a founding role in ensuring seamless technical onboarding across our advertising platf...Show more
Last updated: 30+ days ago
Network and Computer Systems Administrator (Team Lead) Senior

Network and Computer Systems Administrator (Team Lead) Senior

Amewas • California, California, USA
Full-time
At AMEWAS we dont just support defense- we shape it.For over 40 years weve been a trusted partner of the Department of Defense (DoD) by providing cutting-edge engineering testing and evaluation for...Show more
Last updated: 16 days ago • Promoted
Senior Technical Specialist

Senior Technical Specialist

Simarn Solutions • California, California, United States
Full-time +1
Quick Apply
Information Technology / Data Engineering / ETL.We are seeking a Senior Technical Specialist with deep expertise in Informatica PowerCenter and SAP data integration. The role focuses on ETL developm...Show more
Last updated: 3 days ago
Lead Integration support engineer (Web Methods & Control M)

Lead Integration support engineer (Web Methods & Control M)

Aroha Technologies • CA, US
Full-time
Position : Lead Integration support engineer (Web Methods & Control M) Location : Culver City, CA (onsite) Duration : Long Term Client : Media Client Job Description : Web Methods 1.Good understanding o...Show more
Last updated: 16 days ago • Promoted
AI Infrastructure Specialist

AI Infrastructure Specialist

Openkyber • CA, United States
Full-time
Quick Apply
RESPONSIBILITIES : Kforce's client in Los Angeles, CA is seeking a Senior Full Stack Observability Engineer / Architect to lead the design, implementation, and optimization of full stack observab...Show more
Last updated: 1 day ago
Network SME

Network SME

Barrow Wise Consulting • CA, USA
Full-time
Quick Apply
Enjoy problem-solving, need a venue to display your creativity, and emerging technologies pique your interest; if so, Barrow Wise Consulting, LLC is for you. As a multi-disciplined leader, you under...Show more
Last updated: 8 days ago
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Govx • California, United States, California, United States
Remote
Full-time
GOVX is seeking an experienced Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems through automation, observability, and operat...Show more
Last updated: 30+ days ago • Promoted
Director of AI Infrastructure

Director of AI Infrastructure

Openkyber • CA, United States
Full-time
Quick Apply
Cerebra Consulting Inc is a System Integrator and IT Services Solution provider with a focus on Big Data, Business Analytics, Cloud Solutions, Amazon Web Services, Salesforce, Oracle EBS, Peoplesof...Show more
Last updated: 21 days ago
Engineer - Transportation / Traffic

Engineer - Transportation / Traffic

4Creeks, Inc. • Visalia, CA, United States
Full-time
Creeks is seeking a Civil Engineer to work under the direction of a Senior Civil / Traffic Engineer or Department Manager on our Public Works team in our Visalia or Clovis office.Responsibilities for...Show more
Last updated: 30+ days ago
Urology Hanford

Urology Hanford

Adventist Health • Tulare, US
Full-time
We are seeking a skilled and compassionate Urologist to join our team in Hanford, Ca.As a Urologist, you will diagnose and treat disorders and diseases of the urinary tract and reproductive organs ...Show more
Last updated: 30+ days ago • Promoted
Senior Software Engineer, Network & Communications

Senior Software Engineer, Network & Communications

Firestorm • Remote, California, United States,
Remote
Permanent
Quick Apply
At Firestorm, we’re on a mission to revolutionize how defense solutions are designed and delivered.We call this vision “democratized deterrence. As a VC-backed company at the intersection of defense...Show more
Last updated: 30+ days ago
Network Engineer

Network Engineer

Match Point Solutions • CA, United States
Full-time
Quick Apply
MatchPoint Solutions is a fast-growing, young, energetic global IT-Engineering services company with clients across the US. We provide technology solutions to various clients like Uber, Robinhood, N...Show more
Last updated: 30+ days ago
Network Security Architect

Network Security Architect

Dawar Consulting, Inc. • California, CA, us
Full-time
Quick Apply
Our Client is seeking an experienced.Senior Network Security Engineer.Duration : Long Term (Possibility of Further Extension). The ideal candidate will have hands-on architect, and setup expertise in...Show more
Last updated: 10 days ago
Storage DevOps Engineer

Storage DevOps Engineer

MetroSys • CA, US
Full-time
Quick Apply
Overview MetroSys is seeking a highly skilled Storage DevOps Engineer with strong automation, scripting, and infrastructure-as-code expertise. This role focuses on building and supporting automation...Show more
Last updated: 15 days ago