No longer accepting applications
Sr. Solution Engineer - HPC & AI Systems

Supermicro
San Jose, CA, United States
15 days ago
Job type
  • Full-time
Job description

Job Summary

We are seeking a highly skilled and motivated Senior Solution Engineer to lead efforts in benchmarking, performance tuning, and platform automation for HPC and AI workloads. This role is critical to ensuring our systems meet performance targets for RFQs and support scalable, automated deployment across diverse environments.

Essential Duties and Responsibilities

The role includes the following essential duties and responsibilities (other duties may also be assigned):

HPC & AI Benchmarking

  • Execute performance benchmarks for HPC and AI workloads, including MLPerf, across various GPU systems.
  • Analyze and optimize system configurations to meet RFQ performance requirements.

Performance Tuning & Cluster Optimization

  • Identify bottlenecks and implement tuning strategies for compute, memory, and I/O performance.
  • Scale and maintain large compute clusters for high-demand workloads.

Platform Automation & Infrastructure Engineering

  • Automate deployment and configuration using Ansible, Terraform, and Docker.
  • Develop infrastructure-as-code solutions to support reproducible and scalable environments.

Backend & Middleware Development

  • Build backend services and middleware to support large-scale distributed deployments.
  • Ensure reliability, modularity, and performance of backend systems.

CI/CD & DevOps Integration

  • Design and maintain CI/CD pipelines to support agile development and minimize downtime.
  • Collaborate with DevOps teams to streamline software delivery and system updates.

System Administration & OS Engineering

  • Perform hands-on installation, tuning, and troubleshooting of Linux systems, especially Red Hat-based environments.
  • Manage software-defined storage and networking components including DNS, DHCP, PXE, and cluster provisioning.
  • Maintain and configure Proof-of-Concept (PoC) system components.
  • Support the system certification processes required by ISV partners.

Containerization & Orchestration

  • Deploy and manage containerized workloads using Kubernetes.
  • Integrate container orchestration with benchmarking and automation workflows.

Documentation & Collaboration

  • Maintain clear and comprehensive technical documentation.
  • Work closely with cross-functional teams including hardware, QA, and product management.

Qualifications

  • Bachelor's or Master's degree in Computer Science or a related field
  • Minimum 8 years of professional experience with Python and shell scripting
  • Proven experience with HPC and AI benchmarking tools (e.g., MLPerf)
  • Strong proficiency in Ansible, Terraform, Docker, and Python
  • Hands-on experience configuring and troubleshooting Linux OS, servers, and network switches
  • Solid understanding of software-defined storage, networking (DNS, DHCP, PXE), system provisioning, and state-of-the-art datacenter operations
  • Excellent problem-solving, documentation, and collaboration skills
  • Ability to work independently and lead technical initiatives
Salary Range

$170,000 - $190,000

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
