Talent.com
Sr. Solution Engineer - HPC & AI Systems
Sr. Solution Engineer - HPC & AI SystemsSupermicro • San Jose, CA, United States
Sr. Solution Engineer - HPC & AI Systems

Sr. Solution Engineer - HPC & AI Systems

Supermicro • San Jose, CA, United States
13 hours ago
Job type
  • Full-time
Job description

Job Req ID : 27728

About Supermicro :

Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary :

We are seeking a highly skilled and motivated Senior Solution Engineer to lead efforts in benchmarking, performance tuning, and platform automation for HPC and AI workloads. This role is critical to ensuring our systems meet performance targets for RFQs and support scalable, automated deployment across diverse environments.

Essential Duties and Responsibilities :

Includes the following essential duties and responsibilities (other duties may also be assigned) :

  • HPC & AI Benchmarking

o Execute performance benchmarks for HPC and AI workloads, including MLPerf, across various GPU systems.

o Analyze and optimize system configurations to meet RFQ performance requirements.

  • Performance Tuning & Cluster Optimization
  • o Identify bottlenecks and implement tuning strategies for compute, memory, and I / O performance.

    o Scale and maintain large compute clusters for high-demand workloads.

  • Platform Automation & Infrastructure Engineering
  • o Automate deployment and configuration using Ansible, Terraform, and Docker.

    o Develop infrastructure-as-code solutions to support reproducible and scalable environments.

  • Backend & Middleware Development
  • o Build backend services and middleware to support large-scale distributed deployments.

    o Ensure reliability, modularity, and performance of backend systems.

  • CI / CD & DevOps Integration
  • o Design and maintain CI / CD pipelines to support agile development and minimize downtime.

    o Collaborate with DevOps teams to streamline software delivery and system updates.

  • System Administration & OS Engineering
  • o Perform hands-on installation, tuning, and troubleshooting of Linux systems, especially Red Hat-based environments.

    o Manage software-defined storage and networking components including DNS, DHCP, PXE, and cluster provisioning.

    o Maintain and configure Proof-of-Concept (PoC) system components.

    o Support the system certification processes required by ISV partners.

  • Containerization & Orchestration
  • o Deploy and manage containerized workloads using Kubernetes.

    o Integrate container orchestration with benchmarking and automation workflows.

  • Documentation & Collaboration
  • o Maintain clear and comprehensive technical documentation.

    o Work closely with cross-functional teams including hardware, QA, and product management.

    Qualifications :

  • Bachelor or Master degree in Computer Science or a related field
  • Minimum 8 years of professional experiences with Python and Shell script
  • Proven experience with HPC and AI benchmarking tools (e.g., MLPerf).
  • Strong proficiency in Ansible, Terraform, Docker, and Python.
  • Hands-on experience configuring and troubleshoot Linux OS, servers and network switches.
  • Solid understanding of software-defined storage, networking (DNS, DHCP, PXE), system provisioning and state-of-the-art datacenter operations.
  • Excellent problem-solving, documentation, and collaboration skills.
  • Ability to work independently and lead technical initiatives.
  • Salary Range

    $170,000 - $190,000

    The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

    EEO Statement

    Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

    Create a job alert for this search

    Ai Solution Engineer • San Jose, CA, United States

    Related jobs
    Sr. Solution Architect

    Sr. Solution Architect

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Machine Learning / AI Engineer

    Sr. Machine Learning / AI Engineer

    Rivian • Palo Alto, CA, United States
    Full-time
    Rivian is on a mission to keep the world adventurous forever.This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract.As a company...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer (Core - Platform)

    AI Engineer (Core - Platform)

    BUILD • San Francisco, CA, United States
    Full-time
    As a Engineer, you will be instrumental in shaping the technical direction and core product at Build.You'll work alongside the founders & early team to build state-of-the-art AI agents and the plat...Show more
    Last updated: 13 days ago • Promoted
    AI Systems Engineer

    AI Systems Engineer

    VirtualVocations • Santa Clara, California, United States
    Full-time
    A company is looking for an Engineering & AI Systems Engineer to design and implement internal tools that enhance operational efficiency. Key Responsibilities Build and deploy internal tools to ad...Show more
    Last updated: 30+ days ago • Promoted
    Head of HPC Solution Architecture - AI

    Head of HPC Solution Architecture - AI

    Blue Signal • Fremont, CA, United States
    Full-time
    Head of HPC Solution Architecture - AI.Our client, a leading provider of HPC Solutions for over three decades, offering a comprehensive range of workstations, servers, and clusters, is seeking a.He...Show more
    Last updated: 26 days ago • Promoted
    AI Solution Architect

    AI Solution Architect

    Cognizant • Daly City, CA, US
    Full-time
    Imagine a world where businesses predict market shifts before they happen, anticipate customer needs with precision, make the smartest decisions in real-time, and augment their business processes f...Show more
    Last updated: 21 hours ago • Promoted • New!
    AI System Solution Architect

    AI System Solution Architect

    Cango Inc. • San Francisco, CA, United States
    Full-time
    Join Cango Inc as a Senior Solutions Architect focusing on LLM and diffusion model inference on large-scale GPU clusters. Design end-to-end technical architecture for LLM and Diffusion model inferen...Show more
    Last updated: 19 hours ago • Promoted • New!
    AI Solution Architect

    AI Solution Architect

    VirtualVocations • Oakland, California, United States
    Full-time
    A company is looking for an AI Solution Architect to provide technical leadership and oversight for AI solutions in Microsoft Azure environments. Key Responsibilities Serve as the technical and ar...Show more
    Last updated: 30+ days ago • Promoted
    AI Solutions Engineer

    AI Solutions Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for an AI Solutions Engineer to build and deploy intelligent, real-time solutions for clients using platforms like Cresta and Kore. Key Responsibilities : Configure, develop, a...Show more
    Last updated: 23 days ago • Promoted
    AI Solution Engineer (Onsite)

    AI Solution Engineer (Onsite)

    Cognizant • San Francisco, CA, United States
    Full-time
    Cognizant is one of the world's leading professional services companies, transforming clients' business, operating, and technology models for the digital era. Our unique industry-based, consultative...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Engineer (Gen AI Platform Services, Agentic Systems)

    Senior AI Engineer (Gen AI Platform Services, Agentic Systems)

    Capital One • San Francisco, CA, United States
    Part-time
    You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing.You want to work on problems that will help change banking for good.Passion for s...Show more
    Last updated: 30+ days ago • Promoted
    Principal AI Engineer, Intelligent Sensors

    Principal AI Engineer, Intelligent Sensors

    1010 Analog Devices Inc. • Rio Robles, CA, United States
    Full-time +1
    NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Solution Architect - Enterprise

    Sr. Solution Architect - Enterprise

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 30+ days ago • Promoted
    AI Solution Engineer

    AI Solution Engineer

    MLabs • San Francisco, CA, United States
    Full-time
    AI Solution Engineer (Product-Led Onboarding).San Francisco, CA (Jackson Square).Our client, is a rapidly growing AI company focused on empowering enterprises to deploy and scale.Emergence Capital,...Show more
    Last updated: 4 days ago • Promoted
    AI Solutions Engineer

    AI Solutions Engineer

    ButterflyMX, Inc. • San Francisco, California, US
    Full-time
    ButterflyMX is on a mission to empower people to open and manage doors & gates from a smartphone.Our products are installed in more than 20,000+ multifamily, commercial, gated communities, and ...Show more
    Last updated: 13 days ago • Promoted
    Sr Software Engineer - AI

    Sr Software Engineer - AI

    The Trade Desk • San Francisco, CA, United States
    Full-time
    The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day...Show more
    Last updated: 1 day ago • Promoted
    Sr. AI / ML Engineer

    Sr. AI / ML Engineer

    Vectra • San Jose, CA, United States
    Full-time
    Vectra is the leader in AI-driven threat detection and response for hybrid and multi-cloud enterprises.The Vectra AI Platform delivers integrated signal across public cloud, SaaS, identity, and dat...Show more
    Last updated: 30+ days ago • Promoted
    AI Solution Engineer

    AI Solution Engineer

    MLabs Ltd • San Francisco, CA, United States
    Full-time
    AI Solution Engineer (Product-Led Onboarding).San Francisco, CA (Jackson Square).AI company focused on empowering enterprises to deploy and scale. Emergence Capital, Scale Venture Partners, and the ...Show more
    Last updated: 6 days ago • Promoted