Talent.com
HPC Engineer

Avalore, LLC
Annapolis Junction, MD, US
30+ days ago
Job type
  • Full-time
Job description

What you will be doing

You will play a key role in defining and operating some of the most complex compute platforms the client brings to bear against difficult problems. These systems enable analysis, simulation, and modeling that leverage massively parallel computing and disparate holdings of very large data sets to answer difficult questions. You will assist users in deploying jobs to these systems and in harnessing their capabilities to produce answers in the form of analytic products, models, and simulations. This mission enablement is at the heart of solving the hardest problems.

  • Responsible for day-to-day operations and maintenance of the HPC systems
  • Provide day-to-day systems administration for NVIDIA GPUs, commodity cluster systems, and Cray HPC environments
  • Perform system monitoring, software installations, debugging, upgrades, health checks, and identification and implementation of automated business processes
  • Provide assessments, ongoing performance analysis, and recommendations for future architectures
  • Responsible for operating all the host systems for the analysis
  • Works in a liaison role, linking the analysts and their specialty codes and applications to the computing systems, with a focus on yielding in-depth, technically sound results
  • Oversees analytic applications running on a clustered HPC fabric including CPU and GPU systems
  • Manage job submission for client applications and codes using MPI / OpenMPI
  • Provide in-depth analytic results to achieve a best-tool-for-the-job approach
  • Partners with data scientists, engineers, and analysts conducting specialized scientific and engineering analysis.
  • Escalate issues and problems to hardware support and / or engineering management as necessary
  • Responsible for continuous performance analysis and tuning of the HPC environment
  • Assist with the identification, troubleshooting, and repair of software problems impacting performance of implemented HPC solutions
  • Perform installation of software patches including upgrades to operating systems and firmware
  • Assist with the resolution of trouble tickets and software problems identified by system users
  • Identify and expand services and functionalities offered in HPC environment
  • Be a primary point of contact to resolve any hardware or software malfunctions, including working with service personnel as necessary
  • Review system logs to identify and resolve software and systems related issues
  • Prepare reports on the operational efficiency of the hardware and the execution of users' jobs
  • Experience with MPI / OpenMPI, SLURM, and Linux Operating Systems essential
  • Prior experience as a Systems Administrator essential, with a preference for experience working with clustered systems including GPUs in the hardware stack
  • Experience with high-speed networking and CUDA preferred
  • Software integration experience a plus
  • Other duties could be required to support the customer’s mission

Requirements

  • Minimum of 6 years demonstrated on-the-job experience
  • Demonstrated on-the-job experience with integrating functionality from disparate systems via scripting / tooling / automation
  • Demonstrated on-the-job experience with the Sponsor's system security environment and requirements
  • Demonstrated experience leading systems architecture, operations, maintenance and administration
  • Clearance: Active TS/SCI with an appropriate current polygraph is required to be considered for this role, along with the ability to receive privileged access rights

Benefits

Eligibility requirements apply.

  • Employer-Paid Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA) with a generous matching program
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Short Term & Long Term Disability
  • Training & Development
  • Employee Assistance Program