Talent.com
Senior HPC Cluster Systems Administrator
Lawrence Berkeley National Laboratory • Berkeley, CA, United States
16 days ago
Job type
  • Full-time
Job description

Berkeley Lab's (LBNL) Information Technology Division (IT) has an opening for a Senior HPC Cluster Systems Administrator to join their ScienceIT Team!

In this exciting role, you will support the Berkeley Lab research community by building, integrating, and maintaining Linux-based resources, high-performance computing cluster systems, and Kubernetes clusters. This role provides extensive expertise in High Performance Computing infrastructure and delivers advanced Linux solutions to further scientific endeavors at Berkeley Lab. The mission of Scientific Computing under ScienceIT is to facilitate groundbreaking fundamental research globally by providing essential computing tools, networks, and expertise to enable pioneering science.

This position has an anticipated start date of January 5, 2026.

We're here for the same mission, to bring science solutions to the world. Join our team and YOU will play a supporting role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!

Why join Berkeley Lab?

We invest in our employees by offering a total rewards package you can count on:

  • Exceptional health and retirement benefits, including pension or 401K-style plans
  • Opportunities to grow in your career - check out our Tuition Assistance Program
  • A culture where you'll belong - we are invested in our teams!
  • In addition to accruing vacation and sick time, we also have an annual Winter Holiday Shutdown
  • Parental bonding leave (for both mothers and fathers)
  • Pet insurance

What You Will Do:

  • Perform Linux system and HPC cluster maintenance and installations, operating system upgrades, system security hardening and intrusion detection, storage and file system management, system hardware, customization of user group working environment, troubleshooting, network monitoring, and crash recovery.
  • Design, deploy, and manage scalable applications using Kubernetes, ensuring the availability, performance, and readiness of the Kubernetes infrastructure.
  • Automate deployment, scaling, and management of containerized applications, and collaborate with DevOps and development teams to streamline CI/CD pipelines.
  • Design, deploy, and manage the global storage platform to ensure high performance, massive scalability, reliability, and future-proof solutions.
  • Support networks and storage technologies such as Lustre and VAST.
  • Resolve I/O issues related to business applications, including diagnosing and resolving complex storage, Linux, and networking challenges in a fast-paced environment.
  • Research new storage management technologies and techniques, and provide recommendations.
  • Participate in developing system administration, security, and network policies, documentation, and tools oriented towards efficient systems management.
  • Participate in cluster support to staff and researchers, including initial installation, integration, and ongoing maintenance of Linux High-Performance Computing cluster systems. This includes travel to remote sites as needed.
  • Co-lead technical efforts with other senior system administrators in areas of HPC technologies such as job schedulers, high-performance interconnects, parallel file systems, cybersecurity, cluster management, container orchestration, VM infrastructure, networking, performance tuning, or data center planning.
  • Co-lead group projects of small to medium size and complexity to implement and deploy new computing technologies and associated services to the research community.

What We Are Looking For:

  • A Bachelor's Degree (or equivalent knowledge/training) in Computer Science, Engineering, or a related discipline, and a minimum of 12 years of relevant experience in Linux system administration within a large distributed computing environment, including experience providing systems and end-user support for multiple scientific or computational research groups, or an equivalent combination of education and experience.
  • Demonstrated ability to manage large-scale, performance-critical environments, including capacity planning, scaling, and optimization.
  • Significant experience deploying, scaling, and managing Kubernetes clusters, with a strong understanding of its architecture (pods, deployments, services, ingress) and container orchestration. Proven proficiency with CI/CD tools like Jenkins or GitLab CI.
  • Proven experience with Red Hat derivatives (CentOS, Scientific Linux, Rocky Linux), Debian, Ubuntu, and large-scale system and configuration management tools (Kickstart, Ansible, Puppet, Chef, Warewulf). Expertise in supporting standard services (NFS, LDAP, SMB, MySQL, Apache/Nginx HTTPD).
  • Strong HPC expertise, including Linux, job schedulers, high-performance interconnects, parallel file systems, cybersecurity, container orchestration, cluster management, VM infrastructure, networking, performance tuning, scientific application support, and data center planning.
  • Proficiency in Python and Bash for building, optimizing, and debugging scientific codes (C, C++, Fortran, Java), including experience with compilers (GCC, Intel), debuggers, Makefiles, and version control (Git, Subversion).
  • Expertise in storage system design and optimization (Lustre, S3, VAST, Weka, Ceph, DDN), including a deep understanding of the storage stack (kernel to user space, including file systems, block storage, I/O schedulers, VFS), storage benchmarking, and performance tuning (throughput, latency, IOPS, workload-specific optimizations).
  • Excellent oral and written communication skills, including experience organizing and presenting customer-focused technical data, reports, and projects to audiences with varying degrees of technical expertise.
  • Strong interpersonal skills including experience with research facilitation and project management in a multidisciplinary team environment.

Desired Qualifications:

  • An Advanced Degree (or equivalent knowledge/training) in Computer Science, Engineering, or a related discipline.
  • Experience with software engineering and/or software development.
  • Familiarity with Kubernetes-related tools like Helm, Istio, and Prometheus.
  • Demonstrated experience supporting research at a National Lab and/or in an academic or research environment.

Additional Information:

  • Application Deadline: For full consideration, please apply with a resume and a cover letter describing your interest by December 5, 2025.
  • Appointment type: This is a full-time, career appointment, exempt (monthly paid) from overtime pay.
  • Salary Information: This position is expected to pay $178,644 - $218,364 annually, which fits within the full salary range of $158,808 - $267,996 annually for job code C70.4. It is not typical for an individual to be offered a salary at or near the top of the range for a position. Salary for this position will be commensurate with the final candidate's qualifications and experience, including skills, knowledge, relevant education, and certifications, and will be aligned with the internal peer group.
  • Background Check: This position may be subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.
  • Work Modality: This position is eligible for a hybrid work schedule - a combination of teleworking and performing work on site at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA 94720. Work schedules are dependent on business needs. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab. Starting May 7, a REAL ID or other acceptable form of identification is required to access Berkeley Lab sites.
  • Relocation: This position is eligible for relocation assistance.
  • Work Authorization: Applicants must be legally authorized to work in the United States. Berkeley Lab does not provide visa sponsorship for this position.
  • Want to learn more about working at Berkeley Lab? Please visit: careers.lbl.gov

    Equal Employment Opportunity Employer: The foundation of Berkeley Lab is our Stewardship Values: Team Science, Service, Trust, Innovation, and Respect; and we strive to build community with these shared values and commitments. Berkeley Lab is an Equal Opportunity Employer. We heartily welcome applications from all who could contribute to the Lab's mission of leading scientific discovery, excellence, and professionalism. In support of our rich global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or other protected categories under State and Federal law.

    Berkeley Lab is a University of California employer. It is the policy of the University of California to undertake affirmative action and anti-discrimination efforts, consistent with its obligations as a Federal and State contractor.

    Misconduct Disclosure Requirement: As a condition of employment, the finalist will be required to disclose if they are subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct, are currently being investigated for misconduct, left a position during an investigation for alleged misconduct, or have filed an appeal with a previous employer.
