Talent.com
HPC/AI Data Performance Engineer
Berkeley Lab • Bay Area, California, US
30+ days ago
Job type
  • Full-time
  • Permanent
Job description

In this exciting role, you will serve as a Data Performance Engineer in NERSC’s Application Performance Group, architecting HPC and AI data services that advance fundamental science. You’ll optimize storage systems for Doudna, NERSC’s next supercomputer, and develop I/O and data management solutions for the American Science Cloud. Working with lab, academic, and industry partners, you will help design and deploy advanced storage and data management solutions, support I/O needs for HPC and AI applications, and collaborate with scientists to optimize data workflows for their research.

The selected candidate(s) will be hired at the Computer Systems Engineer 3 or 4 (CSE3 or CSE4) level, depending on skill level and experience.

What you’ll do, at Level 3:

  • Develop AI storage and I/O services on NERSC’s advanced computing and data systems to support fundamental science.

  • Support storage and I/O software on NERSC supercomputers, and deploy new cutting-edge tools and frameworks for scalable scientific workflows.

  • Provide expert I/O engineering engagement and training events to scientists and users of NERSC computing resources.

  • Engage with the AI and HPC storage community to stay on top of the latest advancements in services and software.

  • Shape future NERSC supercomputers, evaluating new storage systems for AI and HPC.

  • Collaborate with scientists and industry partners to enable transformative science.

  • Determine methods and procedures on new assignments and may coordinate activities of other personnel.

  • Network with key contacts outside their own area of expertise.

  • Work on and resolve complex issues where analysis of situations or data requires an in-depth evaluation of variable factors.

  • Exercise judgment in selecting methods, techniques, and evaluation criteria for obtaining results.

In addition to above, at Level 4:

  • Mentor early-career staff members in computing services, techniques, and projects.

  • Stay abreast of new and emerging trends in Storage, I/O and AI, through collaborations, workshops, and conferences; translate these new directions into actionable opportunities for NERSC or NERSC users.

  • Develop strategies for addressing both the performance and the productivity requirements of the science community.

  • Work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.

  • Exercise independent judgment in methods, techniques, and evaluation criteria for obtaining results.

Requirements, at Level 3:

  • Bachelor’s degree in Physical Sciences, Computer Science, or a related field, or equivalent, is required. Master’s and PhD degrees in similar disciplines are preferred.

  • Typically requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or equivalent experience.

  • Wide-ranging experience in the areas of data management, storage and I/O as applied to scientific data.

  • Ability to troubleshoot and resolve complex issues in creative and effective ways.

  • Ability to network and collaborate with key contacts outside their own area of expertise.

  • Excellent oral and written communication skills.

  • Excellent software development skills.

  • Proven ability to work productively both independently and as part of an interdisciplinary team balancing divergent objectives involving research, code development, supporting software and consulting with scientists.

In addition to above, at Level 4:

  • Typically requires a minimum of 12 years of related experience with a Bachelor’s degree; or 8 years and a Master’s degree; or equivalent experience.

  • Broad expertise and/or unique knowledge in the areas of data management, storage and I/O as applied to scientific data is required.

Desired knowledge/experience:

  • Familiarity with HPC/AI storage architectures and technologies.

  • Experience with I/O optimization of scientific and/or AI workloads.

  • A proven track record of software development in computing, AI, or domain sciences.

  • Familiarity with computing hardware, storage systems, and data management systems.

  • Ability to work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.

We’re here for the same mission – to bring science solutions to the world. Join our team and YOU will play a supporting role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!

Why join Berkeley Lab?

We invest in our employees by offering a total rewards package you can count on:

  • Exceptional health and retirement benefits, including pension or 401(k)-style plans.

  • Opportunities to grow in your career.

  • A culture where you’ll belong. We are invested in our teams!

  • In addition to accruing vacation and sick time, we also have a Winter Shutdown every year.

  • Parental bonding leave (for both mothers and fathers)

  • Pet insurance

Additional Information:

  • Appointment Type: This is a full-time career appointment, exempt (monthly paid) from overtime pay.

  • Multi-level Posting: This position will be hired at a level commensurate with the business needs and the skills, knowledge, and abilities of the successful candidate.

  • Salary Range:

    Level 3: The expected salary for this position is $156,864 - $191,724, which fits into the full salary range of $139,440 - $235,308, depending upon the candidate’s skills, knowledge, and abilities, including education, certifications, and years of experience.

    Level 4: The expected salary for this position is $178,644 - $218,364, which fits into the full salary range of $158,808 - $267,996, depending upon the candidate’s skills, knowledge, and abilities, including education, certifications, and years of experience.

  • Background Check: This position is subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.

  • Work Modality: This position requires substantial on-site presence, but is eligible for a flexible work mode, and hybrid schedules may be considered. Hybrid work is a combination of performing work on-site at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA and some telework. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab. Work schedules are dependent on business needs. In rare cases, full-time telework or remote work modes may be considered. A REAL ID or other acceptable form of identification is required to access Berkeley Lab sites.

  • Export Control: This position will involve access to hardware, commodities, and technical information subject to export control regulations including, but not limited to, the Export Administration Regulations ("EAR") and/or International Traffic in Arms Regulations ("ITAR"). Accordingly, if you are not a U.S. citizen, lawful permanent resident of the U.S. (“green card holder”), asylee, refugee, or other qualifying protected individual as defined by 8 U.S.C. 1324b(a)(3), any hiring decision may depend in part on Berkeley Lab’s ability to obtain or rely on federal government authorizations as required.

Equal Employment Opportunity Employer: The foundation of Berkeley Lab is our Stewardship Values: Team Science, Service, Trust, Innovation, and Respect; and we strive to build community with these shared values and commitments. Berkeley Lab is an Equal Opportunity Employer. We heartily welcome applications from all who could contribute to the Lab's mission of leading scientific discovery, excellence, and professionalism. In support of our rich global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or other protected categories under State and Federal law.

Berkeley Lab is a University of California employer. It is the policy of the University of California to undertake affirmative action and anti-discrimination efforts, consistent with its obligations as a Federal and State contractor.

Misconduct Disclosure Requirement: As a condition of employment, the finalist will be required to disclose if they are subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct, are currently being investigated for misconduct, left a position during an investigation for alleged misconduct, or have filed an appeal with a previous employer.
