Talent.com
HPC/AI Data Performance Engineer
Lawrence Berkeley National Laboratory • Berkeley, CA, United States
18 days ago
Job type
  • Full-time
  • Permanent
Job description

In this exciting role, you will serve as a Data Performance Engineer in NERSC's Application Performance Group, architecting HPC and AI data services that advance fundamental science. You'll optimize storage systems for Doudna, NERSC's next supercomputer, and develop I/O and data management solutions for the American Science Cloud. Working with lab, academic, and industry partners, you will help design and deploy advanced storage and data management solutions, support I/O needs for HPC and AI applications, and collaborate with scientists to optimize data workflows for their research.

The selected candidate(s) will be hired at the Computer Systems Engineer 3 or 4 (CSE3 or CSE4) level, depending on skills and experience.

What you'll do, at Level 3:

  • Develop AI storage and I/O services on NERSC's advanced computing and data systems to support fundamental science.

  • Support storage and I/O software on NERSC supercomputers, and deploy new cutting-edge tools and frameworks for scalable scientific workflows.

  • Provide expert I/O engineering engagement and training events to scientists and users of NERSC computing resources.

  • Engage with the AI and HPC storage community to stay on top of the latest advancements in services and software.

  • Shape future NERSC supercomputers, evaluating new storage systems for AI and HPC.

  • Collaborate with scientists and industry partners to enable transformative science.

  • Determine methods and procedures on new assignments, and potentially coordinate activities of other personnel.

  • Network with key contacts outside your own area of expertise.

  • Work on and resolve complex issues where analysis of situations or data requires an in-depth evaluation of variable factors.

  • Exercise judgment in selecting methods, techniques, and evaluation criteria for obtaining results.

In addition to above, at Level 4:

  • Mentor early-career staff members in computing services, techniques, and projects.

  • Stay abreast of new and emerging trends in Storage, I/O and AI, through collaborations, workshops, and conferences; translate these new directions into actionable opportunities for NERSC or NERSC users.

  • Develop strategy for addressing both the performance and the productivity requirements of the science community.

  • Work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.

  • Exercise independent judgment in methods, techniques, and evaluation criteria for obtaining results.

Requirements, at Level 3:

  • Bachelor's degree in Physical Sciences, Computer Science, or a related field, or equivalent, is required. Master's and PhD degrees in similar disciplines are preferred.

  • Typically requires a minimum of 8 years of related experience with a Bachelor's degree; or 6 years and a Master's degree; or equivalent experience.

  • Wide-ranging experience in the areas of data management, storage and I/O as applied to scientific data.

  • Ability to troubleshoot and resolve complex issues in creative and effective ways.

  • Ability to network and collaborate with key contacts outside your own area of expertise.

  • Excellent oral and written communication skills.

  • Excellent software development skills.

  • Proven ability to work productively both independently and as part of an interdisciplinary team, balancing divergent objectives involving research, code development, software support, and consulting with scientists.

In addition to above, at Level 4:

  • Typically requires a minimum of 12 years of related experience with a Bachelor's degree; or 8 years and a Master's degree; or equivalent experience.

  • Broad expertise and/or unique knowledge in the areas of data management, storage and I/O as applied to scientific data is required.

Desired knowledge/experience:

  • Familiarity with HPC/AI storage architectures and technologies.

  • Experience with I/O optimization of scientific and/or AI workloads.

  • A proven track record of software development in computing, AI, or domain sciences.

  • Familiarity with computing hardware, storage systems, and data management systems.

  • Ability to work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.

We're here for the same mission - to bring science solutions to the world. Join our team and YOU will play a supporting role in our goal to address global challenges! Have a high level of impact and work for an organization associated with 17 Nobel Prizes!

Why join Berkeley Lab?

We invest in our employees by offering a total rewards package you can count on:

  • Exceptional health and retirement benefits, including pension or 401K-style plans

  • Opportunities to grow in your career: Tuition assistance program

  • A culture where you'll belong. We are invested in our teams!

  • In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.

  • Parental bonding leave (for both mothers and fathers)

  • Pet insurance

Additional Information:

  • Appointment Type: This is a full-time career appointment, exempt (monthly paid) from overtime pay.

  • Multi-level Posting: This position will be hired at a level commensurate with the business needs and the skills, knowledge, and abilities of the successful candidate.

  • Salary Range:

    • Level 3: The expected salary for this position is $156,864 - $191,724, which fits into the full salary range of $139,440 - $235,308, depending upon the candidate's skills, knowledge, and abilities, including education, certifications, and years of experience.

    • Level 4: The expected salary for this position is $178,644 - $218,364, which fits into the full salary range of $158,808 - $267,996, depending upon the candidate's skills, knowledge, and abilities, including education, certifications, and years of experience.

  • Background Check: This position is subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position. Having a conviction history will not automatically disqualify an applicant from being considered for employment.

  • Work Modality: This position requires substantial on-site presence, but is eligible for a flexible work mode, and hybrid schedules may be considered. Hybrid work is a combination of performing work on-site at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA and some telework. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab. Work schedules are dependent on business needs. In rare cases, full-time telework or remote work modes may be considered. A REAL ID or other acceptable form of identification is required to access Berkeley Lab sites.

  • Export Control Access: This position will involve access to hardware, commodities, and technical information subject to export control regulations including, but not limited to, the Export Administration Regulations ("EAR") and/or International Traffic in Arms Regulations ("ITAR"). Accordingly, any hiring decision may depend in part on Berkeley Lab's ability to obtain or rely on federal government authorizations as required, if you are not a U.S. citizen, lawful permanent resident of the U.S. ("green card holder"), asylee, refugee, or other qualifying protected individual as defined by 8 U.S.C. 1324b(a)(3).

Want to learn more about working at Berkeley Lab? Please visit: careers.lbl.gov

Equal Employment Opportunity Employer: The foundation of Berkeley Lab is our Stewardship Values: Team Science, Service, Trust, Innovation, and Respect; and we strive to build community with these shared values and commitments. Berkeley Lab is an Equal Opportunity Employer. We heartily welcome applications from all who could contribute to the Lab's mission of leading scientific discovery, excellence, and professionalism. In support of our rich global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, protected veteran status, or other protected categories under State and Federal law.

Misconduct Disclosure Requirement: As a condition of employment, the finalist will be required to disclose if they are subject to any final administrative or judicial decisions within the last seven years determining that they committed any misconduct, are currently being investigated for misconduct, left a position during an investigation for alleged misconduct, or have filed an appeal with a previous employer.
