Search jobs > San Francisco, CA > Data engineer

Scientific Data Engineer

Lawrence Berkeley Lab
San Francisco, CA, United States
$7.7K-$9.4K a month
Full-time

Lawrence Berkeley National Lab’s (LBNL) Scientific Data Division - Computing Sciences Research Division has an opening for a Scientific Data Engineer to join the team.

The Computational Biosciences Group (cbg.lbl.gov) has an immediate opening for a Software and Data Engineer in the area of Data Management and Analysis with applications to bioscience research.

The position is in support of the Neurodata Without Borders project, which develops methods for standardization, sharing, and analysis of neurophysiology data.

The technologies developed will have broader applications to other science domains such as genomics, protein design, and other biological data, providing great opportunities for career growth.

The overall objectives for this position are to develop new methods and software tools that enable scientific knowledge discovery using modern data management and machine learning technologies and advance the state-of-the-art in data-intensive analysis.

This position will be part of an experienced team conducting R&D in the areas of FAIR data science and modern methods for data understanding.

The candidate will be working as part of a multi-disciplinary team composed of computer scientists, data scientists, neuroscientists, and other domain biologists.

What you will do (if hired at a level 1) :

Design and develop user-friendly software packages for scientific data analysis and management.

Develop machine learning solutions for analysis of biological data in close collaboration with diverse teams of scientists.

Work closely with the community of developers of the Neurodata Without Borders () and LinkML () open-source data ecosystems.

Maintain and manage open-source software products, including managing development priorities, software releases, continuous integration, and testing.

Design, implement and maintain software tools for creating and running parallel data intensive analysis workflows.

Train scientists and research software engineers in the use of the developed software products at workshops and conferences.

In addition to the above (if hired at a level 2) :

Design, implement and maintain high performance computing and cloud solutions for visualization and analysis of complex biological data.

Work independently on and resolve problems of moderate scope where analysis of situations or data requires a review of a variety of factors.

Exercise judgment within defined procedures and practices to determine appropriate actions to address software and research issues.

Build productive internal / external working relationships and contribute to community working groups.

Contribute to the development of project priorities and reporting.

What is Required (if hired at a level 1) :

Bachelor’s degree in computer science, data science, neuroscience, machine learning, bioinformatics, or equivalent, and a minimum of 2 years of related experience designing and developing software for data modeling or analysis.

Demonstrated capability with programming languages, such as Python, C++, or Javascript.

Demonstrated capability with the Git version control and continuous integration systems, such as GitHub or GitLab.

Experience developing complex software solutions.

Experience developing and contributing to community-driven open-source software.

Demonstrated experience in one or more of the following areas : machine learning, data management, scientific data analysis.

Works well in a collaborative team environment.

Ability to troubleshoot and solve problems of moderate scope where analysis of situations or data requires a review of a variety of factors.

Excellent oral and written communication skills.

Demonstrated ability to work effectively as part of a cross-disciplinary team.

Additional Requirements (if hired at a level 2) :

Typically requires a minimum of 5 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or equivalent experience.

Experience testing large code bases.

Demonstrated capability in managing complex software development efforts.

Demonstrated ability to work independently and develop scientific data solutions.

Additional Desired Qualifications :

Experience working with modern scientific data formats and database systems, such as HDF5, Zarr, MongoDB, PostgreSQL, MySQL, and Redis.

Experience with Neurodata Without Borders, LinkML, or similar software ecosystems.

Experience working with large biological data, such as in the areas of neurophysiology, microbiology, genomics, or protein design.

Experience working with modern parallel computer technologies, such as Cloud, High-Performance Computing, containerization, and parallel programming environments (MPI, OpenMP, CUDA, or pthreads).

Experience developing web-based graphical user interfaces (GUIs) or application programming interfaces (APIs) for scientific data analysis and management.

For full consideration, please apply by August 25, 2023.

Want to learn more about Berkeley Lab's Culture, Benefits and answers to FAQs? Please visit :

Notes :

This is a full-time 2 year, term appointment with the possibility of extension or conversion to Career appointment based upon satisfactory job performance, continuing availability of funds and ongoing operational needs.

If hired at a level 1 - The full salary range of this position is between $6,871 to $11,596 per month and is expected to pay between a targeted range of $7,730 to $9,449 per month depending upon candidates' full skills, knowledge, and abilities, including education, certifications, and years of experience.

If hired at a level 2 - The full salary range of this position is between $8,658 to $14,610 per month and is expected to pay between a targeted range of $9,739 to $11,905 per month depending upon candidates' full skills, knowledge, and abilities, including education, certifications, and years of experience.

This position may be subject to a background check. Any convictions will be evaluated to determine if they directly relate to the responsibilities and requirements of the position.

Having a conviction history will not automatically disqualify an applicant from being considered for employment.

This position is eligible for a hybrid work schedule - a combination of teleworking and performing work on site at Lawrence Berkeley National Lab, 1 Cyclotron Road, Berkeley, CA.

Work schedules are dependent on business needs. Individuals working a hybrid schedule must reside within 150 miles of Berkeley Lab.

Berkeley Lab is committed to Inclusion, Diversity, Equity and Accountability (IDEA) and strives to continue building community with these shared values and commitments.

Berkeley Lab is an Equal Opportunity and Affirmative Action Employer. We heartily welcome applications from women, minorities, veterans, and all who would contribute to the Lab's mission of leading scientific discovery, inclusion, and professionalism.

In support of our diverse global community, all qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or protected veteran status.

Equal Opportunity and IDEA Information Links : Know your rights, click here for the supplement : Equal Employment Opportunity is the Law and the Pay Transparency Nondiscrimination Provision under 41 CFR 60-1.4.

21 days ago
Related jobs
Promoted
Lawrence Berkeley Lab
San Francisco, California

Lawrence Berkeley National Lab’s (LBNL) Scientific Data Division - Computing Sciences Research Division has an opening for a Scientific Data Engineer to join the team. Software and Data Engineer in the area of Data Management and Analysis with applications to bioscience research. The overall objecti...

Promoted
Pinterest
San Francisco, California

We are looking for a Senior Data Scientist to join the Growth org. You will collaborate on a wide array of product and business problems with a diverse set of cross-functional partners across Product, Engineering, Design, Research, Product Analytics, Data Engineering and others. Proven ability to ap...

Promoted
University of California-Berkeley
Berkeley, California

The College is responsible for growing Berkeley's broad-based undergraduate programs in data science, computing, statistics and other interdisciplinary programs, including classes and programs serving thousands of undergraduate students a year. The Internships & Career Services Programs Manager will...

Promoted
InsideHigherEd
San Francisco, California

Research Data Analyst 2 (6256U), Haas School of Business - 67983. UC Berkeley's Haas School of Business offers a unique opportunity to champion new ideas, collaborate across boundaries, and continually learn in a workplace committed to increasing diversity and creating a welcoming environment for al...

Promoted
University of California - San Francisco
San Francisco, California

The Business Systems Analyst (BSA) position focuses on system administration technical support for transactions in workflow and error queue monitoring and resolution, exploring and evaluating existing or new system features, enhancements and functionality with a goal to improve customer experience w...

Promoted
Clari Inc.
San Francisco, California
Remote

Query Manager is a part of Clari's Data Platform team, and is the interface that allows application and API developers to easily and efficiently retrieve data across hundreds of databases and billions of rows of data that comprise our ever-evolving Data Platform. Learn and contribute to all aspects ...

Promoted
Ivy Exec
San Francisco, California

Market research studies are paid engagements Ivy Exec conducts with clients to get feedback on certain topics. VP of AI, SVP of AI, VP of Data Analytics, SVP of Data Analytics, Data Scientist, Chief Technology Officer, Chief Data Officer, Chief Digital Officer, Head of AI, Head of Data Analytics, IT...

Promoted
Woodard & Curran
CA, United States

We currently have an opportunity in California for a Senior Water Resources Project Manager with interest and experience in the areas of hydrology, supply, and demand forecasting and management, supply vulnerability and risk analyses, water systems modeling, integrated planning, and decision support...

Promoted
Harbour
CA, United States

As a senior software engineer on the team, you have the opportunity to truly understand the needs of customers and work with our team to design and build key features that empower and delight our users. Harbour loves taking complex workflows and turning them into simple and beautiful user experience...

Promoted
Source Technology
CA, United States

Do you have data science experience and are you seeking a new job in Bay Area? Our client is helping a leading tech company find a full-time Lead Data Scientist, and the role comes with an excellent salary and benefits package. As a Lead Data Scientist, you will work with large, complex data sets an...