Talent.com
Senior Linux Infrastructure Engineer (HPC)

Senior Linux Infrastructure Engineer (HPC)

The Voleon GroupSan Francisco, CA, US
30+ days ago
Job type
  • Full-time
Job description

Join to apply for the Senior Linux Infrastructure Engineer (HPC) role at The Voleon Group

5 days ago Be among the first 25 applicants

Join to apply for the Senior Linux Infrastructure Engineer (HPC) role at The Voleon Group

Get AI-powered advice on this job and more exclusive features.

Direct message the job poster from The Voleon Group

Voleon is a technology company that applies state-of-the-art machine-learning techniques to real-world problems in finance. For more than a decade, we have led our industry and worked at the frontier of applying machine learning to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future.

As a Senior HPC Infrastructure Engineer, you'll design, optimize, and maintain HPC clusters, storage, and systems. You'll ensure high-performance computing resources remain efficient, secure, and reliable. The ideal candidate will have deep expertise in Linux environments, scripting, automation, and troubleshooting complex system issues. You will play a key role in maintaining system uptime, implementing security best practices, optimizing performance, and supporting a growing, dynamic team of software, research, and systems engineers.

Your Team

We look for brilliant people with a passion for solving problems through innovation and engineering fundamentals. You'll work in a collaborative environment that encourages creative thinking and efficient implementation. We embrace experimentation. You'll work alongside experienced engineers recruited from leading technology companies and universities. You and your team will collaborate closely with top machine learning researchers.

We seek to hire someone in the following locations : Sacramento, Berkeley, or New York City.

Responsibilities

  • Manage HPC clusters, scheduling, and resource optimization (SLURM, PBS).
  • Configure and optimize distributed filesystems (Ceph, Lustre, NFS).
  • Diagnose and resolve complex Linux system issues and bottlenecks.
  • Develop automation / scripts for HPC cluster management.
  • Perform performance tuning (CPU, memory, networking, storage).
  • Participate in an on-call rotation.

Requirements

  • 5+ years of Linux systems expertise (kernel tuning, troubleshooting).
  • Experience managing distributed filesystems (Ceph, Lustre, NFS).
  • Scripting proficiency (Bash / Python) for automation and management.
  • Strong performance tuning and troubleshooting skills.
  • Networking fundamentals (high-speed interconnects, TCP / IP tuning).
  • Preferred (Nice-to-Have)

  • Familiarity with containerization (Docker, Singularity).
  • Experience with hybrid / cloud HPC environments (OpenStack, AWS).
  • Exposure to DevOps automation tools (Terraform, CI / CD).
  • Background in scientific / research computing or engineering workloads.
  • Compensation

    The base salary for this position is $170,000 to $205,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation, such as bonus compensation and other benefits. Our benefits package includes medical, dental, and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match.

    Friends of Voleon" Candidate Referral Program

    If you have a great candidate in mind for this role and would like to have the potential to earn $15,000 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms and conditions please make sure to review the Voleon Referral Bonus Program.

    Equal Opportunity Employer

    The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

    The Voleon Group has implemented a policy requiring all employees who will be entering our worksite, including new hires, to be fully vaccinated with the COVID-19 vaccine. This policy also applies to remote employees, as such employees will be asked to visit our offices from time to time. To the extent permitted by applicable law, proof of vaccination will be required as a condition of employment. This policy is part of Voleon's ongoing efforts to ensure the safety and well-being of our employees and community, and to support public health efforts.

    Seniority level

    Seniority level

    Mid-Senior level

    Employment type

    Employment type

    Full-time

    Job function

    Job function

    Information Technology

    Referrals increase your chances of interviewing at The Voleon Group by 2x

    Inferred from the description for this job

    Medical insurance

    Vision insurance

    401(k)

    Paid maternity leave

    Disability insurance

    Paid paternity leave

    Get notified when a new job is posted.

    Sign in to set job alerts for "Senior Infrastructure Engineer" roles.

    Systems Administrator / Security Engineer - Windows

    Mountain View, CA $70.00-$80.00 1 week ago

    Systems Administrator / Security Engineer - Windows

    Systems Administrator / Security Engineer - Windows

    San Mateo County, CA $70.00-$80.00 1 week ago

    Santa Clara, CA $115,000.00-$130,000.00 15 hours ago

    San Francisco, CA $168,480.00-$243,360.00 3 weeks ago

    San Francisco, CA $121,505.00-$175,507.00 3 weeks ago

    St Helena, CA $130,000.00-$150,000.00 1 day ago

    Senior) Datacenter System Administrator

    Fremont, CA $100,000.00-$185,000.00 2 weeks ago

    San Francisco, CA $101,920.00-$131,040.00 3 weeks ago

    Staff Software Engineer, Audio / Video Infrastructure

    Senior System / Network Engineer (Windows-Linux-ProxMox)

    Newark, CA $50,000.00-$70,000.00 20 hours ago

    Fremont, CA $130,000.00-$140,000.00 18 hours ago

    San Francisco, CA $131,040.00-$189,280.00 3 weeks ago

    Senior Cloud System Administrator / Job Req 788838227

    San Francisco, CA $94,504.11-$121,505.28 3 weeks ago

    Fremont, CA $90,000.00-$140,000.00 1 week ago

    We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

    J-18808-Ljbffr

    Create a job alert for this search

    Senior Infrastructure Engineer • San Francisco, CA, US