Talent.com
No longer accepting applications
Research Engineer, Training Infrastructure Lead

Research Engineer, Training Infrastructure Lead

GoodfireSan Francisco, CA, United States
1 day ago
Job type
  • Full-time
Job description

Research Engineer, Training Infrastructure Lead About Goodfire

Behind our name : Like fire, AI holds the potential for both immense benefit and significant risk. Just as mastering fire transformed human history, we believe the safe and intentional development of AI will shape the future of our species. Our goal is to tame this new fire.

Goodfire is an AI interpretability research company focused on understanding and intentionally designing advanced AI systems. We believe advances in interpretability will unlock the next frontier of safe and powerful foundation models and that deep research breakthroughs are necessary to make this possible.

Everything we do is in service of that mission. We move fast, take ownership, and constantly push to improve. We believe in acting today rather than tomorrow. We care deeply about the success of the organization and put the team above ourselves.

Goodfire is a public benefit corporation headquartered in San Francisco with a team of the world’s top interpretability researchers and engineers from organizations like OpenAI and DeepMind. We’ve raised $57M from investors like Menlo, Lightspeed and Anthropic and work with customers including Arc Institute, Mayo Clinic, and Rakuten.

The role :

We're seeking a senior engineering leader to own and evolve research platform and training infrastructure. You'll define both the technical vision and the implementation strategy for the systems that power our research breakthroughs.

Key responsibilities :

  • Design and build customizable training pipelines that scale from experimentation to production
  • Architect and implement large-scale model serving infrastructure for interpretability (reference : NDIF , Garcon )
  • Identify and execute on opportunities to dramatically accelerate research velocity
  • Lead technical decision-making for infrastructure that supports cutting-edge AI research

Who you are :

Goodfire is looking for experienced individuals who embody our values and share our deep commitment to making interpretability accessible. We care deeply about building a team who shares our values :

Put mission and team first

All we do is in service of our mission. We trust each other, deeply care about the success of the organization, and choose to put our team above ourselves.

Improve constantly

We are constantly looking to improve every piece of the business. We proactively critique ourselves and others in a kind and thoughtful way that translates to practical improvements in the organization. We are pragmatic and consistently implement the obvious fixes that work.

Take ownership and initiative

There are no bystanders here. We proactively identify problems and take full responsibility over getting a strong result. We are self-driven, own our mistakes, and feel deep responsibility over what we’re building.

Action today

We have a small amount of time to do something incredibly hard and meaningful. The pace and intensity of the organization is high. If we can take action today or tomorrow, we will choose to do it today.

What we are looking for : Required experience :

  • 5+ years of experience in ML infrastructure, research engineering, and / or systems programming
  • Leadership experience as senior architect, tech lead, and / or engineering manager
  • Cross-functional expertise bridging research and engineering domains
  • Technical proficiency in Python, PyTorch / JAX, and distributed systems
  • Production experience deploying and maintaining ML systems at scale
  • Mission alignment with advancing AI safety and interpretability
  • Core competencies :

  • High-ownership leadership
  • Owns broad areas with autonomy, driving architectural and strategic decisions even amid uncertainty

  • Balances technical depth with speed, adapting as priorities evolve
  • Research-to-production mindset
  • Bridges fast research iteration with reliable, scalable production systems

  • Designs abstractions that preserve flexibility while ensuring robustness
  • Deep experience in Python, PyTorch, and large-scale training strategies
  • Hands-on with end-to-end ML infrastructure : from experiments to serving
  • Strong track record of scaling systems and debugging complex runs
  • Preferred qualifications
  • Contributions to open-source ML infrastructure projects

  • Experience in fast-paced startup or research lab environments
  • This role offers market competitive salary, equity, and competitive benefits. More importantly, you'll have the opportunity to work on groundbreaking technology with a world-class team on the critical path to ensuring a safe and beneficial future for humanity.

    The expected salary range for this position is $200,000 - $400,000 USD.

    Create a Job Alert

    Interested in building your career at Goodfire? Get future opportunities sent straight to your email.

    Apply for this job

    indicates a required field

    First Name

    Last Name

    Email

    Phone

    Resume / CV

    Enter manually

    Accepted file types : pdf, doc, docx, txt, rtf

    Enter manually

    Accepted file types : pdf, doc, docx, txt, rtf

    #J-18808-Ljbffr

    Create a job alert for this search

    Infrastructure Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    Senior Infrastructure Security Engineer

    Senior Infrastructure Security Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior Infrastructure Security Engineer - DGX Cloud.Key Responsibilities Implement, manage, and troubleshoot firewalls within on-premise and cloud network infrastructur...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    AI Research Engineer

    AI Research Engineer

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for an AI Research Engineer specializing in LLM orchestration and prompting.Key Responsibilities Build LLM-powered software by designing prompt flows and orchestrations for o...Show moreLast updated: 20 hours ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for a Principal Site Reliability Engineer.Key Responsibilities Lead project work to build and maintain platform features for reliability and cloud infrastructure Mentor serv...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer, ML Infrastructure - Training Platform

    Software Engineer, ML Infrastructure - Training Platform

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale is looking for an AI / ML Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers ...Show moreLast updated: 30+ days ago
    • Promoted
    Machine Learning Data Engineer - Systems & Retrieval

    Machine Learning Data Engineer - Systems & Retrieval

    ZyphraPalo Alto, CA, US
    Full-time
    Machine Learning Data Engineer - Systems & Retrieval.This includes designing high-performance pipelines for collecting, transforming, indexing, and serving massive, heterogeneous datasets from ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Deep Learning Engineer

    Senior Deep Learning Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior Deep Learning Software Engineer - Autonomous Vehicles.Key Responsibilities Train, fine-tune, optimize, and customize perception DNNs in low precision (FP16 / INT8)...Show moreLast updated: 30+ days ago
    Research Scientist / Engineer – Training Infrastructure

    Research Scientist / Engineer – Training Infrastructure

    IntelliPro Group Inc.Palo Alto, CA, US
    Full-time
    Quick Apply
    Research Scientist / Engineer – Training Infrastructure Position Type : Full time Location : Palo Alto, CA • Remote - US • Remote - International Salary Range : $220,000 - $300...Show moreLast updated: 11 days ago
    • Promoted
    Expert GIS DevSecOps Infrastructure Lead - Elevate

    Expert GIS DevSecOps Infrastructure Lead - Elevate

    Pacific Gas and Electric CompanyOakland, CA, United States
    Full-time
    Job Category : Engineering / Science.Job Level : Individual Contributor.Business Unit : Information Technology.PG&E IT is a unified organization composed of various departments which collaborate effec...Show moreLast updated: 4 days ago
    • Promoted
    Cyberinfrastructure Facilitator

    Cyberinfrastructure Facilitator

    VirtualVocationsConcord, California, United States
    Full-time
    A company is looking for a Cyberinfrastructure Facilitator, Remote.Key Responsibilities Forge strategic partnerships with researchers, educators, and IT teams to enhance CI capabilities Design a...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Automation Infrastructure Lead

    Automation Infrastructure Lead

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for an Automation Infrastructure Lead to manage a team responsible for OT Networks and data application systems in the Biopharmaceutical Operations Facility.Key Responsibilitie...Show moreLast updated: 18 hours ago
    • Promoted
    ML Research Engineer, ML Systems

    ML Research Engineer, ML Systems

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference. The platform has been powering MLEs, researchers, data scientists and opera...Show moreLast updated: 30+ days ago
    • Promoted
    AI Infrastructure Engineer, ML Data Platform

    AI Infrastructure Engineer, ML Data Platform

    Scale AI, Inc.San Francisco, CA, United States
    Full-time
    Scale's AI Infrastructure team supports both R&D and applied Generative AI initiatives, driving breakthroughs in areas of post-training research such as AI safety, agents, and evaluating state-of-t...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Research Engineer

    Senior Research Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior Research Engineer - Multimodal & Video Foundation Model.Key Responsibilities Pioneer multimodal and video-centric research, contributing to usable prototypes and...Show moreLast updated: 9 days ago
    • Promoted
    Senior MLOps Engineer

    Senior MLOps Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Senior MLOps Engineer to design and scale infrastructure for AI research and product development. Key Responsibilities Identify and resolve infrastructure and software b...Show moreLast updated: 30+ days ago
    • Promoted
    Research Engineer - Distributed Training

    Research Engineer - Distributed Training

    Prime IntellectSan Francisco, CA, United States
    Full-time
    At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models.Our ultimate goal? ...Show moreLast updated: 30+ days ago
    • Promoted
    Lecturer Pool-Landscape Architecture and Environmental Planning

    Lecturer Pool-Landscape Architecture and Environmental Planning

    InsideHigherEdBerkeley, California, United States
    Full-time +1
    Lecturer Pool-Landscape Architecture and Environmental Planning.The posted UC academic salary scales set the minimum pay at appointment. See the following table for the salary scale for this positio...Show moreLast updated: 21 days ago
    • Promoted
    Principal Threat Intelligence Engineer

    Principal Threat Intelligence Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Principal Threat Intelligence Engineer.Key Responsibilities Oversee the entire Threat Intelligence Lifecycle including requirements, collection, processing, analysis, d...Show moreLast updated: 1 day ago
    • Promoted
    Staff Engineer, Ads Infrastructure

    Staff Engineer, Ads Infrastructure

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Staff Engineer, Ads Development Infra.Key Responsibilities Lead the evolution of the Ads tech stack to enhance scalability and performance Collaborate with engineers t...Show moreLast updated: 2 days ago
    • Promoted
    • New!
    Terraform and IaC Engineer

    Terraform and IaC Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Terraform and IaC Engineer to support a migration project.Key Responsibilities Design, author, and maintain Terraform modules / stacks for various AWS constructs and serv...Show moreLast updated: 14 hours ago
    • Promoted
    Research Scientist / Engineer - Training Infrastructure

    Research Scientist / Engineer - Training Infrastructure

    IntelliPro Group Inc.Palo Alto, CA, US
    Full-time
    Research Scientist / Engineer – Training Infrastructure.Palo Alto, CA • Remote - US • Remote - International.We believe that multimodality is critical for intelligence.To go beyond ...Show moreLast updated: 2 days ago