Software Engineer, Distributed Systems
OpenAI • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About the Team

The Compute Runtime team builds the low-level framework components that power our ML training systems. We build robust, scalable, high-performance components to support our distributed training workloads. Our priorities are to maximize the productivity of our researchers and our hardware, with the goal of accelerating progress towards AGI.

About the Role

As a Distributed Systems engineer, you will work to deliver powerful APIs orchestrating thousands of computers moving and persisting vast amounts of data. This requires providing easy-to-use, introspectable systems that promote a fast debugging and development cycle, while enabling that experience to scale to our newest supercomputers and maintaining stability and performance throughout.

We're looking for people who love optimizing an end-to-end system and who understand high-performance I/O well enough to maximize performance both locally and when distributed across our supercomputers. We want someone excited by the rapid pace of responding to the dynamic and evolving needs of our training systems architectures.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Work across our Python and Rust stack
  • Profile, optimize, and help design our compute and data capabilities for scale
  • Work on deploying our training framework to our latest supercomputers, rapidly responding to the changing shapes and needs of the ML systems

You might thrive in this role if you:

  • Have worked on large distributed systems
  • Love figuring out how systems work and continuously come up with ideas for how to make them faster while minimizing complexity and maintenance burden
  • Have strong software engineering skills and are proficient in Python and Rust or equivalent

About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

    For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

    Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

    To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
