Talent.com
Software Engineer, Data Infrastructure

Software Engineer, Data Infrastructure

OpenAISan Francisco, CA, United States
17 hours ago
Job type
  • Full-time
Job description

About the Team

Data Platform at OpenAI owns the foundational data stack powering critical product, research, and analytics workflows. We operate some of the largest Spark compute fleets in production; design, and build data lakes and metadata systems on Iceberg and Delta with a vision toward exabyte-scale architecture; run high throughput streaming platforms on Kafka and Flink; provide orchestration with Airflow; and support ML feature engineering tooling such as Chronon. Our mission is to deliver reliable, secure, and efficient data access at scale and accelerate intelligent, AI assisted data workflows.

Join us to build and operate these core platforms that underpin OpenAI products, research, and analytics.

We're not just scaling infrastructure - we're redefining how people interact with data. Our vision includes intelligent interfaces and AI-assisted workflows that make working with data faster, more reliable, and more intuitive.

About the Role

This role focuses on building and operating data infrastructure that supports massive compute fleets and storage systems, designed for high performance and scalability. You'll help design, build, and operate the next generation of data infrastructure at OpenAI. You will scale and harden big data compute and storage platforms, build and support high-throughput streaming systems, build and operate low latency data ingestions, enable secure and governed data access for ML and analytics, and design for reliability and performance at extreme scale.

You will take full lifecycle ownership : architecture, implementation, production operations, and on-call participation.

You've supported Spark, Kafka, Flink, Airflow, Trino, or Iceberg as platforms. You're well-versed in infrastructure tooling like Terraform, experienced in debugging large-scale distributed systems, and excited about solving data infrastructure problems in the AI space.

This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will :

  • Design, build, and maintain data infrastructure systems such as distributed compute, data orchestration, distributed storage, streaming infrastructure, machine learning infrastructure while ensuring scalability, reliability, and security
  • Ensure our data platform can scale by orders of magnitude while remaining reliable and efficient
  • Accelerate company productivity by empowering your fellow engineers & teammates with excellent data tooling and systems
  • Collaborate with product, research and analytics teams to build the technical foundations capabilities that unlock new features and experiences
  • Own the reliability of the systems you build, including participation in an on-call rotation for critical incidents

You might thrive in this role if you :

  • Have 4+ years in data infrastructure engineering OR
  • Have 4+ years in infrastructure engineering with a strong interest in data
  • Take pride in building and operating scalable, reliable, secure systems
  • Are comfortable with ambiguity and rapid change
  • Have an intrinsic desire to learn and fill in missing skills, and an equally strong talent for sharing learnings clearly and concisely with others
  • This role is exclusively based in our San Francisco HQ. We offer relocation assistance to new employees.

    About OpenAI

    OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

    We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

    For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

    Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers : we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment : protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

    To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

    We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

    OpenAI Global Applicant Privacy Policy

    At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

    Create a job alert for this search

    Software Engineer Infrastructure • San Francisco, CA, United States

    Related jobs
    • Promoted
    • New!
    MTS, Data Infrastructure Engineer

    MTS, Data Infrastructure Engineer

    DelphinaSan Francisco, CA, United States
    Full-time
    Today's Data Scientists are in pain - spending their time manually wrangling data, building models through slow trial and error, taking on painstaking rewrites for deployment, and dealing with coun...Show moreLast updated: 17 hours ago
    • Promoted
    Software Engineer - Data Infrastructure (Pretraining Data)

    Software Engineer - Data Infrastructure (Pretraining Data)

    XaiSan Francisco, CA, United States
    Full-time
    Software Engineer - Data Infrastructure (Pretraining Data).AIs mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.Our team is s...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Data Infrastructure Engineer

    Data Infrastructure Engineer

    zaimlerSan Mateo, CA, United States
    Full-time
    We're creating the foundation for AI systems that don't just generate, but retrieve, link, and reason over enterprise knowledge. In just over a year, we've begun partnering with Fortune 500 design p...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Software Engineer II, Data Engineering & Infrastructure

    Software Engineer II, Data Engineering & Infrastructure

    Aurora COSan Francisco, CA, United States
    Full-time
    The Aurora Driver will create a new era in mobility and logistics, one that will bring a safer, more efficient, and more accessible future to everyone. At Aurora, you will tackle massively complex p...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Senior Software Engineer, Data Infrastructure (RDBMS)

    Senior Software Engineer, Data Infrastructure (RDBMS)

    TRM LabsSan Francisco, CA, United States
    Full-time
    Senior or Staff Software Engineer, Database Engineer.Senior or Staff Software Engineer, Database Engineer.TRM Labs is a blockchain intelligence company committed to fighting crime and creating a sa...Show moreLast updated: 17 hours ago
    • Promoted
    Software Engineer, Data Infrastructure

    Software Engineer, Data Infrastructure

    datologyaiRedwood City, CA, United States
    Full-time
    Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Software Engineer - Data Infrastructure in San Francisco

    Senior Software Engineer - Data Infrastructure in San Francisco

    Energy Jobline ZRSan Francisco, CA, United States
    Full-time
    Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub.We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy ...Show moreLast updated: 17 hours ago
    • Promoted
    • New!
    Software Engineer, Data Infrastructure

    Software Engineer, Data Infrastructure

    FigmaSan Francisco, CA, United States
    Full-time
    Figma is growing our team of passionate creatives and builders on a mission to make design accessible to all.Figma's platform helps teams bring ideas to life-whether you're brainstorming, creating ...Show moreLast updated: 17 hours ago
    • Promoted
    Software Engineer, Data Infrastructure - Research

    Software Engineer, Data Infrastructure - Research

    OpenAISan Francisco, CA, United States
    Full-time
    The Workload team is responsible for designing and running OpenAI's LLM training and inference infrastructure that powers frontier models at massive scale. Our systems unify how researchers train an...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Staff+ Software Engineer - Data Infrastructure

    Staff+ Software Engineer - Data Infrastructure

    AnthropicSan Francisco, CA, United States
    Full-time
    Anthropic's mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Show moreLast updated: 17 hours ago
    • Promoted
    Senior Software Engineer, Data Infrastructure

    Senior Software Engineer, Data Infrastructure

    LMArenaSan Francisco, CA, United States
    Full-time
    Software Engineer, Data Infrastructure at LMArena.LMArena is seeking a Software Engineer to join our team and build the data infrastructure that powers real-world AI evaluation.You'll play a crucia...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Software Engineer - Business Systems Data Infrastructure

    Senior Software Engineer - Business Systems Data Infrastructure

    VerkadaSan Mateo, CA, United States
    Full-time
    Designed with simplicity in mind, Verkada's six product lines - video security cameras, access control, environmental sensors, alarms, workplace, and intercoms - provide unparalleled building secur...Show moreLast updated: 17 hours ago
    • Promoted
    Software Engineer, Data Infrastructure

    Software Engineer, Data Infrastructure

    Thinking Machines LabSan Francisco, CA, United States
    Full-time
    Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence.We're building a future where everyone has access to the knowledge and tools to make AI w...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Software Engineer - Data Infrastructure

    Senior Software Engineer - Data Infrastructure

    PlaidSan Francisco, CA, United States
    Full-time
    Senior Software Engineer - Data Infrastructure.Making data driven decisions is key to Plaid's culture.To support that, we need to scale our data systems while maintaining correct and complete data....Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer, Data Infrastructure & Acquisition - San Francisco, USA

    Software Engineer, Data Infrastructure & Acquisition - San Francisco, USA

    SpeechifySan Francisco, CA, United States
    Full-time
    PLEASE APPLY THROUGH THIS LINK : .The mission of Speechify is to make sure that reading is never a barrier to learning.Over 50 million people use Speechify's text-to-speech products to turn whatever ...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer II, Data Engineering & Infrastructure

    Software Engineer II, Data Engineering & Infrastructure

    Aurora InnovationSan Francisco, CA, United States
    Full-time
    Aurora's mission is to deliver the benefits of self-driving technology safely, quickly, and broadly.The Aurora Driver will create a new era in mobility and logistics, one that will bring a safer, m...Show moreLast updated: 30+ days ago
    • Promoted
    Software Engineer, Infrastructure & Data

    Software Engineer, Infrastructure & Data

    LIGHTFIELD INCSan Francisco, CA, United States
    Full-time
    Lightfield is a next-generation CRM that automatically captures customer interactions like emails, meetings, and support tickets and organizes them into structured CRM data, enabling deep analysis ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Software Engineer - Data Infrastructure (Pretraining Data)San Francisco & Palo Alto, CA

    Software Engineer - Data Infrastructure (Pretraining Data)San Francisco & Palo Alto, CA

    XaiSan Francisco, CA, United States
    Full-time
    Software Engineer - Data Infrastructure.AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motiv...Show moreLast updated: 17 hours ago