Software Engineer, Inference - Multi Modal
Join to apply for the Software Engineer, Inference - Multi Modal role at OpenAI .
About the Team
OpenAI’s Inference team powers the deployment of our most advanced models—including our GPT models, 4o Image Generation, and Whisper—across a variety of platforms. Our work ensures these models are available, performant, and scalable in production, and we partner closely with Research to bring the next generation of models into the world. We're a small, fast‑moving team of engineers focused on delivering a world‑class developer experience while pushing the boundaries of what AI can do.
We’re expanding into multimodal inference, building the infrastructure needed to serve models that handle image, audio, and other non‑text modalities. These workloads are inherently more heterogeneous and experimental, involving diverse model sizes and interactions, more complex input / output formats, and tighter coordination with product and research.
About the Role
We’re looking for a software engineer to help us serve OpenAI’s multimodal models at scale. You’ll be part of a small team responsible for building reliable, high‑performance infrastructure for serving real‑time audio, image, and other MM workloads in production. This work is inherently cross‑functional : you’ll collaborate directly with researchers training these models and with product teams defining new modalities of interaction. You'll build and optimize the systems that let users generate speech, understand images, and interact with models in ways far beyond text.
In This Role, You Will
You Might Thrive In This Role If You
Nice to Have
Compensation Range
$325K - $490K
Seniority Level
Mid‑Senior level
Employment Type
Full‑time
Job Function
Engineering and Information Technology
Industries
Research Services
Equal Opportunity Statement
OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI’s Affirmative Action and Equal Employment Opportunity Policy Statement.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US‑based candidates. For unincorporated Los Angeles County workers, we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment : protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non‑public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
#J-18808-Ljbffr
Software Engineer • San Francisco, CA, United States