Talent.com
Sr Software Engineer - AI Infrastructure

Sr Software Engineer - AI Infrastructure

OracleSanta Clara, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Job Description

Job Description

Oracle Cloud Infrastructure (OCI) is looking for a Senior Software Engineer - AI Infrastructure to lead the development of scalable, resilient, and secure infrastructure systems that underpin the core of OCI's compute platform. This role sits within the Host Provisioning Services (HoPS) team, which owns the critical infrastructure responsible for automating the full server lifecycle from rack integration and hardware bring-up to customer-ready instance provisioning and firmware management. HoPS services operate at the intersection of bare metal hardware and full-stack orchestration frameworks. They interface directly with components like BMCs, NICs, SmartNICs, ILOMs, GPUs, and custom firmware stacks. The team builds microservices and tooling that provision, configure, secure, and validate server platforms across OCI's global fleet.

As a Senior Software Engineer, you will design and deliver highly available services and automation pipelines that manage server provisioning at hyperscale, enable firmware pinning for deterministic customer environments, and deliver fleet-wide firmware updates and telemetry-based observability. You'll drive solutions to support new silicon (e.g., NVIDIA, AMD, Intel platforms), SmartNIC / HostNIC convergence, RoT security integration, and the evolution of OCI's infrastructure into next-gen clusters and composable hardware environments.

You will partner closely with teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development to ensure OCI can launch, scale, and maintain new server platforms with minimal operational overhead and high reliability.

This role is ideal for experienced systems engineers with a deep understanding of operating systems, hardware-software integration, distributed services, and cloud-scale automation.shoot and debug software programs for databases, applications, tools, networks etc.

Responsibilities

  • Design, develop, and maintain highly available and scalable microservices for OCI's server provisioning and lifecycle management.
  • Lead automation of the full server lifecycle including rack integration, hardware bring-up, provisioning, and firmware management.
  • Build systems that interface directly with bare metal components such as BMCs, ILOMs, NICs, SmartNICs, and GPUs.
  • Develop automation pipelines for provisioning, firmware validation, and observability across OCI's global fleet.
  • Implement firmware pinning and update mechanisms to support deterministic and secure customer environments.
  • Deliver telemetry-backed monitoring and alerting systems to ensure infrastructure health and visibility.
  • Support onboarding of new hardware platforms, including custom silicon and next-gen server technologies (e.g., NVIDIA GB200, AMD, Intel).
  • Enable secure root-of-trust (RoT) integrations and SmartNIC / HostNIC convergence for next-generation platform reliability.
  • Collaborate with cross-functional teams across Compute, Networking, Security, Datacenter Engineering, and Hardware Development.
  • Contribute to the evolution of OCI infrastructure toward composable hardware and next-generation data center clusters.
  • Drive design reviews, participate in on-call rotations, and contribute to operational excellence and incident prevention.
  • Provide technical leadership in troubleshooting, root cause analysis, and continuous improvement of service reliability.

Qualifications

Disclaimer :

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only

US : Hiring Range in USD from : $79,800 to $178,100 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following :

1. Medical, dental, and vision insurance, including expert medical opinion

2. Short term disability and long term disability

3. Life insurance and AD&D

4. Supplemental life insurance (Employee / Spouse / Child)

5. Health care and dependent care Flexible Spending Accounts

6. Pre-tax commuter and parking benefits

7. 401(k) Savings and Investment Plan with company match

8. Paid time off : Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

9. 11 paid holidays

10. Paid sick leave : 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

11. Paid parental leave

12. Adoption assistance

13. Employee Stock Purchase Plan

14. Financial planning and group legal

15. Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC3

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Create a job alert for this search

Software Engineer Infrastructure • Santa Clara, CA, United States

Related jobs
  • Promoted
Sr. AI / Edge Compute Engineer

Sr. AI / Edge Compute Engineer

PlanetSan Francisco, California, United States
Full-time
We believe in using space to help life on Earth.Planet designs, builds, and operates the largest constellation of imaging satellites in history. This constellation delivers an unprecedented dataset ...Show moreLast updated: 30+ days ago
  • Promoted
Principal Software Engineer AI Platform

Principal Software Engineer AI Platform

Snorkel AiRedwood City, California, United States
Full-time
At Snorkel, we believe meaningful AI doesn’t start with the model, it starts with the data.We’re on a mission to help enterprises transform expert knowledge into specialized AI at scale.The AI land...Show moreLast updated: 2 days ago
  • Promoted
Software Engineer - Analytics & AI

Software Engineer - Analytics & AI

Cxapp Us, Inc.San Ramon, California, United States
Full-time
At CXApp, we are the innovators of Indoor Intelligence, delivering actionable insights for people, places and things.Our flagship product the “CXApp” is a workplace experience platform for the ente...Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer, Enterprise AI

Software Engineer, Enterprise AI

Scale AI, Inc.San Francisco, CA, United States
Full-time
Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...Show moreLast updated: 30+ days ago
  • Promoted
Sr Software Engineer, AI Compiler

Sr Software Engineer, AI Compiler

TenstorrentSanta Clara, California, United States
Full-time +1
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions mu...Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer II, AI Box

Software Engineer II, AI Box

BoxRedwood City, California, United States
Full-time
Box (NYSE : BOX) is the leader in Intelligent Content Management.Our platform enables organizations to fuel collaboration, manage the entire content lifecycle, secure critical content, and transform ...Show moreLast updated: 30+ days ago
  • Promoted
Senior Software Engineer - Machine Learning Platform

Senior Software Engineer - Machine Learning Platform

SnowflakeMenlo Park, California, United States
Full-time
The Snowflake Machine Learning Platform team’s mission is to enable customers to bring their machine learning and deep learning workloads to Snowflake. Our customers want to build powerful models wi...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Software Engineer - Engineering Productivity (Data Services)

Sr. Software Engineer - Engineering Productivity (Data Services)

Reliable RoboticsMountain View, CA, United States
Permanent
We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Staff Software Engineer, AI Infra

Sr. Staff Software Engineer, AI Infra

LinkedinMountain View, California, United States
Full-time
LinkedIn is the worlds largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover excit...Show moreLast updated: 30+ days ago
  • Promoted
AI System Engineer, Sr. Staff

AI System Engineer, Sr. Staff

Sk Hynix AmericaSan Jose, California, United States
Full-time
Job Title : AI System Engineer, Sr.At SK hynix America, we're at the forefront of semiconductor innovation, developing advanced memory solutions that power everything from smartphones to data center...Show moreLast updated: 30+ days ago
  • Promoted
Sr Software Engineer - AI

Sr Software Engineer - AI

The Trade DeskSan Francisco, CA, United States
Full-time
The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day...Show moreLast updated: 30+ days ago
  • Promoted
Software Engineer, Machine Learning Infrastructure

Software Engineer, Machine Learning Infrastructure

DatologyaiRedwood City, California, United States
Full-time
Companies want to train their own large models on their own data.The current industry standard is to train on a random sample of your data, which is inefficient at best and actively harmful to mode...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Software Engineer, Infra

Sr. Software Engineer, Infra

Jerry.aiPalo Alto, California, USA
Full-time +1
Were building the first AI-powered.From insurance to repairs to road safety were connecting the entire car ownership experience into one mobile-first platform. Our revenue has grown 60x in the last ...Show moreLast updated: 3 days ago
  • Promoted
AI Software Engineer

AI Software Engineer

RattleSan Francisco, California, United States
Full-time
Rattle is building the first AI-powered Revenue Intelligence Platform, solving the most critical problem in B2B sales : 75% of companies miss their revenue forecasts because the entire revenue tech ...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Software Integration Engineer

Sr. Software Integration Engineer

Reliable RoboticsMountain View, CA, United States
Permanent
We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago
  • Promoted
AI Software Engineer, Search

AI Software Engineer, Search

NexusSan Francisco, California, United States
Full-time
Nexus is building a world supercomputer by leveraging the latest advancements in cryptography, engineering, and science.Our team of experts is developing and deploying the Nexus Layer 1, the Nexus ...Show moreLast updated: 30+ days ago
  • Promoted
Sr Staff Engineer Software (AI Ops)

Sr Staff Engineer Software (AI Ops)

Palo Alto NetworksSanta Clara, California, United States
Full-time
At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show moreLast updated: 30+ days ago
  • Promoted
Senior / Staff AI Algorithms Engineer

Senior / Staff AI Algorithms Engineer

DexterityRedwood City, California, United States
Full-time
At Dexterity, we believe robots can positively transform the world.Our breakthrough technology frees people to do the creative, inspiring, problem-solving jobs that humans do best by enabling robot...Show moreLast updated: 30+ days ago