Talent.com
Sr Hardware Developer, GPU / AI and Compute

Sr Hardware Developer, GPU / AI and Compute

OracleSanta Clara, CA, United States
9 days ago
Job type
  • Full-time
Job description

Job Description

Oracle hardware development engineering, within Oracle's Cloud Infrastructure development, is seeking a highly driven GPU Platform Hardware Engineer at the Senior Engineer level. The GPU Hardware Engineer will work within development engineering with a small team of talented engineers who lead the development and day-to-day engineering efforts for Oracle's rapidly growing and successful Cloud AI platforms. You will participate in hardware development oversight & in house development, design reviews, Hardware integration, debug, and performance testing. You will interact closely with third party GPU IC suppliers & partners as well as internal hardware and software development engineers. You will be a critical part of the team developing Oracle's growing Cloud AI solutions. The team you will be joining has delivered all generations of Oracle Cloud dedicated compute, AI platforms, and is working on the current and next generation of Cloud and Enterprise systems.

Responsibilities

You will be responsible for, and not limited to :

Review and assessment of third-party merchant silicon.

Evaluation of system architecture and proposed implementation path analysis.

You will participate in platform definition and analysis.

Provide platform development oversight for partners.

Work with in-house engineering functional experts on design and reviews.

Support system integration, performance testing, debug and characterization.

Support program managers on technical assessments.

You will interact closely with third party GPU IC suppliers & partners as well as internal hardware, software development, quality assurance, cloud orchestration, hardware and software security experts, and Oracle manufacturing teams.

You will document and specify design intent and design details where appropriate in collaboration with the appropriate engineering teams.

Participate in hardware platform security evaluations.

Guide partner internal Oracle teams on support needed to scale, monitor, and successfully deploy our products to the Cloud.

You will assist Oracle Cloud and Support teams in the root-cause of potential hardware or software bugs through firsthand lab replication debug, remote debug, and calls with the appropriate teams supporting our deployed products.

Work with Oracle manufacturing teams to ensure that Oracle hardware is secure, robustly evaluated, performing at peak capabilities and well qualified for deployment to our Cloud customers.

What This Role Looks Like

Work directly with hardware design and development teams on architecture, implementation, development, deployment, and troubleshooting of AI hardware platforms. Collaboration is also expected with the wider Oracle engineering and operations functional groups as well as our external partners.

Develop, implement, and run the day-to-day execution of AI platform development, both internally and in partnership with third-party design teams. Including reviews of design plans, schematics, board layout, test feature definition / guidance for subsystem test, as well as System validation plans. Work on system and hardware integration, system test and qualification, work with software diagnostics engineers to test functionality, and utilize third party as well as approved open-source AI platform qualification test tools. Add to a roster of system characterization and performance testing capabilities and support definition of in-service system monitoring and error reporting needs.

Work closely and collaborate with hardware developers, System architects, System engineers, technical leads, platform firmware developers, partners and AI chip / GPU suppliers, storage, networking and compute experts, on product development and then with Manufacturing and external suppliers assisting across the new product introduction process out to production. You will also serve as one of the last level of engineering technical support when cloud and support teams require guidance and help in resolving complex deployed product issues.

Required Qualifications

Technical hands-on experience with market leading GPU (or alternate AI platforms) from the hardware and platform development, test, and characterization perspectives.

Good knowledge of AI / GPU platform architecture and their capabilities.

A strong understanding and experience running firmware and system diagnostics tools using BMC firmware, UEFI / BIOS and Linux tools. Skilled in scripting to customize tests.

Demonstrated working experience with GPU supplier test code as well as open-source AI test / characterization tools.

Experience with design, and implementation of modern server platforms consisting of multiple architectures and vendors, including x86 and ARM server architectures.

Experience with hardware development at the board, and FPGA level.

Required experience with board ECAD level tools and ability to reviews hierarchical schematics, multilayer advance board layout, cross board interconnect and end-to-end connectivity analysis.

Strong communications skills and ability to clearly communicate complex technical issue across engineering disciplines as well as clearly and succinctly articulate issues for executives.

Demonstrated experience debugging and root-causing complex issues that may have a mix of hardware and software causes.

Experience with early stage bring-up and power-on, platform firmware debugging, prototype GPU & CPU complex and memory complex debugging.

An ability to isolate a problem to the source and the required creativity & expertise to devise timely and robust solutions.

Experience and understanding of the latest high-speed busses and interconnect used in modern Compute and AI platforms. Familiarity with their startup connectivity and operational robustness.

Preferred Qualifications

Demonstrated knowledge of "low-level" hardware component interfaces, including, but not limited to, e.g. : PCIe, SPI, I2C (incl. SMBus, PMBus), LPC, eSPI, etc.

Comfortable with the use of hardware debuggers, O'Scopes, and advanced Signal characterization measurement tools.

Experience with platform level security technologies present an advantage in the role.

Disclaimer :

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only

US : Hiring Range in USD from : $87,000 to $178,100 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following :

Medical, dental, and vision insurance, including expert medical opinion

Short term disability and long term disability

Life insurance and AD&D

Supplemental life insurance (Employee / Spouse / Child)

Health care and dependent care Flexible Spending Accounts

Pre-tax commuter and parking benefits

401(k) Savings and Investment Plan with company match

Paid time off : Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

11 paid holidays

Paid sick leave : 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

Paid parental leave

Adoption assistance

Employee Stock Purchase Plan

Financial planning and group legal

Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC3

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

Create a job alert for this search

Hardware Developer • Santa Clara, CA, United States

Related jobs
  • Promoted
Senior Hardware Engineer

Senior Hardware Engineer

SonatusSunnyvale, CA, United States
Full-time
Join a high-performing team at Sonatus that's redefining what cars can do in the era of Software-Defined Vehicles (SDV).At Sonatus, we're driving the transformation to AI-enabled software-defined v...Show moreLast updated: 3 days ago
  • Promoted
Travel CT Tech - $2,600 to $2,704 per week in Santa Cruz, CA

Travel CT Tech - $2,600 to $2,704 per week in Santa Cruz, CA

AlliedTravelCareersSANTA CRUZ, CA, US
Full-time
AlliedTravelCareers is working with AMN Healthcare Allied to find a qualified CT Tech in SANTA CRUZ, California, 95065!.Job Description & Requirements. Computed Tomography Technologist - (CT Tec...Show moreLast updated: 3 days ago
  • Promoted
Travel CT Tech - $2,450 to $2,830 per week in Santa Cruz, CA

Travel CT Tech - $2,450 to $2,830 per week in Santa Cruz, CA

AlliedTravelCareersSanta Cruz, CA, US
Full-time
AlliedTravelCareers is working with National Staffing Solutions to find a qualified CT Tech in Santa Cruz, California, 95060!. Details of the CT Tech - Computed Tomography (CT) opening in Santa Cruz...Show moreLast updated: 3 days ago
  • Promoted
Sr Principal Hardware Engineer

Sr Principal Hardware Engineer

LumentumSan Jose, CA, United States
Full-time
It's fun to work in a company where people truly BELIEVE in what they're doing! We’re committed to bringing passion and customer focus to the business. If you like wild growth and working with happy...Show moreLast updated: 15 days ago
  • Promoted
Senior Hardware Engineer

Senior Hardware Engineer

AxiadoSan Jose, CA, United States
Full-time
Axiado is an AI-enhanced security processor company redefining the control and management of every digital system.The company was founded in 2017, and currently has 100+ employees.At Axiado, develo...Show moreLast updated: 9 days ago
  • Promoted
Sr. System Engineer - GPU Servers (27156)

Sr. System Engineer - GPU Servers (27156)

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC, and IoT / Embedded customers...Show moreLast updated: 12 days ago
  • Promoted
Sr. Hardware Design Engineer - x86 / GPU / HPC (27752)

Sr. Hardware Design Engineer - x86 / GPU / HPC (27752)

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 12 days ago
  • Promoted
Senior Engineer, AI / ML Hardware / Software Validation

Senior Engineer, AI / ML Hardware / Software Validation

Samsung SemiconductorSan Jose, CA, US
Full-time
To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period.Advancing the World's Tec...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Research and Development Technician, Advanced Development

Sr. Research and Development Technician, Advanced Development

Calyxo, Inc.Pleasanton, CA, United States
Full-time
The company was founded in 2016 to address the profound need for improved kidney stone treatment.Kidney stone disease is a common, painful condition that consumes vast amounts of healthcare resourc...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Director Data Center GPU Platform & System Validation GPU

Sr. Director Data Center GPU Platform & System Validation GPU

Advanced Micro Devices, Inc.San Jose, CA, United States
Full-time
WHAT YOU DO AT AMD CHANGES EVERYTHING.At AMD, our mission is to build great products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded syst...Show moreLast updated: 9 days ago
  • Promoted
Robotics Hardware Engineer

Robotics Hardware Engineer

OrchardSan Francisco, CA, United States
Full-time
Series A startup backed by top VCs like Quiet Capital, Shine Capital, and General Catalyst.We're securing America’s food supply by building the AI farmer that automates our nation’s farms.We've rai...Show moreLast updated: 15 days ago
  • Promoted
Travel CT Tech - $2600 / Week

Travel CT Tech - $2600 / Week

AMN Healthcare AlliedSanta Cruz, CA, US
Full-time
AMN Healthcare Allied is seeking an experienced CT Tech for an exciting Travel Allied job in Santa Cruz, CA.Shift : 8 hr nights Start Date : 12 / 08 / 2025 Duration : 13 weeks Pay : $2600 / Week.Job Descri...Show moreLast updated: 3 days ago
  • Promoted
Software Engineer, GPU Inference

Software Engineer, GPU Inference

OpenAISan Francisco, CA, United States
Full-time
The Sora team is pioneering multimodal capabilities for OpenAI's foundation models.We're a hybrid research and product team focused on integrating multimodal functionalities into our AI products, e...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Avionics Hardware Engineer

Sr. Avionics Hardware Engineer

Reliable RoboticsMountain View, CA, United States
Permanent
We're building safety-enhancing technology for aviation that will save lives.Automated aviation systems will enable a future where air transportation is safer, more convenient and fundamentally tra...Show moreLast updated: 30+ days ago
  • Promoted
Sr. R&D Engineer, Instruments

Sr. R&D Engineer, Instruments

Calyxo, Inc.Pleasanton, CA, United States
Full-time
The company was founded in 2016 to address the profound need for improved kidney stone treatment.Kidney stone disease is a common, painful condition that consumes vast amounts of healthcare resourc...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Hardware Design Engineer - x86 / GPU / HPC Servers (27733)

Sr. Hardware Design Engineer - x86 / GPU / HPC Servers (27733)

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 12 days ago
  • Promoted
Senior Hardware PCIe Engineer, Systems Engineering

Senior Hardware PCIe Engineer, Systems Engineering

Pure StorageSanta Clara, CA, United States
Full-time
We're in an unbelievably exciting area of tech and are fundamentally reshaping the data storage industry.Here, you lead with innovative thinking, grow along with us, and join the smartest team in t...Show moreLast updated: 9 days ago
  • Promoted
Hardware Engineer

Hardware Engineer

SupermicroSan Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show moreLast updated: 30+ days ago