Talent.com
Principal Software Engineer - AI GPU Innovation

Oracle • Jefferson City, MO, United States
2 days ago
Job type
  • Full-time
Job description

Oracle Cloud Infrastructure's (OCI) architecture development engineering team is seeking a highly driven GPU platform software & system development engineer at the Principal Engineer level. We are at the forefront of AI innovation, exploring the next generation of AI accelerators and hardware solutions.

As a Senior Principal Software Engineer on our growing team, you will evaluate, prototype, and optimize cutting-edge AI hardware and accelerators, including custom-designed AI chips, systems, and software, to drive next-generation Cloud AI Infrastructure platforms.

You will contribute to platform definition, platform development oversight, in-house development, design reviews, system integration, and performance testing and characterization. You will interact closely with third-party GPU IC suppliers and partners, as well as internal hardware and software development teams, to help drive Oracle's AI Cloud platform solution space. You will be a critical part of the team developing Oracle's growing Cloud AI Infra solutions.

You will work with the latest AI hardware architectures, benchmark their performance, and collaborate with software engineers to ensure tight integration with AI workloads. You'll have a direct impact on shaping the future of AI hardware for machine learning and deep learning applications.

Career Level - IC5

Responsibilities

Our Senior Principal engineers work independently and provide technical leadership to the broader organization. You should have experience developing AI infrastructure and operating high-scale services, and an understanding of how to make these cloud-scale services resilient. The ideal candidate will be technically strong and productive: someone who knows how to balance speed and quality with iterative and incremental improvements. You understand operational excellence and know how to instill a proactive culture within your team. You recommend and justify major changes to new and existing products and establish consensus with data-driven approaches.

Evaluate system architecture and analyze proposed implementation paths.

Work directly with hardware design and development teams on architecture, implementation, development, deployment, and troubleshooting of AI hardware platforms. Collaboration is also expected with the wider Oracle engineering and operations functional groups as well as our external partners.

Conduct comprehensive benchmarking and performance analysis of AI accelerators from emerging hardware vendors (e.g., SambaNova, Groq).

Compare and contrast new AI accelerators with industry-standard hardware (e.g., NVIDIA GPUs) for training and inference workloads.

Develop tools and processes for evaluating the performance of hardware in real-world AI applications.

Contribute to the design and improvement of performance optimization algorithms for AI models running on the hardware.

Basic Qualifications

BS or MS degree in Computer Science or relevant technical field involving coding or equivalent practical experience.

10+ years of total experience in software development

Demonstrated ability to write great code using Java, GoLang, C#, or similar OO languages.

Solid knowledge of AI / GPU platform architecture and their capabilities.

Experience working on large-scale, highly distributed services infrastructure.

Solid working experience with GPU supplier test code as well as open-source AI test / characterization tools.

Experience with the architecture, design, and implementation of modern server platforms consisting of multiple architectures and vendors, including x86 and ARM server architectures.

Demonstrated experience debugging and root-causing complex issues that may have a mix of hardware and software causes.

Systematic problem-solving approach, strong communication skills, a sense of ownership, and drive

Preferred Qualifications

Experience as technical lead on a large-scale cloud service

Hands-on experience developing and maintaining services on a public cloud platform (e.g., AWS, Azure, Oracle)

Experience with AI accelerator chips (e.g., SambaNova, Groq, etc.).

Knowledge of AI accelerator benchmarks and tools for performance evaluation (e.g., MLPerf, DeepBench).

Understanding of AI model optimization techniques for hardware acceleration.

Strong understanding of, and experience running, firmware and system diagnostics tools using BMC firmware, UEFI / BIOS, and Linux tools. Skilled in scripting to customize tests.

Disclaimer:

Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.

Range and benefit information provided in this posting are specific to the stated locations only.

US: Hiring range in USD from $96,800 to $223,400 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.

Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:

Medical, dental, and vision insurance, including expert medical opinion

Short term disability and long term disability

Life insurance and AD&D

Supplemental life insurance (Employee / Spouse / Child)

Health care and dependent care Flexible Spending Accounts

Pre-tax commuter and parking benefits

401(k) Savings and Investment Plan with company match

Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.

11 paid holidays

Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.

Paid parental leave

Adoption assistance

Employee Stock Purchase Plan

Financial planning and group legal

Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.

Career Level - IC4

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
