Talent.com
Principal AI/ML System Software Engineer
Principal AI/ML System Software Engineerd-Matrix • Santa Clara, CA, United States
Principal AI / ML System Software Engineer

Principal AI / ML System Software Engineer

d-Matrix • Santa Clara, CA, United States
30+ days ago
Job type
  • Full-time
Job description

At d-Matrix , we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. Our culture is one of respect and collaboration.

We value humility and believe in direct communication. Our team is inclusive , and our differing perspectives allow for better solutions. We are seeking individuals passionate about tackling challenges and are driven by execution. Ready to come find your playground? Together , we can help shape the endless possibilities of AI.

Location

Hybrid, working onsite at our Santa Clara, CA, headquarters 3 days per week.

Principal AI / ML System Software Engineer

What you will do

The role requires you to be part of the team that helps productize the SW stack for our AI compute engine. As part of the software team, you will be responsible for the development, enhancement, and maintenance of the next-generation AI deployment software. You have had past experience working across all aspects of the full-stack toolchain and understand the nuances of what it takes to optimize and trade-off various aspects of hardware-software co-design. You are able to build and scale software deliverables in a tight development window. You will work with a team of system software experts to build out the deployment infrastructure, working closely with other software (ML, compilers) and hardware experts in the company.

What you will bring

Minimum

  • BS in Computer Science, Engineering, Math, Physics, or related degree with 12+ years of industry software development experience and MS in Computer Science, Engineering, Math, Physics, or related degree preferred with 6+ years
  • Strong grasp of system software, data structures, computer architecture, and machine learning fundamentals
  • Proficient in C / C++ / Python development in Linux environment and using standard development tools
  • Experience with distributed, high-performance software design and implementation
  • Self-motivated team player with a strong sense of ownership and leadership.

Preferred

  • MS or PhD in Computer Science, Electrical Engineering, or related fields
  • Experience with inference servers / model serving frameworks (such as TensorRT-LLM, vLLM, SGLang, etc.)
  • Experience with deep learning frameworks (such as PyTorch and TensorFlow)
  • Experience with deep learning runtimes (such as ONNX Runtime, TensorRT, etc.).
  • Experience with distributed systems collectives such as NCCL and OpenMPI
  • Experience with software testing fundamentals
  • Experience deploying ML workloads (LLMs, VLMs, NLP, etc.) on distributed systems.
  • Experience with Kubernetes, Ray, or other MLOps tools and techniques used from definition to deployment
  • Prior startup, small team, or incubation experience
  • Work experience at a cloud provider or AI compute / subsystem company
  • #LI-DL1

    Equal Opportunity Employment Policy

    d-Matrix is proud to be an equal opportunity workplace and affirmative action employer. We’re committed to fostering an inclusive environment where everyone feels welcomed and empowered to do their best work. We hire the best talent for our teams, regardless of race, religion, color, age, disability, sex, gender identity, sexual orientation, ancestry, genetic information, marital status, national origin, political affiliation, or veteran status. Our focus is on hiring teammates with humble expertise, kindness, dedication and a willingness to embrace challenges and learn together every day.

    d-Matrix does not accept resumes or candidate submissions from external agencies. We appreciate the interest and effort of recruitment firms, but we kindly request that individual interested in opportunities with d-Matrix apply directly through our official channels. This approach allows us to streamline our hiring processes and maintain a consistent and fair evaluation of all applicants. Thank you for your understanding and cooperation.

    #J-18808-Ljbffr

    Create a job alert for this search

    Principal Software Engineer • Santa Clara, CA, United States

    Related jobs
    Sr. Software Engineer - AI / LLM Applications (26456)

    Sr. Software Engineer - AI / LLM Applications (26456)

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 12 days ago • Promoted
    Principal Software Engineer, ML Systems

    Principal Software Engineer, ML Systems

    Waymo • Mountain View, CA, US
    Full-time
    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show more
    Last updated: 10 days ago • Promoted
    Chief AI Systems Architect for LLMs

    Chief AI Systems Architect for LLMs

    SAP • Palo Alto, CA, US
    Full-time
    A global technology company in California is looking for a Chief Machine Learning Expert to design and deliver advanced AI and LLM systems. The ideal candidate will have extensive software engineeri...Show more
    Last updated: 23 hours ago • Promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    General Motors • Sunnyvale, CA, United States
    Full-time
    We are seeking a Principal AI Engineer to lead the design and advancement of our AI platform.You will play a key role in shaping the infrastructure that powers large-scale training and cloud infere...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI / ML Software Engineer – Scalable Systems

    Senior AI / ML Software Engineer – Scalable Systems

    Apple Inc. • Cupertino, CA, United States
    Full-time
    A leading tech company in Cupertino is seeking a Senior Software Engineer for its Rights & Pricing engineering team.The role involves collaboration on AI / ML projects, code modernization, and mentor...Show more
    Last updated: 23 hours ago • Promoted
    AI / ML System Software Engineer, Staff

    AI / ML System Software Engineer, Staff

    D-matrix • Santa Clara, California, United States
    Full-time
    AI to power the transformation of technology.We are at the forefront of software and hardware innovation, pushing the boundaries of what is possible. We value humility and believe in direct communic...Show more
    Last updated: 30+ days ago • Promoted
    Principal ML Engineer — Lead End-to-End AI / ML Platforms

    Principal ML Engineer — Lead End-to-End AI / ML Platforms

    Intuit Inc. • Mountain View, CA, United States
    Full-time
    A financial technology company is seeking a Principal Machine Learning Engineer in Mountain View, California.This role involves leading AI strategy and deploying AI / ML solutions across financial pr...Show more
    Last updated: 3 days ago • Promoted
    Principal Software Engineer - AI Systems

    Principal Software Engineer - AI Systems

    Walmart Canada • Sunnyvale, CA, US
    Full-time
    Balance functional requirements with non-functional goals such as reliability, latency, and security.Experience with ML / AI frameworks (PyTorch, TensorFlow, Hugging Face) as.RAG pipelines, vector da...Show more
    Last updated: 10 days ago • Promoted
    Principal ML Engineer — Production AI Systems

    Principal ML Engineer — Production AI Systems

    Inworld AI • Mountain View, CA, United States
    Full-time
    A leading AI technology company in Mountain View seeks experienced Machine Learning Engineers to research, build, and optimize production ML systems. Candidates should have a robust background in so...Show more
    Last updated: 3 days ago • Promoted
    Senior ML Engineer : Architect Scalable AI Systems

    Senior ML Engineer : Architect Scalable AI Systems

    GEICO • Palo Alto, CA, US
    Full-time
    A leading insurance company in California is seeking a Staff Machine Learning Engineer to lead an AI / Machine Learning team. You will architect scalable AIML solutions, develop best practices for mac...Show more
    Last updated: 23 hours ago • Promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Servicenow • Santa Clara, California, United States
    Full-time
    It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market ...Show more
    Last updated: 30+ days ago • Promoted
    AI System Software Engineer

    AI System Software Engineer

    Matx • Mountain View, California, United States
    Full-time
    MatX's mission is to make the world’s best AI models run as efficiently as allowed by physics, bringing the world years ahead in AI quality and availability. MatX is seeking silicon verification eng...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning System Software Engineer

    Machine Learning System Software Engineer

    Apple • Sunnyvale, CA, United States
    Full-time
    At Apple, we're on the cutting edge of delivering transformative experiences through Artificial Intelligence.If you're passionate about pushing the boundaries of AI and hardware optimization, we wa...Show more
    Last updated: 30+ days ago • Promoted
    Machine Learning System Engineer

    Machine Learning System Engineer

    Signify Technology • San Jose, CA, United States
    Full-time
    Machine Learning System Engineer.An AI startup is seeking a Senior Systems Engineer to optimize deep learning performance at scale. In this role, you’ll work at the intersection of systems, infrastr...Show more
    Last updated: 30+ days ago • Promoted
    Principal Machine Learning Engineer

    Principal Machine Learning Engineer

    Coupang • Mountain View, CA, United States
    Full-time
    We know we’re doing the right thing when we hear our customers say, “How did I ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re coll...Show more
    Last updated: 22 days ago • Promoted
    Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform

    Software Engineer L4 / L5, Model Serving Systems, Machine Learning Platform

    Netflix • Los Gatos, California, United States
    Full-time
    Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and lan...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - AI / ML

    Senior Software Engineer - AI / ML

    Microsoft Corporation • Mountain View, CA, United States
    Full-time
    OverviewJoin the PowerPoint team as we deliver modern, intelligent, and collaborative experiences that will delight millions of PowerPoint customers. Located in the heart of Silicon Valley in Mounta...Show more
    Last updated: 2 days ago • Promoted
    Principal Software Engineer - AI Systems

    Principal Software Engineer - AI Systems

    ODAIA • Sunnyvale, CA, United States
    Full-time
    Design and implement large-scale, production-grade AI systems that integrate LLMs and Generative AI into real-world applications. Build frameworks that support Retrieval-Augmented Generation (RAG), ...Show more
    Last updated: 3 days ago • Promoted