Lead ML Research Engineer (LLMs, Quantization, Inference)
We are seeking a Lead Machine Learning Engineer with a strong systems focus to drive the development of next-generation computing solutions.
This role suits an engineer who thrives under pressure and can apply deep expertise in machine learning and systems integration to deliver measurable gains in computational performance and efficiency.
Our company is building a tightly integrated chip and software stack for advanced computing applications, with the potential to support artificial general intelligence (AGI) in the future.
Our goal is a platform that substantially outperforms current benchmarks in cost-effectiveness and performance, setting new standards for computational efficiency.
In this crucial role, you will lead efforts to train and optimize large language models (LLMs) tailored for our innovative hardware.
Your focus will be on building distributed systems for efficient training and inference, ensuring that our solutions deliver a dramatic increase in performance-per-dollar, well beyond typical industry gains.
What We Offer:
- Competitive salary and significant equity in a pioneering company.
- Customized benefits package to meet your needs.
- A dynamic and supportive work environment that encourages innovation and collaboration.
- Opportunities for professional growth in a rapidly advancing field.
Key Responsibilities:
- Optimize LLMs to maximize the performance capabilities of our specialized hardware.
- Conduct thorough quality evaluations to ensure superior performance standards.
- Develop and manage scalable distributed systems for efficient training and inference processes.
- Apply hardware-architecture expertise to improve the efficiency and effectiveness of machine learning workloads.