LLM Algorithmic Optimization Engineer

NIO • San Jose, CA, United States

13 hours ago

Job type

Full-time

Job description

About NIO

NIO is a pioneer and a leading company in the premium smart electric vehicle market. Founded in November 2014, NIO's mission is to shape a joyful lifestyle. NIO aims to build a community starting with smart electric vehicles to share joy and grow together with users.

NIO designs, develops, jointly manufactures and sells premium smart electric vehicles, driving innovations in next-generation technologies in autonomous driving, digital technologies, electric powertrains and batteries. NIO differentiates itself through its continuous technological breakthroughs and innovations, such as its industry-leading battery swapping technologies, Battery as a Service, or BaaS, as well as its proprietary autonomous driving technologies and Autonomous Driving as a Service, or ADaaS.

NIO's product portfolio consists of the ES8, a six-seater smart electric flagship SUV, the ES7 (or the EL7), a mid-large five-seater smart electric SUV, the ES6, a five-seater all-round smart electric SUV, the EC7, a five-seater smart electric flagship coupe SUV, the EC6, a five-seater smart electric coupe SUV, the ET7, a smart electric flagship sedan, and the ET5, a mid-size smart electric sedan.

Roles and Responsibilities:

Research and apply cutting-edge techniques to optimize Large Language Models (LLMs) and multimodal models. Explore and implement core algorithmic optimizations on heterogeneous architectures for highly efficient LLM inference and deployment across distributed and heterogeneous hardware environments.
Focus on model optimization from a systems perspective, ensuring efficient deployment in the vehicle's digital cockpit and advanced driving (AD) domains.
Collaborate with cross-functional teams to integrate optimized models into real-world automotive applications.
Contribute to the entire pipeline, from research, development, and testing through to deployment on hardware, including GPUs and distributed systems.

Qualifications:

Currently pursuing or have completed a PhD or Master's degree in Computer Science, Computer Engineering, Applied Mathematics, Communications, Electronics, or a related field, with relevant research projects and publications.

Strong understanding of GPU / NPU architecture and optimization techniques to identify and address bottlenecks.

Proficiency in LLM and VLM architectures and algorithms; familiarity with transformer-based NLP, audio, and CV algorithms and technologies.

Proficiency in Python and experience with AI-related training and inference tools such as PyTorch.

Proficiency in C / C++ programming; familiarity with at least one commonly used LLM inference engine.

Hands-on experience with model exchange formats and runtimes such as Open Neural Network Exchange (ONNX).

Familiarity with debugging code in distributed computing environments.

Experience in LLM inference optimization on resource-constrained edge devices is a plus.

Preferred Qualifications:

Ph.D. in computer science, artificial intelligence, or a related field; or a Master's degree plus 3 years of relevant industry experience.

Experience with inference optimization techniques for deep learning models or libraries on hardware architectures.

Familiarity with microkernel architecture, the Linux kernel, hypervisors, middleware, and application frameworks.

Candidates with a strong publication record, including high-impact, innovative papers, are preferred.

Compensation:

The US base salary range for this full-time position is $143,200.00 - $186,000.00.

Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.

Please note that the compensation details listed in US role postings reflect the base salary only. They do not include discretionary bonus, equity, or benefits.

Benefits:

Along with competitive pay, as a full-time NIO employee, you are eligible for the following benefits on the first day you join NIO:

CIGNA EPO, HSA, and Kaiser HMO medical plans with $0 for Employee Only Coverage.

Dental (including orthodontic coverage) and vision plan. Both provide options with a $0 paycheck contribution covering you and your eligible dependents.

Company Paid HSA (Health Savings Account) Contribution when enrolled in the High Deductible CIGNA medical plan

Healthcare and Dependent Care Flexible Spending Accounts (FSA)

401(k) with Brokerage Link option

Company paid Basic Life, AD&D, short-term and long-term disability insurance

Employee Assistance Program

Sick and Vacation time

13 Paid Holidays a year

Paid Parental Leave for first 8 weeks at full pay (eligible after 90 days of employment with NIO)

Paid Disability Leave for first 6 weeks at full pay (eligible after 90 days of employment with NIO)

Voluntary benefits, including Voluntary Life and AD&D options for you, your spouse / domestic partner, and dependent child(ren), and pet insurance

Commuter benefits

Mobile Cell Phone Credit

HealthJoy mobile benefits app, supporting you and your dependents with benefits and billing questions on the go

Free lunch and snacks

Onsite gym

Employee discounts and perks program
