Talent.com
Machine Learning Engineer - Fine-Tuning and On-device AI
Machine Learning Engineer - Fine-Tuning and On-device AIHp Iq • Palo Alto, CA, US
Machine Learning Engineer - Fine-Tuning and On-device AI

Machine Learning Engineer - Fine-Tuning and On-device AI

Hp Iq • Palo Alto, CA, US
1 day ago
Job type
  • Full-time
Job description

Machine Learning Engineer – Fine-Tuning and On-device AI

HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates.

We're assembling a diverse, world-class team—engineers, designers, researchers, and product minds—focused on creating an intelligent ecosystem across HP's portfolio. Together, we're developing intuitive, adaptive solutions that spark creativity, boost productivity, and make collaboration seamless.

We create breakthrough solutions that make complex tasks feel effortless, teamwork more natural, and ideas more impactful—always with a human-centric mindset.

By embedding AI advancements into every HP product and service, we're expanding what's possible for individuals, organisations, and the future of work.

Join us as we reinvent work, so people everywhere can do their best work.

About the Role

We are seeking a Machine Learning Engineer to lead the fine-tuning, optimization, and deployment of AI models for diverse tasks, with a strong emphasis on on-device inference. You will work on cutting-edge applications such as orchestration, planning, multi-agent coordination, and other intelligent decision-making systems.

You will be responsible for adapting foundation models (LLMs, multimodal models) to specialized domains, making them fast, accurate, and efficient for resource-constrained environments—while ensuring robustness and safety.

What You Might Do

Fine-tune large language models, multimodal models, and task-specific models for orchestration, planning, and any other workflows as defined.

Design and run experiments to improve task accuracy, robustness, and generalization.

Explore and apply methods like full fine-tuning, LoRA, QLoRA and other types of parameter-efficient fine-tuning.

Employ advanced techniques such as QAT, DPO, GRPO to further improve the model quality.

On-Device Optimization

Prune, quantize and compress models (e.g., INT8, INT4, mixed-precision) for CPU, GPU, NPU and edge accelerators.

Optimize models for low-latency inference using frameworks like OpenVINO, ONNX Runtime, QNNetc..

Build robust data pipelines for domain-specific datasets, including synthetic data generation and annotation.

Define evaluation metrics. Perform evaluations and analyze results.

Establish best practices for versioning, reproducibility, and continuous improvement of model performance.

AI Orchestration & Planning

Develop and refine models to support multi-step reasoning, tool orchestration, and decision planning.

Work with stakeholders on orchestrator architecture.

Collaborate with product and research teams to design intelligent, context-aware assistant capabilities.

Essential Qualifications

5+ years of experience in applied machine learning, including at least 3 years in LLM fine-tuning.

Proficiency in Python and ML framework ecosystem (HuggingFace, PyTorch).

Strong understanding of transformer architectures, attention mechanisms, and PEFT techniques.

Experience with on-device inference optimization (OpenVINO, ONNX, QNN).

Familiarity with orchestration / planning architectures and techniques for AI assistants.

Track record of delivering production-ready ML solutions in latency-sensitive environments.

Preferred Qualifications

Experience with multi-agent systems or AI assistant orchestration.

Familiarity with advanced inference optimization techniques such as KV cache paging, flash attention.

Knowledge about common inference engines, including but not limited to llama.cpp, vLLM.

Salary Range : $120,000 - $215,000

Compensation & Benefits (Full-Time Employees)

The salary range for this role is listed above. Final salary offered is based upon multiple factors including individual job-related qualifications, education, experience, knowledge and skills.

Health insurance

Vision insurance

Long term / short term disability insurance

Employee assistance program

Flexible spending account

Life insurance

Generous time off policies, including

4-12 weeks fully paid parental leave based on tenure

11 paid holidays

Additional flexible paid vacation and sick leave (US benefits overview)

HP IQ is HP's new AI innovation lab, building the intelligence to empower humanity—reimagining how we work, create, and connect to shape the future of work.

Innovative Work Help shape the future of intelligent computing and workplace transformation.

Autonomy and Agility Work with the speed and focus of a startup, backed by HP's scale.

Meaningful Impact Build AI-powered solutions that help people and organisations thrive.

Flexible Work Environment Freedom and flexibility to do your best work.

Forward-Thinking Culture We learn fast, stay future-focused, and imagine what comes next—together.

Equal Opportunity Employer (EEO) Statement

HP, Inc. provides equal employment opportunity to all employees and prospective employees, without regard to race, color, religion, sex, national origin, ancestry, citizenship, sexual orientation, age, disability, or status as a protected veteran, marital status, familial status, physical or mental disability, medical condition, pregnancy, genetic predisposition or carrier status, uniformed service status, political affiliation or any other characteristic protected by applicable national, federal, state, and local law(s).

Please be assured that you will not be subject to any adverse treatment if you choose to disclose the information requested. This information is provided voluntarily. The information obtained will be kept in strict confidence.

J-18808-Ljbffr

Create a job alert for this search

Machine Learning Engineer • Palo Alto, CA, US

Related jobs
AI Engineer - Machine Learning (US)

AI Engineer - Machine Learning (US)

Gauss Labs • Palo Alto, California, United States
Full-time
Gauss Labs is looking for a passionate and talented AI Engineer for developing cutting-edge Industrial AI solutions that will normalize the standard of AI for manufacturing.We are working with the ...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer (Hybrid)

Machine Learning Engineer (Hybrid)

Cisco • Milpitas, CA, United States
Full-time
Applications are accepted until 1 / 12 / 2026.Who We Are : Join us at Cisco to shape the future of enterprise AI.We are building an AI Platform Team focused on creating the next-generation foundation th...Show more
Last updated: 15 days ago • Promoted
AI Machine Learning Engineer I (Intern) - United States

AI Machine Learning Engineer I (Intern) - United States

Cisco Systems, Inc. • San Jose, California, United States
Full-time
Please note this posting is to advertise potential job opportunities.This exact role may not be open today but could open in the near future. When you apply, a Cisco representative may contact you d...Show more
Last updated: 19 days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Gridmatic • Cupertino, California, United States
Full-time
Bay Area and Houston that is accelerating the clean energy transition by applying our expertise in data, machine learning, and energy to power markets. We are the rare startup that has multiple year...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Generative AI

Machine Learning Engineer, Generative AI

Adobe Systems GmbH • San Jose, CA, US
Full-time
JOB LEVEL P30 ADDITIONAL JOB LEVELS P40 - P50 EMPLOYEE ROLE Individual Contributor Job Description Adobe Firefly's Generative AI Services team is seeking Senior Machine Learning Engineers for...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Apple Inc. • Cupertino, CA, United States
Full-time
Senior Software Engineer (Machine Learning - ML Platform).Cupertino, California, United States Software and Services.At Apple, we work every day to create products that enrich people’s lives.Apple ...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Compute

Machine Learning Engineer, Compute

Waymo • Mountain View, California, United States
Full-time
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...Show more
Last updated: 30+ days ago • Promoted
Sr. Machine Learning Engineer, GAI Search Relevance

Sr. Machine Learning Engineer, GAI Search Relevance

Moveworks • Mountain View, California, United States
Full-time
As a senior member of the core platform team, you will play a key role in shaping the evolution of moveworks conversational AI platform. You will have the opportunity to - build enterprise products ...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Tech Lead

Machine Learning Engineer, Tech Lead

Creatify Lab • Mountain View, California, United States
Full-time
Creatify is building the world’s first end-to-end AI advertising agent—a platform that automates the entire video ad lifecycle, from scripting and avatar-led generation to testing, optimization, an...Show more
Last updated: 30+ days ago • Promoted
AI Engineer - Machine Learning (US)

AI Engineer - Machine Learning (US)

Slab Inc. • Palo Alto, California, US
Full-time
Gauss Labs is looking for a passionate and talented AI Engineer for developing cutting-edge Industrial AI solutions that will normalize the standard of AI for manufacturing.We are working with the ...Show more
Last updated: 2 days ago • Promoted
Machine Learning Engineer 5

Machine Learning Engineer 5

Adobe • San Jose, CA, United States
Full-time
Adobe is seeking a Machine Learning Engineer 5 to contribute to the development of advanced machine learning models and platforms for personalized and creative customer experiences.As a senior memb...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer, Integration, AI Platforms

Machine Learning Engineer, Integration, AI Platforms

Tesla • Palo Alto, CA, United States
Full-time
Machine Learning Engineer, Integration, AI Platforms.Tesla is a leader in innovative technology, pioneering advancements in autonomous vehicles and humanoid robotics. Our cutting-edge AI platform po...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer

Machine Learning Engineer

Diffbot Technologies Corp. • Palo Alto, CA, United States
Full-time
At Diffbot, we believe that access to structured information will be the critical resource for the coming wave of intelligent applications—for everything from new app experiences, to search assista...Show more
Last updated: 22 hours ago • Promoted • New!
Machine Learning Engineer, Recommendation

Machine Learning Engineer, Recommendation

Newsbreak • Mountain View, California, United States
Full-time
NewsBreak is redefining the way users interact with local news and their communities.By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibr...Show more
Last updated: 30+ days ago • Promoted
Sr. Staff Machine Learning Engineer, Closeup Relevance

Sr. Staff Machine Learning Engineer, Closeup Relevance

Pinterest • Palo Alto, CA, United States
Full-time
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show more
Last updated: 11 days ago • Promoted
Machine Learning Engineer, LLM Fine-Tuning

Machine Learning Engineer, LLM Fine-Tuning

First Soft Solutions LLC • San Jose, CA, United States
Full-time
Machine Learning Engineer, LLM Fine‑Tuning.LLM fine‑tuning for Verilog / RTL applications.LLM fine‑tuning, Verilog / RTL, AWS, Bedrock, SageMaker. Own the technical roadmap for Verilog / RTL‑focused LLM c...Show more
Last updated: 30+ days ago • Promoted
Android AI ML Engineer - On-Device

Android AI ML Engineer - On-Device

Focuskpi • Mountain View, California, United States
Temporary
Android AI ML Engineer - On-Device.The client is looking for a highly capable Android AI / ML Engineer - On-Device to help build intelligent, privacy-first mobile systems that can detect, respond to,...Show more
Last updated: 30+ days ago • Promoted
Machine Learning Engineer 756

Machine Learning Engineer 756

Protegrity • Palo Alto, California, United States
Remote
Full-time
At Protegrity, we lead innovation by using AI and quantum-resistant cryptography to transform data protection across cloud-native, hybrid, on-premises, and open source environments.We leverage adva...Show more
Last updated: 30+ days ago • Promoted