Talent.com
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer (FM Hosting, LLM Inference)Capital One • New York, NY, United States
No longer accepting applications
Lead AI Engineer (FM Hosting, LLM Inference)

Lead AI Engineer (FM Hosting, LLM Inference)

Capital One • New York, NY, United States
21 days ago
Job type
  • Full-time
  • Part-time
Job description

Lead AI Engineer (FM Hosting, LLM Inference)

Overview

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent — along with our deep experience in machine learning — position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description :

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest leverage impact.

In this role, you will :

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.

Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability, etc.

Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more.

Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems.

Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One.

The Ideal Candidate :

You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.

Passion for staying abreast of the latest research, and an ability to intuitively understand scientific publications and judiciously apply novel techniques in production.

You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems and can articulate your findings concisely with clarity. You have the courage to share new ideas even when they are unproven.

You are deeply Technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enable you to see and exploit optimization opportunities that others miss.

You are a resilient trail blazer who can forge new paths to achieve business goals when the route is unknown.

Basic Qualifications :

Bachelors degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 4 years of experience developing AI and ML algorithms or technologies, or a Masters degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 2 years of experience developing AI and ML algorithms or technologies

At least 4 years of experience programming with Python, Go, Scala, or Java

Preferred Qualifications :

6 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud)

Experience designing, developing, delivering, and supporting AI services

Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang

Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost

Passion for staying abreast of the latest AI research and AI systems, and judiciously apply novel techniques in production

Capital One will consider sponsoring a new qualified applicant for employment authorization for this position.

The minimum and maximum full-time annual salaries for this role are listed below, by location. Please note that this salary information is solely for candidates hired to perform work within one of these locations, and refers to the amount Capital One is willing to pay at the time of this posting. Salaries for part-time roles will be prorated based upon the agreed upon number of hours to be regularly worked.

Cambridge, MA : $193,400 - $220,700 for Lead AI Engineer

McLean, VA : $193,400 - $220,700 for Lead AI Engineer

New York, NY : $211,000 - $240,800 for Lead AI Engineer

San Francisco, CA : $211,000 - $240,800 for Lead AI Engineer

San Jose, CA : $211,000 - $240,800 for Lead AI Engineer

Candidates hired to work in other locations will be subject to the pay range associated with that location, and the actual annualized salary amount offered to any candidate at the time of hire will be reflected solely in the candidate’s offer letter.

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and / or long term incentives (LTI). Incentives could be discretionary or non discretionary depending on the plan.

Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being. Learn more at the

Capital One Careers website . Eligibility varies based on full or part-time status, exempt or non-exempt status, and management level.

This role is expected to accept applications for a minimum of 5 business days.No agencies please. Capital One is an equal opportunity employer (EOE, including disability / vet) committed to non-discrimination in compliance with applicable federal, state, and local laws. Capital One promotes a drug-free workplace. Capital One will consider for employment qualified applicants with a criminal history in a manner consistent with the requirements of applicable laws regarding criminal background inquiries, including, to the extent applicable, Article 23-A of the New York Correction Law; San Francisco, California Police Code Article 49, Sections 4901-4920; New York City’s Fair Chance Act; Philadelphia’s Fair Criminal Records Screening Act; and other applicable federal, state, and local laws and regulations regarding criminal background inquiries. If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at 1-800-304-9102 or via email at

RecruitingAccommodation@capitalone.com

  • . All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.

For technical support or questions about Capital Ones recruiting process, please send an email to

Careers@capitalone.com

Capital One does not provide, endorse nor guarantee and is not liable for third-party products, services, educational tools or other information available through this site.

Capital One Financial is made up of several different entities. Please note that any position posted in Canada is for Capital One Canada, any position posted in the United Kingdom is for Capital One Europe and any position posted in the Philippines is for Capital One Philippines Service Corp. (COPSSC).

Create a job alert for this search

Lead Ai Engineer • New York, NY, United States

Related jobs
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)

Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)

Capital One • New York, NY, United States
Full-time +1
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry...Show more
Last updated: 22 days ago • Promoted
Lead Machine Learning Engineer

Lead Machine Learning Engineer

Harnham • New York, NY, United States
Full-time
Get AI-powered advice on this job and more exclusive features.This range is provided by Harnham.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more....Show more
Last updated: 30+ days ago • Promoted
AI Infrastructure Engineer, Model Serving Platform

AI Infrastructure Engineer, Model Serving Platform

Scale AI, Inc. • New York, NY, United States
Full-time
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and product...Show more
Last updated: 30+ days ago • Promoted
Lead Machine Learning Engineer -ML / AI

Lead Machine Learning Engineer -ML / AI

Capital One • New York, NY, United States
Full-time +1
Lead Machine Learning Engineer -ML / AI.At Capital One, we are changing banking for good by creating responsible and reliable AI-powered systems. Our investments in technology infrastructure and world...Show more
Last updated: 22 days ago • Promoted
AI Engineer

AI Engineer

Oscar • New York, NY, United States
Full-time
AI Engineer - Healthcare Automation Platform.Healthcare AI | Hybrid (NYC preferred) or Remote.AI-powered automation platform. Our system processes complex, unstructured data to ensure time-sensitive...Show more
Last updated: 14 days ago • Promoted
Lead Engineer - AI Clinical Navigator Product

Lead Engineer - AI Clinical Navigator Product

Health Universe, Inc. • New York, NY, United States
Full-time
Lead Engineer - AI Clinical Navigator Product.We're looking for a Lead Engineer to shape our AI Clinical Navigator tool for clinicians. AI innovation with real-world clinical impact.We're building t...Show more
Last updated: 30+ days ago • Promoted
Director, Division of Infectious Diseases

Director, Division of Infectious Diseases

Hackensack Meridian Health • Neptune Township, US
Full-time +1
Director, Division of Infectious Diseases.Jersey Shore University Medical Center.Hackensack Meridian Health – Neptune, New Jersey. Hackensack Meridian Health is seeking a Director, Division of...Show more
Last updated: 30+ days ago • Promoted
Sr. Azure Data Engineer

Sr. Azure Data Engineer

Cognizant • New Rochelle, NY, US
Full-time
Azure Data Engineer , you will make an impact by driving end-to-end data integration and migration initiatives across enterprise data platforms. You will be a valued member of our Data Engineering &...Show more
Last updated: 1 day ago • Promoted
AI & Machine Learning Engineer - Manager - Consulting - Open Location

AI & Machine Learning Engineer - Manager - Consulting - Open Location

EY • Stamford, CT, United States
Full-time
Technology – Data and Decision Science – AI Native Engineering.AI / Machine Learning Engineer, Manager Consultant.In this role, you will research, build, and implement scalable artificial intelligenc...Show more
Last updated: 9 days ago • Promoted
Associate Director, AI Tech Lead

Associate Director, AI Tech Lead

Novartis Group Companies • East Hanover, NJ, United States
Full-time
The location of this position is East Hanover, NJ.The Insights and Decision Science (IDS) team is dedicated to enabling improved decision making at Novartis by. We collaborate closely with the US bu...Show more
Last updated: 30+ days ago • Promoted
Staff Machine Learning Engineer, MLOps / LLMOps

Staff Machine Learning Engineer, MLOps / LLMOps

Teladoc Health • Purchase, NY, United States
Full-time
Join the team leading the next evolution of virtual care.At Teladoc Health, you are empowered to bring your true self to work while helping millions of people live their healthiest lives.Here you w...Show more
Last updated: 20 days ago • Promoted
Senior AI Engineer (AI Foundations, LLM Core and Agentic AI)

Senior AI Engineer (AI Foundations, LLM Core and Agentic AI)

Capital One • New York, NY, United States
Full-time +1
Senior AI Engineer (AI Foundations, LLM Core and Agentic AI).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an indust...Show more
Last updated: 22 days ago • Promoted
Director, Responsible AI

Director, Responsible AI

Novartis Group Companies • East Hanover, NJ, United States
Full-time
This position will be located at the East Hanover, NJ location and will not have the ability to be located remotely.The Insights and Decision Science (IDS) team is dedicated to enabling improved de...Show more
Last updated: 30+ days ago • Promoted
Principal AI Cybersecurity Engineer

Principal AI Cybersecurity Engineer

Teladoc Health • Purchase, NY, United States
Full-time
Teladoc Health is a global, whole person care company made up of a diverse community of people dedicated to transforming the healthcare experience. As an employee, you're empowered to show up every ...Show more
Last updated: 21 hours ago • Promoted • New!
ML Architect

ML Architect

Openkyber • NY, United States
Full-time
Quick Apply
Title : AI / ML Full Stack Engineer Location : New York City, New York Duration : Long-term Required : Proven experience with pro...Show more
Last updated: 2 days ago
AI Agent Engineer II : Build Scalable LLM Agents

AI Agent Engineer II : Build Scalable LLM Agents

Dun & Bradstreet • Florham Park, NJ, United States
Full-time
A global data analytics firm based in Florham Park, NJ, is seeking an AI Engineer II to design and implement agentic AI systems that enhance product capabilities. The ideal candidate will have an ad...Show more
Last updated: 2 days ago • Promoted
AI / ML Architect

AI / ML Architect

Precision Technologies Corp • Tarrytown, NY, United States
Full-time
Quick Apply
Role : AI / ML Architect Location : Tarrytown NY 10591 Role Overview : < / ...Show more
Last updated: 1 hour ago • New!
AI / ML Engineer

AI / ML Engineer

Purple Drive • Jersey City, NJ, New Jersey, USA
Full-time
Required Skills Show more
Last updated: 28 days ago