LLM or GenAI Application Engineer

FocusKPI Inc.Mountain View, CA, US

30+ days ago

Job type

Temporary

Quick Apply

Job description

FocusKPI is looking for an LLM or GenAI Application Engineer to join one of our clients, a high-tech SaaS company.

An LLM or GenAI Application Engineer or LLM Research Engineer role is mainly focused on the consumer-facing applications.

The role will be a part of the core GenAI AI development team within the client.

It primarily contributes to LLM-based application development, evaluation, and testing of new features, as well as core technology, such as an agent framework, utilizing the latest technology. stack, LLM technology.

Work Location :

Mountain View, CA Duration : 12-month contract; Hybrid role (4 days per week onsite) Pay Range : $110 / hr to $120 / hr
No C2C resumes are considered
Role & Responsibilities : Design, train, and fine-tune large language models (e.g., GPT, LLaMA, PaLM) for various applications.
Research cutting-edge techniques in natural language processing (NLP) and machine learning to improve model performance.
Explore advancements in transformer architectures, multi-modal models, and emergent AI behaviors.
Collect, clean, and preprocess large-scale text datasets from diverse sources.
Develop and implement data augmentation techniques to improve training data quality.
Ensure data is free from bias and aligned with ethical AI standards.
Optimize model architecture to improve accuracy, efficiency, and scalability.
Implement techniques to reduce latency, memory footprint, and inference time for real-time applications.
Collaborate with MLOps teams to deploy LLMs into production environments using Docker, Kubernetes, and cloud.
Develop robust evaluation pipelines to measure model performance using key metrics like accuracy, perplexity, BLEU, and F1 score.
Continuously test for bias, fairness, and robustness of language models across diverse datasets.
Conduct A / B testing to evaluate model improvements in real-world applications.
Stay updated with the latest advancements in generative AI, transformers, and NLP research.

Contribute to research papers, patents, and open-source projects—present findings and insights at conferences and internal knowledge-sharing sessions. Qualifications :

Required to have 5-7 years of industrial work experience along with research / academic experience.

Advanced degree in Computer Science, Artificial Intelligence, Data Science, or a related field.

Strong programming skills .

Expertise with LLM and GenAI application development.

Experience with deep learning frameworks such as TensorFlow, PyTorch, or JAX.

Hands-on experience with transformer-based models (e.g., GPT, BERT, RoBERTa, LLaMA).

Expertise in natural language processing (NLP) and sequence-to-sequence models .

Familiarity with Hugging Face libraries and OpenAI APIs.

Experience with MLOps tools like Docker, Kubernetes, and CI / CD pipelines.

Strong understanding of distributed computing and GPU acceleration using CUDA.

Knowledge of reinforcement learning and RLHF (Reinforcement Learning with Human Feedback).

Top 3 skills (must have) :

The candidate must have a real (actual) product experience, application development, and GenAI-based application shipping.

Actual or Industrial LLM or GenAI application experience of at least 2-3 years

No C2C resumes are considered

Thank you!

FocusKPI Hiring Team Founded in 2010, FocusKPI, Inc. (FocusKPI) is a data science and technology firm specializing in predictive analytics practice and methodologies.

FocusKPI is a US company headquartered in Silicon Valley, California, with an East Coast office in Boston, Massachusetts.

NOTICE :

Please be aware of fraudulent emails regarding job postings, job offers and fake checks.

FocusKPI's recruiting team will strictly reach out via @focuskpi.com email domain.

If you have received fraudulent emails now or in the past, please report it to https :

/ / reportfraud.ftc.gov / .

The domain @focuskpijobs.com is fraudulent and not related to FocusKPI.

Please do not not reply or communicate to anyone with @focuskpijobs.com.

Create a job alert for this search

Application Engineer • Mountain View, CA, US

Related jobs

Promoted

LLM Training Dataset and Checkpoint Optimization Engineer

Together AISan Francisco, CA, United States

Full-time

AI infrastructure that powers the training of state-of-the-art models.We focus on creating scalable, efficient systems for handling massive datasets and managing large-scale distributed checkpoints...Show moreLast updated: 30+ days ago

Promoted

Lead Application Engineer

DocusignSan Francisco, CA, United States

Full-time

Docusign brings agreements to life.Docusign solutions to accelerate the process of doing business and simplify people’s lives. With intelligent agreement management, Docusign unleashes business‑crit...Show moreLast updated: 4 days ago

Promoted

Remote Investment Analyst – AI Trainer ($50-$60 / hour)

Data AnnotationGilroy, California

Remote

Full-time +1

We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show moreLast updated: 30+ days ago

Promoted

Applied ML / LLM Engineer

PincitesSan Francisco, CA, United States

Full-time

We’re looking for a sharp, ambitious.AI-native products — someone who knows how to turn messy real-world data into performant models, fine-tune and deploy LLMs, and design feedback loops that make ...Show moreLast updated: 4 days ago

Promoted

Applied AI / ML Engineer

Catalyst LabsSan Francisco, CA, United States

Full-time

Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We partner directly with AI-first startups building products powered by LLMs a...Show moreLast updated: 4 days ago

Promoted

Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

Data AnnotationWatsonville, California

Remote

Full-time +1

Promoted

Remote Senior FP&A Analyst - AI Trainer ($50-$60 / hour)

Data AnnotationWatsonville, California

Remote

Full-time +1

Promoted

Senior LLM Engineer

ConvivaFoster City, CA, United States

Full-time

Conviva is the first and best place to go to understand and optimize digital customer experiences.Our Operational Data Platform harnesses full-census, comprehensive client-side telemetry—capturing ...Show moreLast updated: 30+ days ago

Promoted

LLM Training Frameworks and Optimization Engineer

Together AISan Francisco, CA, United States

Full-time

LLM Training Frameworks and Optimization Engineer.LLM Training Frameworks and Optimization Engineer.LLM Training Frameworks and Optimization Engineer. LLM Training Frameworks and Optimization Engine...Show moreLast updated: 30+ days ago

Promoted

Aderant Application Integration Engineer

McDermott Will & SchulteSan Francisco, CA, United States

Full-time

Aderant Application Integration Engineer.Build your big career with the firm that does.McDermott Will & Schulte is a leading global law firm that brings together more than 1,750 lawyers and 3,200 b...Show moreLast updated: 4 days ago

Promoted

LLM Training Frameworks and Optimization Engineer

TogetherSan Francisco, CA, United States

Full-time

LLM Training Frameworks and Optimization Engineer.We focus on optimizing training frameworks, algorithms, and infrastructure to push the boundaries of AI performance, scalability, and cost‑efficien...Show moreLast updated: 4 days ago

Promoted

Remote Corporate Development Associate - AI Trainer ($50-$60 / hour)

Data AnnotationWatsonville, California

Remote

Full-time +1

Promoted

Applied ML Engineer (LLMs & RAG)

AlldusSan Francisco, CA, United States

Full-time

This role is a combination of research and engineering.We are looking for someone who's a talented software engineer at their core, but has contributed to AI research, especially in the field of RA...Show moreLast updated: 30+ days ago

Promoted

Remote M&A Integration Manager - AI Trainer ($50-$60 / hour)

Data AnnotationWatsonville, California

Remote

Full-time +1

Promoted

Remote M&A Associate - AI Trainer ($50-$60 / hour)

Data AnnotationGilroy, California

Remote

Full-time +1

Promoted

AI Engineer, Applied ML

PerplexitySan Francisco, CA, United States

Full-time

Perplexity is looking for an Applied ML Engineer to design, build, and iterate on cutting‑edge AI models powering our core experience. As an expert in machine learning and artificial intelligence, y...Show moreLast updated: 13 days ago

Promoted

AI Engineer, Multimodal LLMs

Eloquent AI, Inc.San Francisco, CA, United States

Full-time

At Eloquent AI, we build reliable, high-performance AI agents that solve real-world problems.Our mission is to create precise, scalable AI solutions that businesses can depend on.With a culture of ...Show moreLast updated: 2 days ago

Promoted
New!

LLM Engineer : Build, Fine-Tune & Deploy AI

DiceSan Jose, CA, US

Full-time

A technology company based in San Jose, CA, is seeking an experienced LLM Engineer to design, fine-tune, and deploy large language models for production use. The ideal candidate will have deep exper...Show moreLast updated: 8 hours ago