Talent.com
AI Engineer, Evaluation and Reliability
AI Engineer, Evaluation and ReliabilityMice Groups • Redwood City, CA, US
AI Engineer, Evaluation and Reliability

AI Engineer, Evaluation and Reliability

Mice Groups • Redwood City, CA, US
Hace 5 días
Tipo de contrato
  • Indefinido
Descripción del trabajo

Senior Engineer, AI Evaluation and Reliability / Contract-to-Hire or Direct Hire / Redwood City / Hybrid, onsite 3 days per week / This position pays $70-80 / hr. W2 for Contract, $140-190K annually upon conversion / US Citizens and Green Card holders only Summary : Our client is looking for a Senior Engineer, AI Evaluation & Reliability to lead the design and execution of evaluation, quality assurance, and release gating for our agentic AI features. You'll develop the pipelines, datasets, and dashboards that measure and improve agent performance across real-world SOC workflows ensuring every release is safe, reliable, efficient, and production-ready. You will guarantee that our agentic AI features operate at full production scale, ingesting and active on millions of SOC alerts per day, with measurable impact on analyst productivity and risk mitigation. This role partners closely with the Product team to deliver operational excellence and trust in every AI-drive capability. Responsibilities : Define quality metrics : Translate SOC use cases into measurable KPI's (e.g., precision / recall, MTTR, false-positive rate, step success, latency / cost budgets). Build continuous evaluations : Develop offline / online evaluation pipelines, regression suites, and A / B or canary test; integrate them into CI / CD for release gating. Curate and manage datasets : Maintain gold-standard datasets and red-team scenarios; establish data governance and drift monitoring practices. Ensure safety, reliability, and explainability : Partner with Platform and Security Research to encode guardrails, policy enforcement, and runtime safety checks. Expand adversarial test coverage (prompt injection, data exfiltration, abuse scenarios). Ensure explainability and auditability of agent decisions, maintaining traceability and compliance of AI-driven workflows. Production reliability & observability : Monitor and maintain reliability of agentic AI features post-release define and uphold SLIs / SLOs, establish alerting and rollback strategies, and conduct incident post-mortems. Design and implement infrastructure to scale evaluation and production pipelines for real-time SOC workflows across cloud environments. Drive agentic system engineering : Experiment with multi-agent systems, tool-using language models, retrieval-augmented workflows, and prompt orchestration. Manage model and prompt lifecycle track version, rollout strategies, and fallbacks; measure impact through statistically sound experiments. Collaborate cross-functionally : Work with Product, UX and Engineering to prioritize high-leverage improvements, resolve regressions quickly, and advance overall system reliability. Required Skills : 6+ years building evaluation or testing infrastructure for ML / LLM systems or large-scale distributes system Proven ability to translate product requirements into measurable metrics and test plans. Strong Python skills Strong Experience with modern data tooling Hands-on experience running A / B tests, canaries, or experiment frameworks. Experience defining and maintaining operational reliability metrics (SLIs / SLOs) for AI-driven systems. Familiarity with large-scale distributed or streaming systems serving AI / agent workflows (millions of events or alerts / day). Excellent communication skills able to clearly convey technical results and trade-offs to engineer, PMs, and analysts. Pay for this position is based on market location and may vary depending on job-related knowledge, skills, and experience. As a contractor you may also be eligible for health benefits such as health, dental, and vision as well as access to a 401K plan. A sign-on payment and restricted stock units may be provided as part of the compensation package, in addition to a full range of medical, financial, and / or other benefits, dependent on the position offered by our client. Applicants should apply via The Mice Groups Inc. website (www.micegroups.com) or through this careers site posting. We are an equal opportunity employer and value diversity at The Mice Groups Inc. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Pursuant to the Los Angeles Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. The Mice Groups Inc. values your privacy. Please consult our Candidate Privacy Notice, for information about how we collect, use, and disclose personal information of our candidates. Privacy Policy One of the basic principles The Mice Groups follows in designing and operating this website is that we ask for only the information we need to provide the service you’ve requested. The Mice Groups does not currently collect personal identifying information via its website except (i) to the extent that you provide this information in an online job application and (ii) to the extent that your web browser provides personal identifying information. The Mice Groups will use your personally identifying information solely for the purpose for which you submitted the information. The Mice Groups may, however, aggregate certain elements of your personal identifying information with the information of other users of our website to analyze the usefulness and popularity of various web pages on its website. The Mice Groups reserves the right to change this policy at any time by posting a new privacy policy at this location. Questions regarding this statement should be directed to info@micegroups.com

Crear una alerta de empleo para esta búsqueda

Reliability Engineer • Redwood City, CA, US

Ofertas relacionadas
Enterprise AI Evaluation Engineer for LLMs

Enterprise AI Evaluation Engineer for LLMs

The Rundown AI, Inc. • San Francisco, CA, United States
A tiempo completo
A leading AI solutions company in San Francisco is seeking an AI Research Engineer.This role involves partnering with teams to build evaluation datasets, designing AI evaluation systems, and pursui...Mostrar más
Última actualización: hace 8 horas • Oferta promocionada • Nueva oferta
Reliability Engineer

Reliability Engineer

Periodic Labs • Menlo Park, CA, United States
A tiempo completo
We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries.We are well funded and growing rapidly. Team members are owners who identify and solve prob...Mostrar más
Última actualización: hace 1 día • Oferta promocionada
Lead AI Engineer (AI Foundations)

Lead AI Engineer (AI Foundations)

Capital One • San Jose, CA, United States
A tiempo completo +1
Lead AI Engineer (AI Foundations).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine ...Mostrar más
Última actualización: hace 2 días • Oferta promocionada
Lead AI Engineer

Lead AI Engineer

Backflip • San Francisco, CA, United States
A tiempo completo
Mechanical design, the work done in CAD, is the rate-limiter for progress in the physical world.However, there are only 2-4 million people on Earth who know how to CAD. But what if hundreds of milli...Mostrar más
Última actualización: hace 5 días • Oferta promocionada
Lead Generative AI Engineer

Lead Generative AI Engineer

Madison-Davis, LLC • San Francisco, CA, United States
A tiempo completo
We’re supporting a major global financial technology organization that’s making significant investments in AI innovation. They’re scaling their engineering teams across North America to drive develo...Mostrar más
Última actualización: hace 29 días • Oferta promocionada
Founding Software Engineer - AI Evaluation Infra

Founding Software Engineer - AI Evaluation Infra

MLabs • San Francisco, CA, United States
A tiempo completo
A high-growth AI startup is seeking a Founding Software Engineer to play a key role in building and scaling their AI evaluation infrastructure. The position involves designing systems for AI evaluat...Mostrar más
Última actualización: hace 1 día • Oferta promocionada
Lead AI Engineer / Head of R&D

Lead AI Engineer / Head of R&D

Thoth AI • Alameda, CA, United States
A tiempo completo
To engineer the next generation of.AI-Assisted Human Annotation Systems.Our goal is to scale the production of high-quality, personalized, and safety-aligned datasets. Participate in the customer so...Mostrar más
Última actualización: hace 20 horas • Oferta promocionada • Nueva oferta
Senior Engineer, AI Evaluation & Reliability (Agentic AI)

Senior Engineer, AI Evaluation & Reliability (Agentic AI)

Anomali • Redwood City, CA, United States
A tiempo completo
Anomali is headquartered in Silicon Valley and is the Leading AI-Powered Security Operations Platform that is modernizing security operations. At the center of it is an omnipresent, intelligent, and...Mostrar más
Última actualización: hace 14 días • Oferta promocionada
AI Inference Engineer

AI Inference Engineer

Pantera Capital • San Francisco, CA, United States
A tiempo completo
We are looking for an AI Inference engineer to join our growing team.Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale d...Mostrar más
Última actualización: hace 7 días • Oferta promocionada
Founding Software Engineer - AI Evaluation Platform

Founding Software Engineer - AI Evaluation Platform

MLabs Ltd • San Francisco, CA, United States
A tiempo completo
An innovative AI startup in San Francisco is seeking a Founding Software Engineer to contribute to complex AI evaluation systems. You will take on end-to-end ownership, design scalable solutions acr...Mostrar más
Última actualización: hace 3 días • Oferta promocionada
AI Research Engineer, Enterprise Evaluations

AI Research Engineer, Enterprise Evaluations

The Rundown AI, Inc. • San Francisco, CA, United States
A tiempo completo
Scale AI is seeking a technically rigorous and driven.This high-impact role is critical to our mission of delivering the industry's leading. You will be a hands-on contributor to the core systems th...Mostrar más
Última actualización: hace 8 horas • Oferta promocionada • Nueva oferta
AI Research Engineer, Enterprise Evaluations

AI Research Engineer, Enterprise Evaluations

Scale AI, Inc. • San Francisco, CA, United States
A tiempo completo
Scale AI is seeking a technically rigorous and driven.This high-impact role is critical to our mission of delivering the industry's leading. You will be a hands-on contributor to the core systems th...Mostrar más
Última actualización: hace 8 horas • Oferta promocionada • Nueva oferta
AI Developer Productivity Engineer

AI Developer Productivity Engineer

Zoox • San Mateo, CA, United States
A tiempo completo
Software Engineer, Developer Experience Team.Zoox is developing state-of-the-art autonomous vehicle software for our purpose-built vehicle. We believe that developing the end-to-end product will get...Mostrar más
Última actualización: hace 5 horas • Oferta promocionada • Nueva oferta
Founding AI Engineer

Founding AI Engineer

HartleyCo • San Francisco, CA, United States
A tiempo completo
This range is provided by HartleyCo.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Direct message the job poster from HartleyCo.Headhunter & Di...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Lead AI Engineer for Generative AI Platforms

Lead AI Engineer for Generative AI Platforms

Capital One • San Francisco, CA, United States
A tiempo completo
Are you passionate about building innovative systems and committed to delivering high-quality work? Do you aspire to contribute to transformative projects that aim to revolutionize banking?.As a Le...Mostrar más
Última actualización: hace 3 días • Oferta promocionada
Principal AI Engineer

Principal AI Engineer

Aquent • San Bruno, CA, United States
Indefinido
This role is onsite in San Bruno, CA or Bentonville, AR and relocation is offered • • •.As a Principal Engineer, AI Systems in the Design Organization, you will : . Implement and Productionize AI Systems...Mostrar más
Última actualización: hace 2 días • Oferta promocionada
AI / ML Engineer

AI / ML Engineer

General Motors • San Francisco, CA, United States
A tiempo completo
As an AI / ML Engineer on the Metrics Frameworks team, part of the Simulation, Evaluation, and Data organization, you will be an individual contributor focused on developing and optimizing infrastruc...Mostrar más
Última actualización: hace más de 30 días • Oferta promocionada
Robotics Engagement Lead for AI Data Engine

Robotics Engagement Lead for AI Data Engine

Scale AI, Inc. • San Francisco, CA, United States
A tiempo completo
A leading AI technology firm is seeking an Engagement Manager in San Francisco to drive customer relationships and manage critical operational processes. The ideal candidate has 4-9 years of experie...Mostrar más
Última actualización: hace 5 días • Oferta promocionada