Senior Engineer, AI Evaluation & Reliability (Agentic AI)Anomali • Redwood City, CA, United States

Senior Engineer, AI Evaluation & Reliability (Agentic AI)

Anomali • Redwood City, CA, United States

17 days ago

Job type

Full-time

Job description

Company :

Anomali is headquartered in Silicon Valley and is the Leading AI-Powered Security Operations Platform that is modernizing security operations. At the center of it is an omnipresent, intelligent, and multilingual Anomali Copilot that automates important tasks and empowers your team to deliver the requisite risk insights to management and the board in seconds. The Anomali Copilot navigates a proprietary cloud-native security data lake that consolidates legacy attempts at visibility and provides first-in-market speed, scale, and performance while reducing the cost of security analytics. Anomali combines ETL, SIEM, XDR, SOAR, and the largest repository of global intelligence in one efficient platform. Protect and drive your business with better productivity and talent retention.

Do more with less. Be Different.

Be the Anomali.

Learn more at http : / / www.anomali.com .

Job Description

We're looking for a Senior Engineer, AI Evaluation & Reliability to lead the design and execution of evaluation, quality assurance, and release gating for our agentic AI features.

You'll develop the pipelines, datasets, and dashboards that measure and improve agent performance across real-world SOC workflows ensuring every release is safe, reliable, efficient, and production-ready.

You will guarantee that our agentic AI features operate at full production scale, ingesting and active on millions of SOC alerts per day, with measurable impact on analyst productivity and risk mitigation. This role partners closely with the Product team to deliver operational excellence and trust in every AI-drive capability.

Key Responsibilities :

Define quality metrics : Translate SOC use cases into measurable KPI's (e.g., precision / recall, MTTR, false-positive rate, step success, latency / cost budgets).

Build continuous evaluations : Develop offine / online evaluation pipelines, regression suites, and A / B or canary test; integrate them into CI / CD for release gating.

Curate and manage datasets : Maintain gold-standard datasets and red-team scenarios; establish data governance and drift monitoring practices.

Ensure safety, reliability, and explainability : Partner with Platform and Security Research to encode guardrails, policy enforcement, and runtime safety checks.

o Expand adversarial test coverage (prompt injection, data exfiltration, abuse scenarios).

o Ensure

explainability and auditability of agent decisions, maintaining traceability and compliance of AI-driven workflows.

Production reliability & observability : Monitor and maintain reliability of agentic AI features post-release define and uphold SLIs / SLOs, establish alerting and rollback strategies, and conduct incident post-mortems.

o Design and implement infrastructure to scale evaluation and production pipelines for real-time SOC workflows across cloud environments.

Drive agentic system engineering : Experiment with multi-agent systems, tool-using language models, retrieval-augmented workflows, and prompt orchestration.

o Manage model and prompt lifecycle track version, rollout strategies, and fallbacks; measure impact through statistically sound experiments.

Collaborate cross-functionally : Work with Product, UX and Engineering to prioritize high-leverage improvements, resolve regressions quickly, and advance overall system reliability.

Qualifications

Required Skills and Experience

o 5+

years building evaluation or testing infrastructure for ML / LLM systems or large-scale distributes systems.

o Proven ability to translate product requirements into measurable metrics and test plans.

o Strong Python skills (or similar language) and experience with modern data tooling.

o Hands-on experience running A / B tests, canaries, or experiment frameworks.

o Experience defining and maintaining operational reliability metrics (SLIs / SLOs) for AI-driven systems.

o Familiarity with large-scale distributed or streaming systems serving AI / agent workflows (millions of events or alerts / day).

o Excellent communication skills able to clearly convey technical results and trade-offs to engineer, PMs, and analysts.

o This position is not eligible for employment visa sponsorship. The successful candidate must not now, or in the future, require visa sponsorship to work in the US

Preferred Qualifications

o Experience evaluating or deploying

agentic or tool-using AI systems (multi-agent orchestration, retrieval-augmented reasoning, prompt lifecycle management).

o Familiarity with

LLM evaluation frameworks (e.g., model-graded evals, pairwise / rubric scoring, preference learning).

o Exposure to

AI safety testing, including prompt injection, data exfiltration, abuse taxonomies, and resilience validation.

o Understanding of

explainability and compliance requirements for autonomous workflows, ensuring traceability and auditability of AI behavior.

o Background in

security operations, incident response, or enterprise automation; comfortable interpreting logs, alerts, and playbooks.

o Startup experience delivering high-impact systems in fast-faced, evolving environments.

Equal Opportunities Monitoring

We are an Equal Opportunity Employer. It is our policy to ensure that all eligible persons have equal opportunity for employment and advancement on the basis of their ability, qualifications, and aptitude. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status at a protected veteran, or any other protected category under applicable federal, state, and local laws.

If you are interested in applying for employment with Anomali and need special assistance or accommodation to apply for a posted position, contact our Recruiting team at recruiting@anomali.com .

Compensation Transparency

$140,000 - $200,000 USD

Please note that the annual base salary range is a guideline and, for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as, knowledge, skills and experience of the candidate. In addition to base pay, this position is eligible for benefits, and may be el igible for equity.

Create a job alert for this search

Engineer Agentic Ai • Redwood City, CA, United States

Related jobs

PwC Technology - Senior AI Engineer

PwC • San Francisco, CA, United States

Full-time

IFS - Information Technology (IT).At PwC, our people in software and product innovation focus on developing cutting-edge software solutions and driving product innovation to meet the evolving needs...Show more

Last updated: 18 days ago • Promoted

Senior AI Engineer

Arbital Health • San Francisco, CA, United States

Full-time

Arbital Health is a rapidly growing healthcare technology and actuarial leader that centralizes, measures, and adjudicates value-based care contracts at scale. We enable payers and providers to desi...Show more

Last updated: 9 days ago • Promoted

Senior AI Engineer - GenAI Platform & Integrations

Morrison & Foerster LLP • San Francisco, CA, United States

Full-time

A leading law firm in California seeks a Senior AI Engineer to develop and manage its innovative generative-AI platform.You will design integrations with major enterprise systems, lead evaluations ...Show more

Last updated: 5 days ago • Promoted

Senior Lead AI Engineer

Capital One National Association • San Francisco, CA, United States

Full-time

At Capital One, we are creating responsible and reliable AI systems, changing banking for good.For years, Capital One has been an industry leader in using machine learning to create real‑time, pers...Show more

Last updated: 10 days ago • Promoted

Lead Generative AI Engineer

Madison-Davis, LLC • San Francisco, CA, United States

Full-time

We’re supporting a major global financial technology organization that’s making significant investments in AI innovation. They’re scaling their engineering teams across North America to drive develo...Show more

Last updated: 30+ days ago • Promoted

Senior AI Solutions Engineer (Forward Deployed)

Intercom • San Francisco, CA, United States

Full-time

A customer service technology company is seeking a Senior Forward Deployed Engineer to drive adoption of their AI solutions within strategic customer engagements in San Francisco.You will work clos...Show more

Last updated: 1 day ago • Promoted

Senior AI Engineer

You.com • San Francisco, CA, United States

Full-time

Get AI-powered advice on this job and more exclusive features.AI-powered search and productivity platform designed to empower users with personalized, efficient, and trustworthy search experiences....Show more

Last updated: 30+ days ago • Promoted

Senior Lead AI Engineer (AI Foundations)

Capital One • San Francisco, CA, United States

Full-time +1

Senior Lead AI Engineer (AI Foundations).At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using m...Show more

Last updated: 18 days ago • Promoted

Senior AI Platform Engineer

University of California, Riverside • Oakland, CA, United States

Full-time

The Senior AI Platform Engineer is responsible for the technical design, development, and implementation of a comprehensive and scalable Generative AI platform for UC Riverside's faculty, staff, an...Show more

Last updated: 30+ days ago • Promoted

Senior AI Engineer

Oracle • Redwood City, California, USA

Full-time

Design develop troubleshoot and debug software programs for databases applications tools networks etc.As part of this team you will be designing and developing new applications that will serve as t...Show more

Last updated: 16 days ago • Promoted

Senior Machine Learning Engineer, AI Collaboration

Snowflake Computing • Menlo Park, CA, United States

Full-time

Snowflake is about empowering enterprises to achieve their full potential - and people too.With a culture that's all in on impact, innovation, and collaboration, Snowflake is the sweet spot for bui...Show more

Last updated: 8 days ago • Promoted

Senior AI Engineer – LLMs & Backend Systems

Heaven and Hire Recruiting • San Francisco, CA, United States

Full-time

San Francisco, CA (On-site; relocation assistance available).Sponsorship available for most types.AI system that will redefine the future of talent evaluation. You’ll join at a pivotal growth stage ...Show more

Last updated: 30+ days ago • Promoted

Senior Agentic AI Developer

GoFundMe • San Francisco, CA, United States

Full-time

Want to help us help others? We’re hiring!.GoFundMe is the world’s most powerful community for good, dedicated to helping people help each other. By uniting individuals and nonprofits in one place, ...Show more

Last updated: 10 days ago • Promoted

Senior AI / ML Engineer

Sigma Computing • San Francisco, CA, United States

Full-time

At Sigma, we're not just adding AI-we're building the future of how people work with data.Our platform already lets users explore billions of rows of data in seconds with a spreadsheet-like interfa...Show more

Last updated: 30+ days ago • Promoted

Remote Research Engineer : Agentic AI Evaluations

HUD • San Francisco, CA, United States

Remote

Full-time

A technology startup in San Francisco seeks a research engineer to build evaluation environments for AI systems.The candidate should have proficiency in Python, Docker, and Linux.Experience with LL...Show more

Last updated: less than 1 hour ago • Promoted • New!

Senior AI Platform Engineer

Synagi • San Francisco, CA, United States

Full-time

At Synagi, we are pushing the frontier of distributed and decentralised AI agents.Our research spans vector-driven retrieval systems, agentic swarms, and resource-efficient multi-agent architecture...Show more

Last updated: 7 days ago • Promoted

Senior AI Engineer

LegalOn Technologies • San Francisco, CA, United States

Full-time

LegalOn Technologies is a global leader in legal AI solutions, trusted by more than 7,000 companies and law firms worldwide. Our software empowers legal teams to review and negotiate contracts faste...Show more

Last updated: 1 day ago • Promoted

Senior Developer Advocate - Generative AI

Datadog • San Francisco, CA, United States

Full-time

Our Senior Developer Advocates are technical leaders and mentors that anchor our team.They own projects from beginning to end, facilitating collaboration and enabling Datadog's community to solve r...Show more

Last updated: 18 days ago • Promoted