Company :
Anomali is headquartered in Silicon Valley and is the leading AI-Powered Security Operations Platform that is modernizing security operations. At the center of it is an omnipresent, intelligent, and multilingual Anomali Copilot that automates important tasks and empowers your team to deliver the requisite risk insights to management and the board in seconds. The Anomali Copilot navigates a proprietary cloud-native security data lake that consolidates legacy attempts at visibility and provides first-in-market speed, scale, and performance while reducing the cost of security analytics. Anomali combines ETL, SIEM, XDR, SOAR, and the largest repository of global intelligence in one efficient platform. Protect and drive your business with better productivity and talent retention.
Do more with less. Be Different. Be the Anomali.
Job Description
We're looking for a Senior Engineer, AI Evaluation & Reliability, to lead the design and execution of evaluation, quality assurance, and release gating for our agentic AI features.
You'll develop the pipelines, datasets, and dashboards that measure and improve agent performance across real-world SOC workflows, ensuring every release is safe, reliable, efficient, and production-ready.
You will guarantee that our agentic AI features operate at full production scale, ingesting and acting on millions of SOC alerts per day with measurable impact on analyst productivity and risk mitigation. This role partners closely with the Product team to deliver operational excellence and trust in every AI-driven capability.
Key Responsibilities :
o Define quality metrics : Translate SOC use cases into measurable KPIs (e.g., precision / recall, MTTR, false-positive rate, step success, latency / cost budgets).
o Build continuous evaluations : Develop offline / online evaluation pipelines, regression suites, and A / B or canary tests; integrate them into CI / CD for release gating.
o Curate and manage datasets : Maintain gold-standard datasets and red-team scenarios; establish data governance and drift monitoring practices.
o Ensure safety, reliability, and explainability : Partner with Platform and Security Research to encode guardrails, policy enforcement, and runtime safety checks.
o Expand adversarial test coverage (prompt injection, data exfiltration, abuse scenarios).
o Ensure explainability and auditability of agent decisions, maintaining traceability and compliance of AI-driven workflows.
o Production reliability & observability : Monitor and maintain reliability of agentic AI features post-release, define and uphold SLIs / SLOs, establish alerting and rollback strategies, and conduct incident post-mortems.
o Design and implement infrastructure to scale evaluation and production pipelines for real-time SOC workflows across cloud environments.
o Drive agentic system engineering : Experiment with multi-agent systems, tool-using language models, retrieval-augmented workflows, and prompt orchestration.
o Manage the model and prompt lifecycle : track versions, rollout strategies, and fallbacks; measure impact through statistically sound experiments.
o Collaborate cross-functionally : Work with Product, UX, and Engineering to prioritize high-leverage improvements, resolve regressions quickly, and advance overall system reliability.
Qualifications
Required Skills and Experience
o 5 years building evaluation or testing infrastructure for ML / LLM systems or large-scale distributed systems.
o Proven ability to translate product requirements into measurable metrics and test plans.
o Strong Python skills (or similar language) and experience with modern data tooling.
o Hands-on experience running A / B tests, canaries, or experiment frameworks.
o Experience defining and maintaining operational reliability metrics (SLIs / SLOs) for AI-driven systems.
o Familiarity with large-scale distributed or streaming systems serving AI / agent workflows (millions of events or alerts / day).
o Excellent communication skills; able to clearly convey technical results and trade-offs to engineers, PMs, and analysts.
o This position is not eligible for employment visa sponsorship. The successful candidate must not now, or in the future, require visa sponsorship to work in the US.
Preferred Qualifications
o Experience evaluating or deploying agentic or tool-using AI systems (multi-agent orchestration, retrieval-augmented reasoning, prompt lifecycle management).
o Familiarity with LLM evaluation frameworks (e.g., model-graded evals, pairwise / rubric scoring, preference learning).
o Exposure to AI safety testing, including prompt injection, data exfiltration, abuse taxonomies, and resilience validation.
o Understanding of explainability and compliance requirements for autonomous workflows, ensuring traceability and auditability of AI behavior.
o Background in security operations, incident response, or enterprise automation; comfortable interpreting logs, alerts, and playbooks.
o Startup experience delivering high-impact systems in fast-paced, evolving environments.
Equal Opportunities Monitoring
We are an Equal Opportunity Employer. It is our policy to ensure that all eligible persons have equal opportunity for employment and advancement on the basis of their ability, qualifications, and aptitude. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, pregnancy, genetic information, disability, status as a protected veteran, or any other protected category under applicable federal, state, and local laws.
If you are interested in applying for employment with Anomali and need special assistance or accommodation to apply for a posted position, contact our Recruiting team at emailprotected.
Compensation Transparency
$140,000 - $200,000 USD
Please note that the annual base salary range is a guideline, and for candidates who receive an offer, the base pay will vary based on factors such as work location as well as knowledge, skills, and experience. In addition to base pay, this position is eligible for benefits and may be eligible for equity.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment; final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Required Experience :
Senior IC
Key Skills
Kubernetes, FMEA, Continuous Improvement, Elasticsearch, Go, Root Cause Analysis, Maximo, CMMS, Maintenance, Mechanical Engineering, Manufacturing, Troubleshooting
Employment Type : Full-Time
Experience : years
Vacancy : 1
Annual Salary : 140,000 - 200,000
Engineer, Agentic AI • Redwood City, California, USA