Talent.com
AI Evaluation Analyst

AI Evaluation Analyst

VirtualVocationsArlington, Virginia, United States
1 day ago
Job type
  • Full-time
Job description

A company is looking for an AI Agent Evaluation Analyst.

Key Responsibilities

Review evaluation tasks and scenarios for logic, completeness, and realism

Identify inconsistencies, missing assumptions, or unclear decision points

Help define clear expected behaviors (gold standards) for AI agents

Required Qualifications

Excellent analytical thinking regarding complex systems and logical implications

Familiarity with structured data formats, such as JSON / YAML

Experience with policy evaluation, logic puzzles, or structured scenario design

Background in consulting, academia, or research

Some understanding of scoring or evaluation in agent testing

Create a job alert for this search

Ai Analyst • Arlington, Virginia, United States

Related jobs
  • Promoted
Lead Data Engineer (Intelligent Foundations and Experiences)

Lead Data Engineer (Intelligent Foundations and Experiences)

Capital OneBALTIMORE, Maryland, United States
Full-time +1
Lead Data Engineer (Intelligent Foundations and Experiences).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborati...Show moreLast updated: 1 day ago
  • Promoted
Senior Data Engineer (Intelligent Foundations and Experiences)

Senior Data Engineer (Intelligent Foundations and Experiences)

Capital OneBALTIMORE, Maryland, United States
Full-time +1
Senior Data Engineer (Intelligent Foundations and Experiences).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collabora...Show moreLast updated: 1 day ago
  • Promoted
Remote Commercial Banking Analyst - AI Trainer

Remote Commercial Banking Analyst - AI Trainer

Data AnnotationAlexandria, Virginia
Remote
Full-time +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
  • Promoted
AI Agent Evaluation Analyst

AI Agent Evaluation Analyst

VirtualVocationsAlexandria, Virginia, United States
Full-time
A company is looking for an AI Agent Evaluation Analyst.Key Responsibilities Review evaluation tasks and scenarios for logic, completeness, and realism Identify inconsistencies, missing assumpti...Show moreLast updated: 1 day ago
  • Promoted
Remote AI Writing Evaluator

Remote AI Writing Evaluator

OutlierBaltimore, MD, United States
Remote
Full-time
Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show moreLast updated: 5 days ago
  • Promoted
Research Lead - AI Cyber Testing & Evaluation

Research Lead - AI Cyber Testing & Evaluation

RAND CorporationWashington, DC, United States
Temporary
Global and Emerging Risks (GER) division.As Research Lead - AI Cyber Testing & Evaluation, you'll direct a comprehensive research portfolio focused on assessing the offensive cyber capabilities of ...Show moreLast updated: 30+ days ago
Research Analyst Level 2

Research Analyst Level 2

Think Tank, Inc.Silver Spring, MD, USA
Full-time
Quick Apply
The Research Analyst Level 2 will support the Northeast Fisheries Science Center (NEFSC) by providing scientific analysis and research to assess the interactions between marine development activiti...Show moreLast updated: 30+ days ago
Exploitation Analyst

Exploitation Analyst

Belay TechnologiesHanover, MD, US
Full-time
Quick Apply
Belay Technologies has been voted Baltimore Business Journal's (BBJ) Best Places to Work 2019, runner up in 2020 and a finalist in 2021!. Belay is hiring an Exploitation Analysts (EA) to work ...Show moreLast updated: 30+ days ago
  • Promoted
AI Evaluation Analyst

AI Evaluation Analyst

VirtualVocationsRockville, Maryland, United States
Full-time
A company is looking for an AI Agent Evaluation Analyst.Key Responsibilities Review evaluation tasks and scenarios for logic, completeness, and realism Identify inconsistencies, missing assumpti...Show moreLast updated: 1 day ago
  • New!
AI Multimedia Generation Product Evaluator Job at ICONMA in Washington

AI Multimedia Generation Product Evaluator Job at ICONMA in Washington

MediabistroWashington, DC, United States
Full-time
Our Client, a Internet Content & Information company, is looking for an AI Multimedia Generation Product Evaluator for their Remote location. The Multimedia Generation team focuses on fine tuning an...Show moreLast updated: 17 hours ago
Artificial Intelligence Engineer / Analyst

Artificial Intelligence Engineer / Analyst

Synertex LLCArlington, VA, USA
Full-time
Quick Apply
Artificial Intelligence Engineer / Analyst.Full-Time | On-site | Position Contingent Upon Award.Join Synertex and apply your expertise to a mission that matters. We're seeking an AI Engineer / Analyst w...Show moreLast updated: 30+ days ago
  • Promoted
Remote Financial Analyst - AI Trainer

Remote Financial Analyst - AI Trainer

Data AnnotationAlexandria, Virginia
Remote
Full-time +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
  • Promoted
Remote Financial Planner - AI Trainer

Remote Financial Planner - AI Trainer

Data AnnotationFrederick, Maryland
Remote
Full-time +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
  • Promoted
Assessment & Evaluation Analyst (Cntr for Research & Reform in Education)

Assessment & Evaluation Analyst (Cntr for Research & Reform in Education)

InsideHigherEdBaltimore, Maryland, United States
Full-time
Assessment & Evaluation Analyst.This position will involve working with multiple studies on an as-needed basis to supplement senior-level project management and research work by resident CRRE staff...Show moreLast updated: 9 days ago
  • Promoted
Senior Agentic AI Test and Evaluation Engineer

Senior Agentic AI Test and Evaluation Engineer

Leidos IncReston, VA, United States
Full-time
At Leidos, you'll contribute to AI solutions that serve critical national and global missions-ranging from defense and intelligence to healthcare, energy, and space exploration.Our work emphasizes ...Show moreLast updated: 30+ days ago
Research Analyst Level 1

Research Analyst Level 1

Think Tank, Inc.Silver Spring, MD, USA
Full-time
Quick Apply
The Research Analyst Level 1 will support the Northeast Fisheries Science Center (NEFSC) by providing scientific analysis and research to assess the interactions between marine development activiti...Show moreLast updated: 30+ days ago
  • New!
AI Multimedia Generation Product Evaluator Job at ICONMA in Baltimore

AI Multimedia Generation Product Evaluator Job at ICONMA in Baltimore

MediabistroBaltimore, MD, United States
Full-time
Our Client, a Internet Content & Information company, is looking for an AI Multimedia Generation Product Evaluator for their Remote location. The Multimedia Generation team focuses on fine tuning an...Show moreLast updated: 17 hours ago
  • Promoted
Assessment & Evaluation Analyst (Cntr for Research & Reform in Education)

Assessment & Evaluation Analyst (Cntr for Research & Reform in Education)

Johns Hopkins UniversityBaltimore, MD, United States
Full-time
Assessment & Evaluation Analyst.This position will involve working with multiple studies on an as-needed basis to supplement senior-level project management and research work by resident CRRE staff...Show moreLast updated: 10 days ago
  • Promoted
Remote Senior Financial Analyst - AI Trainer

Remote Senior Financial Analyst - AI Trainer

Data AnnotationManassas, Virginia
Remote
Full-time +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago
  • Promoted
Remote Finance Advisor - AI Trainer

Remote Finance Advisor - AI Trainer

Data AnnotationFrederick, Maryland
Remote
Full-time +1
We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...Show moreLast updated: 30+ days ago