Talent.com
AI Agent Evaluation Analyst
AI Agent Evaluation AnalystVirtualVocations • Dorchester, Massachusetts, United States
No longer accepting applications
AI Agent Evaluation Analyst

AI Agent Evaluation Analyst

VirtualVocations • Dorchester, Massachusetts, United States
4 days ago
Job type
  • Full-time
Job description

A company is looking for an AI Agent Evaluation Analyst.

Key Responsibilities

Review evaluation tasks and scenarios for logic, completeness, and realism

Identify inconsistencies, missing assumptions, or unclear decision points

Help define clear expected behaviors for AI agents

Required Qualifications

Excellent analytical thinking regarding complex systems and logical implications

Familiarity with structured data formats like JSON / YAML

Experience with policy evaluation, logic puzzles, or structured scenario design

Background in consulting, academia, or research

Some understanding of scoring or evaluation in agent testing

Create a job alert for this search

Ai Analyst • Dorchester, Massachusetts, United States

Related jobs
Design Evaluation Engineer Intern

Design Evaluation Engineer Intern

1010 Analog Devices Inc. • Wilmington, MA, United States
Full-time +1
Are you a problem solver looking for a hands-on internship position with a market-leading company that will help develop your career and reward you intellectually and professionally?.NASDAQ : ADI ...Show more
Last updated: 30+ days ago • Promoted
Intelligence Analyst

Intelligence Analyst

US Army • Boston, MA, United States
Part-time +1
Intelligence Analyst Job Overview : We are currently seeking a highly motivated and detail-oriented individuals to join our team. You will play a crucial role in supporting our decision-making proces...Show more
Last updated: 30+ days ago • Promoted
Research Lead - AI Cyber Testing & Evaluation

Research Lead - AI Cyber Testing & Evaluation

RAND • Boston, MA, United States
Temporary
Global and Emerging Risks (GER) division.As Research Lead - AI Cyber Testing & Evaluation, you'll direct a comprehensive research portfolio focused on assessing the offensive cyber capabilities of ...Show more
Last updated: 30+ days ago • Promoted
Research Lead - AI Cyber Testing & Evaluation

Research Lead - AI Cyber Testing & Evaluation

RAND Corporation • Boston, MA, United States
Temporary
RAND's Meselson Center, part of the Global and Emerging Risks (GER) division, is seeking an accomplished technical leader to drive our ambitious AI cyber evaluation agenda.As Research Lead - AI Cyb...Show more
Last updated: 30+ days ago • Promoted
Agentic AI Developer Newton, MA or Raleigh, NC

Agentic AI Developer Newton, MA or Raleigh, NC

Kovan Technology Solutions • MA, United States
Full-time
Quick Apply
MailCompose"> Role : Agentic AI Developer Location : Newton, MA or Raleigh, NC (Hybrid) De...Show more
Last updated: 1 day ago
Lead Data & AI Engineer, Physical Intelligence

Lead Data & AI Engineer, Physical Intelligence

ATI • Waltham, MA, United States
Full-time
We are a cutting-edge Series A company focused on revolutionizing the automotive robotics space.The team consists of experienced robotics, industrial automation, and automotive professionals with d...Show more
Last updated: 13 days ago • Promoted
Design Evaluation Engineer

Design Evaluation Engineer

1010 Analog Devices Inc. • Wilmington, MA, United States
Permanent
NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...Show more
Last updated: 30+ days ago • Promoted
AI Enablement Engineer

AI Enablement Engineer

Pryon • Boston, MA, US
Full-time
We’re a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market.Now we’re building an in...Show more
Last updated: 11 hours ago • Promoted • New!
AI Data Engineer

AI Data Engineer

C the Signs • Boston, MA, US
Full-time
The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle...Show more
Last updated: 14 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Motional • Boston, MA, US
Full-time
We are seeking a highly skilled and motivated Senior Data Analysis Engineer for our large-scale AI model and software evaluation framework – Ground Truth Regression.The ideal candidate will h...Show more
Last updated: 14 days ago • Promoted
Machine Learning Engineer - Generative AI

Machine Learning Engineer - Generative AI

Zoox • Boston, MA, US
Full-time
The Perception team at Zoox is at the forefront of leveraging GenAI to create synthetic data, unlocking scalable training and evaluation for our autonomous system's perception and entire stack....Show more
Last updated: 11 hours ago • Promoted • New!
Design Evaluation (PhD)

Design Evaluation (PhD)

1010 Analog Devices Inc. • Wilmington, MA, United States
Permanent
NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...Show more
Last updated: 30+ days ago • Promoted
Applied AI / ML Engineer

Applied AI / ML Engineer

Catalyst Labs • Boston, MA, US
Full-time
Catalyst Labs is a leading talent agency with a specialized vertical in Applied AI, Machine Learning, and Data Science.We stand out as an agency thats deeply embedded in our clients recruitment ope...Show more
Last updated: 11 hours ago • Promoted • New!
WFH Media Search Analyst - English Speakers

WFH Media Search Analyst - English Speakers

TELUS Digital AI Data Solutions • New Hampshire, United States, United States
Remote
Part-time
Ready to say goodbye to the boring, traditional 9-5 routine and embrace a dynamic and exciting work environment that puts you in control? This position offers you the flexibility to set your own sc...Show more
Last updated: 2 days ago • Promoted
Senior AI Research Engineer, Model Inference (100% Remote)

Senior AI Research Engineer, Model Inference (100% Remote)

Tether Operations Limited • Boston, MA, US
Remote
Full-time
Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Show more
Last updated: 15 days ago
AI and Trading Analytics Researcher, AVP

AI and Trading Analytics Researcher, AVP

State Street • Cambridge, Massachusetts, United States
Full-time
This job is with State Street, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.AI and Trad...Show more
Last updated: 7 days ago • Promoted
AI Engineer Trainee (Remote, Experience with AI Coding)

AI Engineer Trainee (Remote, Experience with AI Coding)

FocusKPI Inc. • Boston, MA, US
Remote
Full-time
Remote in United States (EST Preferred).Full-Time / Unpaid Trainee Program.We only consider your application if you submit your resume and a brief cover letter outlining your experience with releva...Show more
Last updated: 15 days ago • Promoted
AI-Enabled Global Study Data Leader

AI-Enabled Global Study Data Leader

Sanofi • Cambridge, MA, US
Full-time
AI-Enabled Global Study Data Leader.Are you ready to shape the future of medicine? The race is on to speed up drug discovery and development to find answers for patients and their families.Your ski...Show more
Last updated: 29 days ago • Promoted
Intelligence Analyst

Intelligence Analyst

Contact Government Services, LLC • Boston, MA, US
Full-time
Employment Type : Full-Time, Experienced.Contact Government Services is hiring an Intelligence Analyst ready to be a member of a dynamic and fast paced intel analysis program for a federal agency su...Show more
Last updated: 11 hours ago • Promoted • New!
Analyst

Analyst

Rhapsody VP • Cambridge, MA, US
Full-time
Rhapsody Venture Partners is hiring an Analyst to help us bring important scientific inventions to market.Your mission is to find the next startup to join the Rhapsody portfolio! You will nurture a...Show more
Last updated: 30+ days ago • Promoted