AI Agent Evaluation Analyst

Mindrift
RI, US
2 days ago
Job type
  • Part-time
  • Permanent
  • Remote
Job description

This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What we do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we're looking for:

We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate.

Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

  • Analysts, researchers, or consultants with strong critical thinking skills.
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig.
  • People open to a part-time and non-permanent opportunity.

About the project:

We’re looking for QA specialists to evaluate autonomous AI agents on a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. The project is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What you’ll be doing:

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
  • Identifying inconsistencies, missing assumptions, or unclear decision points.
  • Helping define clear expected behaviors (gold standards) for AI agents.
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly.
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage.

How to get started:

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements

  • Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.
  • Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.
  • Familiarity with structured data formats: Can read, though not necessarily write, JSON / YAML.
  • Ability to assess scenarios holistically: What’s missing, what’s unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings.

We also value applicants who have:

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
  • Background in consulting, academia, olympiads (e.g. logic / math / informatics), or research.
  • Exposure to LLMs, prompt engineering, or AI-generated content.
  • Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).

Benefits

  • Get paid for your expertise, with rates that can go up to $55 / hour depending on your skills, experience, and project needs.
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.