Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Bellevue, Nebraska, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Bellevue, Nebraska, US
21 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Bellevue, Nebraska, US

    Related jobs
    Journeyman Intelligence Editor

    Journeyman Intelligence Editor

    The Garrett Group • Bellevue, NE, USA
    Full-time
    Quick Apply
    This position is for proposed future work expected to begin in 2026.Journeyman-Level Intelligence Editor.Top Secret (TS) with SCI eligibility. NC2 / ESI and SAP / STO eligible.The Journeyman-Level Inte...Show more
    Last updated: 30+ days ago
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Bellevue, NE, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Travel Surgical Tech - CVOR - $2086 / Week

    Travel Surgical Tech - CVOR - $2086 / Week

    LRS Healthcare - Allied • Omaha, NE, US
    Full-time
    LRS Healthcare - Allied is seeking an experienced Surgical Tech - CVOR for an exciting Travel Allied job in Omaha, NE.Shift : Inquire Start Date : ASAP Duration : 13 weeks Pay : $2086 / Week.Ready to s...Show more
    Last updated: 2 days ago • Promoted
    Staff AI Research Engineer

    Staff AI Research Engineer

    Lockheed Martin • Offutt A F B, NE, US
    Full-time
    Job ID : 695377BR Date posted : Oct.Description : Who We Are : Lockheed Martin RMS provides mission software for the United States Strategic Command (USSTRATCOM) Nuclear Deterrence Mission.Our amazing ...Show more
    Last updated: 30+ days ago • Promoted
    Director of Data & AI Value and Governance (Omaha, NE)

    Director of Data & AI Value and Governance (Omaha, NE)

    First National Bank of Omaha • Omaha, Nebraska, USA
    Full-time
    At FNBO our employees are the heart of our storyand were committed to their success! Please see below the details of this career opportunity and how it fits into our organizations success.The Direc...Show more
    Last updated: 23 days ago • Promoted
    Managing Director - Practice Lead, AI and Technology Advisory

    Managing Director - Practice Lead, AI and Technology Advisory

    Premier • Omaha, NE, US
    Full-time
    Managing Director - Practice Lead, AI and Technology Advisory.This role extends beyond technical leadership to encompass two parallel transformations : advancing AI capabilities for Premier's health...Show more
    Last updated: 2 days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Bellevue, Nebraska
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Remote AI Content Reviewer

    Remote AI Content Reviewer

    Outlier • Bellevue, NE, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    AI Implementation Engineer

    AI Implementation Engineer

    Fiserv • Omaha, Nebraska, USA
    Full-time
    Calling all innovators find your future at Fiserv.Were Fiserv a global leader in Fintech and payments and we move money and information in a way that moves the world. We connect financial instituti...Show more
    Last updated: 11 days ago • Promoted
    Travel Surgical Tech - CVOR - $2290.57 / Week

    Travel Surgical Tech - CVOR - $2290.57 / Week

    Atlas MedStaff • Omaha, NE, US
    Full-time
    Atlas MedStaff is seeking an experienced Surgical Tech - CVOR for an exciting Travel Allied job in Omaha, NE.Shift : 4x10 hr days Start Date : 12 / 15 / 2025 Duration : 13 weeks Pay : $2290.Atlas Medstaff ...Show more
    Last updated: 21 days ago • Promoted
    Travel Surgical Tech - CVOR - $2285 / Week

    Travel Surgical Tech - CVOR - $2285 / Week

    LRS Healthcare - Allied • Omaha, NE, US
    Full-time
    LRS Healthcare - Allied is seeking an experienced Surgical Tech - CVOR for an exciting Travel Allied job in Omaha, NE.Shift : Inquire Start Date : 12 / 15 / 2025 Duration : 13 weeks Pay : $2285 / Week.Read...Show more
    Last updated: 2 days ago • Promoted
    Senior Associate Dean, Online Product Health

    Senior Associate Dean, Online Product Health

    Southern New Hampshire University • Omaha, NE, United States
    Full-time
    Southern New Hampshire University is a team of innovators.Individuals who believe in progress with purpose.Since 1932, our people-centered strategy has defined us - and helped us grow a team that n...Show more
    Last updated: 3 days ago • Promoted
    Senior Specialist, Technical Evaluations & Proposals

    Senior Specialist, Technical Evaluations & Proposals

    Resilience Corp. • Omaha, NE, United States
    Full-time
    A career at Resilience is more than just a job - it's an opportunity to change the future.Resilience is a technology-focused biomanufacturing company that's changing the way medicine is made.We're ...Show more
    Last updated: 4 days ago • Promoted
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Bellevue, NE, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Staff Applied Scientist

    Staff Applied Scientist

    Relativity • Omaha, NE, United States
    Full-time
    At Relativity, we're building a world-class Applied Science team to push the boundaries of intelligent systems in the legal domain. We're looking for a Staff Applied Scientist to join our team.Agent...Show more
    Last updated: 1 day ago • Promoted
    Sr Statistical R Programmer (North America Only)

    Sr Statistical R Programmer (North America Only)

    Syneos Health / inVentiv Health Commercial LLC • Omaha, NE, United States
    Full-time
    Sr Statistical R Programmer (North America Only).Syneos Health is a leading fully integrated biopharmaceutical solutions organization built to accelerate customer success.We translate unique clin...Show more
    Last updated: 1 day ago • Promoted
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • Bellevue, Nebraska
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 30+ days ago • Promoted
    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Data Annotation • Bellevue, Nebraska
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted