Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Methuen, Massachusetts, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Methuen, Massachusetts, US
19 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Methuen, Massachusetts, US

    Related jobs
    Mid-Level AI Engineer

    Mid-Level AI Engineer

    Credence • Bedford, Massachusetts, US
    Full-time
    Overview Credence is one of the largest privately held technology services firms in the U.Top Workplace and featured on the Inc. Fastest-Growing Private Companies list for 12 consecutive years.We se...Show more
    Last updated: 13 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Givzey • Newton, Massachusetts, United States
    Full-time
    We're looking for a talented and motivated Machine Learning Engineer to join our team and help develop cutting-edge AI solutions. In this role, you'll have the opportunity to shape and create our ma...Show more
    Last updated: 30+ days ago • Promoted
    Performance Audit Senior Consultant (Remote)

    Performance Audit Senior Consultant (Remote)

    Blue Cross Blue Shield Association • Manchester, NH, United States
    Remote
    Full-time
    The role ensure Plans' operations handle customer interactions accurately and promptly, leaving a positive brand impression. Execute audit programs and validate reported results.Present updates and ...Show more
    Last updated: 21 days ago • Promoted
    Senior AI Engineer Salesforce & Applied Intelligence

    Senior AI Engineer Salesforce & Applied Intelligence

    ZoomInfo Technologies • Waltham, Massachusetts, United States
    Full-time
    We’re transforming how Go-To-Market (GTM) teams — across.Sales, Marketing, and Customer Experience.AI to drive smarter, faster, and more predictive decisions. Our mission is to build intelligent sys...Show more
    Last updated: 29 days ago • Promoted
    Principal Data Scientist in Epidemiology and Patient Data Products

    Principal Data Scientist in Epidemiology and Patient Data Products

    Valo Health • Lexington, Massachusetts, United States
    Remote
    Full-time +1
    Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster.The company’s Opal Computational Platform transforms drug discovery and development th...Show more
    Last updated: 19 days ago • Promoted
    Travel Surgical Tech - $1,796 per week in Nashua, NH

    Travel Surgical Tech - $1,796 per week in Nashua, NH

    AlliedTravelCareers • Manchester, New Hampshire, US
    Full-time
    AlliedTravelCareers is working with Triage Staffing LLC to find a qualified Surg Tech in Nashua, New Hampshire, 03060!.Shift Details : 12H Days (7 : 00 AM-7 : 30 PM).Length : 13 WEEKS 13 wee...Show more
    Last updated: 22 hours ago • Promoted • New!
    Senior Director, Software Engineering & AI

    Senior Director, Software Engineering & AI

    Entegris • Billerica, MA, United States
    Full-time
    Senior Director, Software Engineering & AI.Here at Entegris, we use advanced science to enable technologies that transform the world, and we are seeking employees who have the drive to continue tha...Show more
    Last updated: 30+ days ago • Promoted
    Principal Test Engineer

    Principal Test Engineer

    The Computer Merchant, LTD. • Tewksbury, MA, US
    Temporary
    JOB TITLE : PRINCIPAL TEST ENGINEER LOCATION : TEWKSBURY, MA WAGE RANGE • : $90.PER HOUR JOB NUMBER : 14936037 REQUIRED EXPERIENCE : Typically requires a Bachelor's in Science, Technology, Engineering, o...Show more
    Last updated: 30+ days ago • Promoted
    Expert Prompt Curators for Advanced AI Evaluation Dataset

    Expert Prompt Curators for Advanced AI Evaluation Dataset

    Mercor • Lowell, Massachusetts, US
    Remote
    Full-time
    Role Overview • • Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge a...Show more
    Last updated: 19 hours ago • Promoted • New!
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Lowell, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Staff Applied Scientist

    Staff Applied Scientist

    Relativity • Manchester, NH, United States
    Full-time
    At Relativity, we're building a world-class Applied Science team to push the boundaries of intelligent systems in the legal domain. We're looking for a Staff Applied Scientist to join our team.Agent...Show more
    Last updated: 1 day ago • Promoted
    Automation Machine Vision Engineer

    Automation Machine Vision Engineer

    DEKA Research and Development • Manchester, NH, United States
    Full-time
    DEKA Research & Development, located in Manchester, NH, is seeking an Automation Machine Vision Engineer to join our industrial automation team developing vision inspection systems for medical devi...Show more
    Last updated: 30+ days ago • Promoted
    Cath Lab Tech - Electrophysiology (EP) Technologist

    Cath Lab Tech - Electrophysiology (EP) Technologist

    Massachusetts General Brigham - 100 Brigham Way • Westwood, MA, United States
    Full-time
    Massachusetts General Brigham - 100 Brigham Way.Electrophysiology (EP) Technologist.Cath Lab Tech - Electrophysiology (EP) Technologist. A Cath Lab Tech assists in cardiac catheterization procedures...Show more
    Last updated: 17 days ago • Promoted
    Travel Certified Surgical Technologist (CST) - $2,253 per week in Lebanon, NH

    Travel Certified Surgical Technologist (CST) - $2,253 per week in Lebanon, NH

    AlliedTravelCareers • Manchester, New Hampshire, US
    Full-time
    AlliedTravelCareers is working with AHS Staffing to find a qualified CST in Lebanon, New Hampshire, 03756!.AHS Staffing is looking for a CST Certified Surgical Technologist in Lebanon, NH for a Lon...Show more
    Last updated: 30+ days ago • Promoted
    Histocompatibility Lab Director

    Histocompatibility Lab Director

    American Red Cross • Dedham, MA, United States
    Full-time
    As one of the nation's premier humanitarian organizations, the American Red Cross is dedicated to helping people in need throughout the United States and, in association with other Red Cross networ...Show more
    Last updated: 30+ days ago • Promoted
    Travel Electrophysiology Technologist (EP Lab) - $2,480 per week

    Travel Electrophysiology Technologist (EP Lab) - $2,480 per week

    GLC On-The-Go • Manchester, NH, United States
    Full-time
    GLC On-The-Go is seeking a travel Electrophysiology Technician for a travel job in Manchester, New Hampshire.Job Description & Requirements. Pay package is based on 8 hour shifts and 40 hours per we...Show more
    Last updated: 1 day ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Methuen, Massachusetts
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Innovation Cup 2026 - Team Synthetic Biology (all genders)

    Innovation Cup 2026 - Team Synthetic Biology (all genders)

    Merck KGaA • Manchester, NH, United States
    Full-time
    Ready to explore, break barriers, and discover more? We know you've got big plans - so do we! Our colleagues across the globe love innovating with science and technology to enrich people's lives wi...Show more
    Last updated: 16 days ago • Promoted