Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Trenton, New Jersey, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Trenton, New Jersey, US
1 day ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Trenton, New Jersey, US

    Related jobs
    Travel Cath Lab Tech - $3173.6 / Week

    Travel Cath Lab Tech - $3173.6 / Week

    FlexCare • Somerville, NJ, US
    Full-time
    FlexCare is seeking an experienced Cath Lab Tech for an exciting Travel Allied job in Somerville, NJ.Shift : 4x10 hr days Start Date : 12 / 30 / 2025 Duration : 13 weeks Pay : $3173.Why Clinicians Choose F...Show more
    Last updated: 8 days ago • Promoted
    AI Engineer

    AI Engineer

    Chubb • Readington, New Jersey, United States
    Full-time
    The North America Data Analytics team at Chubb is seeking a AI Engineer with 4+ years of industry experience to join our fast-paced, high-energy team. This team is responsible for delivering generat...Show more
    Last updated: 1 day ago • Promoted
    Data Architect - Semantic & AI Readiness (Ewing)

    Data Architect - Semantic & AI Readiness (Ewing)

    GS1 • Ewing, NJ, US
    Part-time
    This is a contract-based position (contractor) with project funding for 3.The Future of Data Sharing Programme is central to GS1s Vision 2030 building a globally unified, interoperable and trusted...Show more
    Last updated: 3 hours ago • Promoted • New!
    Hiring : Associate Scientist / QC Sample Management || Warren NJ

    Hiring : Associate Scientist / QC Sample Management || Warren NJ

    Sunrise Systems • Warren, New Jersey, United States
    Temporary
    Quick Apply
    Job Title : Associate Scientist / QC Sample Management.Duration : 06 Month Contract (Possible extension based on work performance). Work Schedule : Mon - Fri, Business Hours.Some late nights or weekend t...Show more
    Last updated: 30+ days ago
    Travel Cath Lab Tech - $3163 / Week

    Travel Cath Lab Tech - $3163 / Week

    LRS Healthcare - Allied • Somerville, NJ, US
    Full-time
    LRS Healthcare - Allied is seeking an experienced Cath Lab Tech for an exciting Travel Allied job in Somerville, NJ.Shift : Inquire Start Date : 12 / 30 / 2025 Duration : 13 weeks Pay : $3163 / Week.Ready ...Show more
    Last updated: 2 days ago • Promoted
    QA Lead - Enterprise Data & AI Platform

    QA Lead - Enterprise Data & AI Platform

    Eliassen Group • Trenton, NJ, United States
    Full-time
    QA Lead - Enterprise Data & AI Platform.Our client is a leading institution of higher education dedicated to providing accessible, career-focused learning opportunities for working adults and servi...Show more
    Last updated: 4 days ago • Promoted
    SAFe Scrum Master for AI Projects | Hybrid Contract

    SAFe Scrum Master for AI Projects | Hybrid Contract

    Matlen Silver • Pennington, NJ, United States
    Full-time
    A leading technology solutions provider is seeking an experienced Scrum Master to facilitate Agile processes and manage vendor interactions in a hybrid setting. Candidates should have strong backgro...Show more
    Last updated: 2 days ago • Promoted
    Director of Artificial Intelligence (Bedminster)

    Director of Artificial Intelligence (Bedminster)

    Innovatix Technology Partners • Bedminster, NJ, US
    Full-time +1
    Job title : Director of Innovation (Artificial Intelligence).Mode of work : Hybrid - 3 days per week onsite.We are seeking a visionary and strategic Innovation Director to lead the design, developmen...Show more
    Last updated: less than 1 hour ago • Promoted • New!
    Tech Product Owner - AI

    Tech Product Owner - AI

    InfoVision Inc. • Basking Ridge, NJ, United States
    Full-time
    This role owns the product roadmap for.AI- and data-driven network intelligence solutions.AI-enabled insights for network performance, coverage, and customer experience. Product Vision & Strategy – ...Show more
    Last updated: 1 hour ago • Promoted • New!
    Cytotechnologist

    Cytotechnologist

    LabCorp • Raritan, NJ, United States
    Part-time
    Sign-On Bonus! (External candidates only)_.Are you a certified Cytotechnologist? Are you in school pursuing a Cytotechnologist degree / certification? If so, Labcorp wants to speak with you about exc...Show more
    Last updated: 30+ days ago • Promoted
    Workday AI Talent Acquisition Engineer

    Workday AI Talent Acquisition Engineer

    Purple Drive • Basking Ridge, NJ, United States
    Full-time
    Role : Workday AI Talent Acquisition Engineer.Experienced Workday AI Talent Acquisition Engineer (to work with VZ HR Team). Responsible for designing, configuring, and optimizing Workday Talent Acqui...Show more
    Last updated: 24 days ago • Promoted
    Full-Stack AI Product Developer

    Full-Stack AI Product Developer

    Lawyer.com • Basking Ridge, NJ, US
    Full-time
    Quick Apply
    Climb is a World Media Group company alongside Lawyer.We are a fast paced start-up based in Manhattan.We are developing IoT enabled health and wellness equipment for the workplace of the future, wi...Show more
    Last updated: 30+ days ago
    Engineer

    Engineer

    Quality Talent Group • Falls township, Pennsylvania, United States
    Full-time
    Quick Apply
    Our client is a leading force in advancing safer, smarter AI technology.Their work has been featured in.They’ve built a global community of expert contributors and have already paid out more ...Show more
    Last updated: 7 days ago
    Travel Surgical First Assistant

    Travel Surgical First Assistant

    CT Assist • Browns Mills, NJ, US
    Permanent
    CT Assist is seeking a travel Surgical First Assistant for a travel job in Browns Mills, New Jersey.Job Description & Requirements. Looking for your next locums assignment? We’d love to he...Show more
    Last updated: 30+ days ago • Promoted
    Remote STEM AI QA & Research Specialist

    Remote STEM AI QA & Research Specialist

    RippleMatch Opportunities • Princeton, NJ, United States
    Remote
    Part-time
    A tech evaluation firm is seeking a skilled Research and STEM Expert for a part-time contractor role.This remote position involves evaluating AI-generated outputs in various STEM domains, providing...Show more
    Last updated: 14 days ago • Promoted
    Workday AI Talent Acquisition Engineer

    Workday AI Talent Acquisition Engineer

    E-Solutions • Basking Ridge, NJ, United States
    Full-time
    Role : Workday AI Talent Acquisition Engineer.Experienced Workday AI Talent Acquisition Engineer (to work with VZ HR Team). Responsible for designing, configuring, and optimizing Workday Talent Acqu...Show more
    Last updated: 21 days ago • Promoted
    Senior Statistical Programmer

    Senior Statistical Programmer

    Pharmaron • Franklin Township, NJ, USA
    Full-time
    Quick Apply
    Senior Statistical Programmer .Work authorization sponsorship is not available for this role.Pharmaron is a global CRO (Contract Research Organization) helping pharma and biotech companies bri...Show more
    Last updated: 30+ days ago
    Mercor - STEM Expert, application via RippleMatch

    Mercor - STEM Expert, application via RippleMatch

    RippleMatch Opportunities • Princeton, NJ, United States
    Part-time
    Mercor - STEM Expert, application via RippleMatch.Mercor uses RippleMatch to find top talent.Mercor is seeking a highly skilled. AI evaluation and technical quality assurance team.In this role, you ...Show more
    Last updated: 14 days ago • Promoted