Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Brockton, Massachusetts, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Brockton, Massachusetts, US
1 day ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Brockton, Massachusetts, US

    Related jobs
    Research Lead - Securing Frontier AI

    Research Lead - Securing Frontier AI

    RAND Corporation • Boston, MA, United States
    Temporary
    Global and Emerging Risks (GER) division.As Research Lead - Securing Frontier AI, you'll direct a comprehensive research portfolio focused on ensuring that the world's most important AI systems are...Show more
    Last updated: 30+ days ago • Promoted
    Senior PM — AI Safety Platform & Trust

    Senior PM — AI Safety Platform & Trust

    Spotify • Boston, MA, United States
    Full-time
    A leading audio streaming service is seeking a Senior Product Manager to join their AI Safety Platform team.In this role, you'll drive product vision and build systems that ensure AI safety.Ideal c...Show more
    Last updated: 14 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Boston, MA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more
    Last updated: 27 days ago • Promoted
    Biosecurity Research Resident - AI-Enabled Biological Tools

    Biosecurity Research Resident - AI-Enabled Biological Tools

    RAND Corporation • Boston, MA, United States
    Full-time +1
    This position will contribute to groundbreaking research on how advances in artificial intelligence intersect with biotechnology, national security, and innovation. RAND's reputation for excellence ...Show more
    Last updated: 27 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Boston, MA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more
    Last updated: 18 days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Cambridge, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    Senior On-Site AI Engineer

    Senior On-Site AI Engineer

    1872 Consulting • Roxbury, MA, United States
    Full-time
    Cedar Rapids, IA - Fully onsite 5 days per week.Partnering with each project's AI Champion (Project Manager or Superintendent), you'll uncover pain points, redesign workflows, and deploy AI agents ...Show more
    Last updated: 4 days ago • Promoted
    Principal Machine Learning Engineer, AI Inference

    Principal Machine Learning Engineer, AI Inference

    Red Hat • Boston, MA, United States
    Full-time +1
    Principal Machine Learning Engineer, AI Inference page is loaded## Principal Machine Learning Engineer, AI Inferenceremote type : Hybridlocations : Bostonposted on : Posted Todayjob requisition ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Expert II AI and Computational Sciences

    Senior Expert II AI and Computational Sciences

    Novartis Group Companies • Cambridge, MA, United States
    Full-time
    Novartis has embraced a bold strategy to drive a company-wide digital transformation.Our objective is to position Novartis as an industry leader by proactively adopting digital technologies that fo...Show more
    Last updated: 3 days ago • Promoted
    AI Product Leader

    AI Product Leader

    DeepRec.ai • Boston, MA, US
    Full-time
    Senior AI Product Lead <> $200M Backed Materials Discovery Platform <> USA.I'm representing a DeepTech organisation working at the intersection of advanced AI, computational chemistry, ...Show more
    Last updated: 14 days ago • Promoted
    Director, AI-Powered Discovery & Personalization

    Director, AI-Powered Discovery & Personalization

    Scribd, Inc. • Boston, MA, US
    Full-time
    A leading digital content platform is seeking a Director of Product Discovery to lead the strategy and execution of the discovery ecosystem. This role involves overseeing personalized and AI-driven ...Show more
    Last updated: 4 days ago • Promoted
    PwC Technology - Senior AI Engineer

    PwC Technology - Senior AI Engineer

    PwC • Boston, MA, United States
    Full-time
    IFS - Information Technology (IT).At PwC, our people in software and product innovation focus on developing cutting-edge software solutions and driving product innovation to meet the evolving needs...Show more
    Last updated: 4 days ago • Promoted
    AI Product Lead — Hybrid (Onsite / Remote) AI Solutions

    AI Product Lead — Hybrid (Onsite / Remote) AI Solutions

    Ccrps • Boston, MA, United States
    Remote
    Full-time
    A leading educational institution in Boston is seeking an AI Technical Product Manager to bridge the gap between AI capabilities and product applications. The ideal candidate will have extensive exp...Show more
    Last updated: 3 days ago • Promoted
    Remote AI Writing Specialist

    Remote AI Writing Specialist

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    ProFound Therapeutics, Inc. | Boston, MA Senior / Principal Scientist, Biostatistics & Computatio[...]

    ProFound Therapeutics, Inc. | Boston, MA Senior / Principal Scientist, Biostatistics & Computatio[...]

    Flagship Pioneering • Boston, MA, United States
    Full-time
    Senior / Principal Scientist, Biostatistics & Computational Biology.ProFound Therapeutics is seeking a pioneering.Senior Scientist / Principal Scientist, Biostatistics & Computational Biology.ProFoundr...Show more
    Last updated: 30+ days ago • Promoted
    Senior Expert II (AI Methods), AI Computational Sciences

    Senior Expert II (AI Methods), AI Computational Sciences

    Novartis Group Companies • Cambridge, MA, United States
    Full-time
    Novartis has embraced a bold strategy to drive a company-wide digital transformation.Our objective is to position Novartis as an industry leader by proactively adopting digital technologies that fo...Show more
    Last updated: 3 days ago • Promoted
    Research Lead - AI Cyber Testing & Evaluation

    Research Lead - AI Cyber Testing & Evaluation

    RAND Corporation • Boston, MA, United States
    Temporary
    Global and Emerging Risks (GER) division.As Research Lead - AI Cyber Testing & Evaluation, you'll direct a comprehensive research portfolio focused on assessing the offensive cyber capabilities of ...Show more
    Last updated: 30+ days ago • Promoted
    Computer Aided Drug Design, AI / ML Expert

    Computer Aided Drug Design, AI / ML Expert

    Novartis Group Companies • Cambridge, MA, United States
    Full-time
    At Global Discovery Chemistry (GDC) of Biomedical Research in Cambridge, MA, we are at the heart of Novartis' purpose.We are seeking a highly motivated and passionate researcher with a strong scien...Show more
    Last updated: 30+ days ago • Promoted