Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Brockton, Massachusetts, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Brockton, Massachusetts, US
23 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Brockton, Massachusetts, US

    Related jobs
    Senior PM — AI Safety Platform & Trust

    Senior PM — AI Safety Platform & Trust

    Spotify • Boston, MA, United States
    Full-time
    A leading audio streaming service is seeking a Senior Product Manager to join their AI Safety Platform team.In this role, you'll drive product vision and build systems that ensure AI safety.Ideal c...Show more
    Last updated: 14 days ago • Promoted
    Biology Specialist (PhD) (Boston)

    Biology Specialist (PhD) (Boston)

    Turing • Boston, MA, US
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 1 hour ago • Promoted • New!
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Boston, MA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more
    Last updated: 27 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Boston, MA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more
    Last updated: 17 days ago • Promoted
    AI Product Engineer

    AI Product Engineer

    Datalign Advisory • Cambridge, Massachusetts, United States
    Full-time
    We're looking for an innovative AI Product Engineer to join our team and help build cutting-edge AI-powered solutions.In this role, you'll bridge the gap between machine learning capabilities and p...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer (remote greater Boston area)

    AI Engineer (remote greater Boston area)

    Emburse • Boston, Massachusetts, United States
    Remote
    Full-time
    At Emburse, you’ll not just imagine the future – you’ll build it.As a leader in travel and expense solutions, we are creating a future where technology drives business value and inspires extraordin...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Engineer (Greater Boston area)

    Senior AI Engineer (Greater Boston area)

    Emburse • Boston, Massachusetts, United States
    Full-time
    At Emburse, you’ll not just imagine the future – you’ll build it.As a leader in travel and expense solutions, we are creating a future where technology drives business value and inspires extraordin...Show more
    Last updated: 30+ days ago • Promoted
    Biology Expert (PhD) (Boston)

    Biology Expert (PhD) (Boston)

    Turing • Boston, MA, US
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 1 hour ago • Promoted • New!
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Brockton, Massachusetts
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Senior Expert II (Biomedical AI Methods Expert

    Senior Expert II (Biomedical AI Methods Expert

    Novartis • Cambridge, Massachusetts, United States
    Full-time
    This job is with Novartis, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.Novartis has em...Show more
    Last updated: 23 hours ago • Promoted
    AI Product Leader

    AI Product Leader

    DeepRec.ai • Boston, MA, US
    Full-time
    Senior AI Product Lead <> $200M Backed Materials Discovery Platform <> USA.I'm representing a DeepTech organisation working at the intersection of advanced AI, computational chemistry, ...Show more
    Last updated: 13 days ago • Promoted
    Director, AI-Powered Discovery & Personalization

    Director, AI-Powered Discovery & Personalization

    Scribd, Inc. • Boston, MA, US
    Full-time
    A leading digital content platform is seeking a Director of Product Discovery to lead the strategy and execution of the discovery ecosystem. This role involves overseeing personalized and AI-driven ...Show more
    Last updated: 3 days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Remote AI Writing Specialist

    Remote AI Writing Specialist

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Cd Projekt Red • Boston, MA, United States
    Full-time
    To create revolutionary, story-driven RPGs which go straight to the hearts of gamers — this is our mission.Want to dive deeper into our company’s culture? Explore our social media and check out our...Show more
    Last updated: 30+ days ago • Promoted
    Principal AI Engineer

    Principal AI Engineer

    Validity • Boston, Massachusetts, United States
    Full-time
    We are looking for a Principal AI Engineer who moves fast, experiments relentlessly, and ships agentic AI systems that solve real problems. A doer : you're hands-on with code and you experiment daily...Show more
    Last updated: 14 hours ago • Promoted • New!
    Senior AI Engineer

    Senior AI Engineer

    Klaviyo • Boston, Massachusetts, United States
    Full-time
    At Klaviyo, we believe the future of software lies not in productivity tools for human users but in software that can run and optimize itself based on outcome or reward metrics.We’ve built the infr...Show more
    Last updated: 30+ days ago • Promoted
    Senior Expert II (AI Methods), AI Computational Sciences

    Senior Expert II (AI Methods), AI Computational Sciences

    Novartis Group Companies • Cambridge, MA, United States
    Full-time
    Novartis has embraced a bold strategy to drive a company-wide digital transformation.Our objective is to position Novartis as an industry leader by proactively adopting digital technologies that fo...Show more
    Last updated: 2 days ago • Promoted