Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Lowell, Massachusetts, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Lowell, Massachusetts, US
1 day ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Lowell, Massachusetts, US

    Related jobs
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Manchester, NH, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Manchester, NH, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    Senior AI Integration Engineer

    Senior AI Integration Engineer

    Openkyber • MA, United States
    Full-time
    Quick Apply
    RESPONSIBILITIES : Kforce has a client in Acton, MA that is seeking a Director of Data and Biostatistics who will lead data operations for the Clinical Affairs team focused on med...Show more
    Last updated: 1 day ago
    Clinical Content- Clinician (AI Strategist - UpToDate®)

    Clinical Content- Clinician (AI Strategist - UpToDate®)

    Wolters Kluwer N.V. • Waltham, MA, United States
    Full-time
    Candidates within commuting distance of a Wolters Kluwer office will be considered for hybrid employment.Candidates not within commuting distances will be considered for remote employment.Wolters K...Show more
    Last updated: 4 days ago • Promoted
    Senior Director, Program Lead - Artificial Intelligence in Drug Discovery (AIDD)

    Senior Director, Program Lead - Artificial Intelligence in Drug Discovery (AIDD)

    Sigma • North Billerica, MA, US
    Full-time
    Dynamic Leader For Next-Generation Drug Discovery Lab.Work Your Magic with us! Ready to explore, break barriers, and discover more? We know you've got big plans so do we! Our colleagues across the...Show more
    Last updated: 26 days ago • Promoted
    Mid-Level AI Engineer

    Mid-Level AI Engineer

    Credence • Bedford, Massachusetts, US
    Full-time
    Overview Credence is one of the largest privately held technology services firms in the U.Top Workplace and featured on the Inc. Fastest-Growing Private Companies list for 12 consecutive years.We se...Show more
    Last updated: 14 days ago • Promoted
    Lead AI Engineer - Semiconductor AI Innovation

    Lead AI Engineer - Semiconductor AI Innovation

    Onto • Wilmington, MA, United States
    Permanent
    Onto Innovation is a leader in process control, combining global scale with an expanded portfolio of leading-edge technologies that include : 3D metrology spanning the chip from nanometer-scale tran...Show more
    Last updated: 4 days ago • Promoted
    Senior UX Researcher (Quantitative & Evaluative Focus)

    Senior UX Researcher (Quantitative & Evaluative Focus)

    Motorola Solutions • Framingham, MA, United States
    Full-time
    At Motorola Solutions, we believe that everything starts with our people.We're a global close-knit community, united by the relentless pursuit to help keep people safer everywhere.Our critical comm...Show more
    Last updated: 17 days ago • Promoted
    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Data Annotation • Manchester, New Hampshire
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 23 days ago • Promoted
    Remote AI Content Reviewer

    Remote AI Content Reviewer

    Outlier • Manchester, NH, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    Senior Assay Development Scientist / Engineer (Burlington)

    Senior Assay Development Scientist / Engineer (Burlington)

    SiPhox Health • Burlington, MA, US
    Part-time
    SiPhox Health is redefining clinical immunoassay diagnostics by miniaturizing the analytical power of a central lab into an accessible, affordable, at-home platform. Our silicon-photonics architectu...Show more
    Last updated: 19 hours ago • Promoted • New!
    Temporary Micro-Credential Grader - Industry-Focused Prompt Engineering for ROI-Driven Results

    Temporary Micro-Credential Grader - Industry-Focused Prompt Engineering for ROI-Driven Results

    Brandeis University • Waltham, MA, United States
    Part-time
    Rabb School of Continuing Studies, Brandeis University.Part-Time, 4 months, varying hours, no more than 25 hours per week. Assistant Dean of Education and Learning Innovation.Brandeis University's R...Show more
    Last updated: 4 days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Manchester, New Hampshire
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 23 days ago • Promoted
    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Mercor • Manchester, New Hampshire, US
    Remote
    Full-time
    Mercor is seeking detail-oriented writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evalu...Show more
    Last updated: 18 hours ago • Promoted • New!
    Remote Private Wealth Management Expert - AI Trainer ($100-$100 per hour)

    Remote Private Wealth Management Expert - AI Trainer ($100-$100 per hour)

    Mercor • Methuen, Massachusetts, US
    Remote
    Full-time
    UK / Canada / Europe / Australia-based • •Private Wealth Management Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have • •at least 2 years of experi...Show more
    Last updated: 18 hours ago • Promoted • New!
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • Manchester, New Hampshire
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 30+ days ago • Promoted
    Remote Biology Expert (PhD) (Lowell)

    Remote Biology Expert (PhD) (Lowell)

    Turing • Lowell, MA, US
    Remote
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 6 hours ago • Promoted • New!
    AI Scientist, Computational Radiology (Associate Director)

    AI Scientist, Computational Radiology (Associate Director)

    AstraZeneca • Waltham, Massachusetts, USA
    Full-time
    Job Title : AI Scientist Computational Biology (Associate Director).Location : Waltham / Boston MA United States.Are you ready to lead the charge in transforming radiology imaging into life-saving biom...Show more
    Last updated: 9 days ago • Promoted