Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Waterbury, Connecticut, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Waterbury, Connecticut, US
21 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Waterbury, Connecticut, US

    Related jobs
    Home-Based Market Research Contributor (Hiring Immediately)

    Home-Based Market Research Contributor (Hiring Immediately)

    Earn Haus • East Haven, Connecticut, US
    Remote
    Full-time +1
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    CRO Senior Manager, Translational Research and Biomarker Research

    CRO Senior Manager, Translational Research and Biomarker Research

    Alexion • New Haven, Connecticut, USA
    Full-time
    Are you ready to make a significant impact in the world of rare diseases As a Biomarker CRO Senior Manager in Alexion R&Ds Translational Sciences group you will play a crucial role in advancing...Show more
    Last updated: 26 days ago • Promoted
    Remote Image Analyst I — Data QC & Imaging

    Remote Image Analyst I — Data QC & Imaging

    Perceptive Group • CT, United States
    Remote
    Full-time
    A leading clinical research organization is seeking an Image Analyst I to analyze imaging data and ensure quality control. This remote position targets candidates with a BS in relevant fields and of...Show more
    Last updated: 6 days ago • Promoted
    QA Testers with AI Experience

    QA Testers with AI Experience

    Akaasa Technologies • CT, United States
    Full-time
    Quick Apply
    Hartford, CT (Hybrid 3 days a week) final interview face to Face < / ...Show more
    Last updated: 3 days ago
    Yale New Haven Health is BC / BE OBGYN for Brand New Practice in Westchester County

    Yale New Haven Health is BC / BE OBGYN for Brand New Practice in Westchester County

    University of Iowa Hospitals & Clinics : Korn Ferry • CT, US
    Full-time +1
    Yale New Haven Health, one of the nation’s leading health systems, is excited to announce a new OB / GYN practice startup in Westchester County, with an initial focus in New Rochelle and Rye Br...Show more
    Last updated: 30+ days ago • Promoted
    Surgical Tech (Prospect)

    Surgical Tech (Prospect)

    Yale New Haven Health • Prospect, Connecticut, US
    Part-time
    To be part of our organization, every employee should understand and share in the YNHHS Vision, support our Mission, and live our Values. These values - integrity, patient-centered, respect, account...Show more
    Last updated: 12 hours ago • Promoted • New!
    R&D Engineer III New Haven, CT

    R&D Engineer III New Haven, CT

    CONMED Corporation • New Haven, Connecticut, USA
    Full-time
    The Engineer III R&D assists in the research design and development of novel medical devices through assessment of clinical needs concept design engineering analysis implant characterization sp...Show more
    Last updated: 8 days ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    C the Signs • CT, US
    Remote
    Full-time
    Quick Apply
    We are seeking a Lead Data Engineer to architect, build, and scale our next-generation healthcare data platform.In this role, you will lead the effort to design robust pipelines, modernize data arc...Show more
    Last updated: 14 days ago
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • New Haven, Connecticut
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Remote Sales & Trading Associate - AI Trainer ($50-$60 / hour)

    Remote Sales & Trading Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • West Haven, Connecticut
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Solution Leader GenAI

    Solution Leader GenAI

    Oxford Global Resources • Oxford, Connecticut, USA
    Full-time
    The AI Solutions Leader / SME has full lifecycle ownership of the architecture and design of the AI offering with a primary emphasis on scoping framework integrity credibility and delivery.As the tec...Show more
    Last updated: 11 days ago • Promoted
    Academic Medical Center in New England Hiring Research-Focused Neurosurgeon

    Academic Medical Center in New England Hiring Research-Focused Neurosurgeon

    Rosman Search • CT, United States
    Full-time
    A prestigious University is looking to expand its neurosurgery faculty by hiring a research-focused PhD or MD / PhD physician. This is a primarily research position, however, some clinical time may be...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Integration Engineer

    Senior AI Integration Engineer

    Openkyber • CT, United States
    Full-time
    Quick Apply
    Job Title : IBM Watson Developer Location : Hartford, CT (Onsite ) Duration : Long-term Contract Type-#W2&l...Show more
    Last updated: 1 day ago
    Associate Director, LC Development, Biologics Analytical Development

    Associate Director, LC Development, Biologics Analytical Development

    Alexion • New Haven, Connecticut, USA
    Full-time
    The Associate Director LC Development will lead the development optimization transfer and lifecycle management of liquid chromatography-based methods supporting biologics programs across clinical a...Show more
    Last updated: 10 hours ago • Promoted • New!
    Neurosurgery Research Faculty : Teaching, Genomics, Impact

    Neurosurgery Research Faculty : Teaching, Genomics, Impact

    Rosman Search • CT, United States
    Full-time
    An esteemed academic institution is seeking a dedicated neurosurgeon to join its faculty, focusing on research with opportunities for teaching and clinical practice. This role offers a unique blend ...Show more
    Last updated: 9 days ago • Promoted
    Financial Expert - AI Trainer ($150 per hour)

    Financial Expert - AI Trainer ($150 per hour)

    Mercor • New Haven, Connecticut, US
    Remote
    Full-time
    UK / Canada / Europe / Singapore / Dubai / Australia-based • •Investment Banking or Private Equity Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have •...Show more
    Last updated: 21 hours ago • Promoted • New!
    Yale Medicine : Join Our Anesthesiology Clinical Faculty

    Yale Medicine : Join Our Anesthesiology Clinical Faculty

    Yale New Haven Health System - Korn Ferry • New Haven, CT, US
    Permanent
    Yale School of Medicine is actively seeking .New Haven and Bridgeport Connecticut area.We have multiple openings available, with a particular focus on the following sub-specialties : .Provide co...Show more
    Last updated: 21 days ago • Promoted
    CT Technologist - Up to $53 / hour - Connecticut

    CT Technologist - Up to $53 / hour - Connecticut

    The Medicus Firm • New Haven, CT, US
    Full-time +1
    Permanent, full-time hospital-based opportunity.Fixed, predictable hours; Monday to Friday schedule.Full benefits package with significant PTO. Opportunities for advanced training and career develop...Show more
    Last updated: 30+ days ago • Promoted