Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Waterbury, Connecticut, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Waterbury, Connecticut, US
1 day ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Waterbury, Connecticut, US

    Related jobs
    Flexible Online Research Contributor (Hiring Immediately)

    Flexible Online Research Contributor (Hiring Immediately)

    Earn Haus • Oakville, Connecticut, US
    Full-time +1
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    Project Scientist

    Project Scientist

    Loureiro Engineering Associates, Inc. • Plainville, CT, US
    Full-time
    Quick Apply
    Loureiro Engineering Associates is seeking a Project Scientist to join our Environmental Division in Connecticut.This role is ideal for a motivated professional with strong technical and organizati...Show more
    Last updated: 30+ days ago
    Remote Image Analyst I — Data QC & Imaging

    Remote Image Analyst I — Data QC & Imaging

    Perceptive Group • CT, United States
    Remote
    Full-time
    A leading clinical research organization is seeking an Image Analyst I to analyze imaging data and ensure quality control. This remote position targets candidates with a BS in relevant fields and of...Show more
    Last updated: 6 days ago • Promoted
    QA Testers with AI Experience

    QA Testers with AI Experience

    Akaasa Technologies • CT, United States
    Full-time
    Quick Apply
    Hartford, CT (Hybrid 3 days a week) final interview face to Face < / ...Show more
    Last updated: 4 days ago
    Media Engineer : 25-05747

    Media Engineer : 25-05747

    Akraya Inc • Bristol, Connecticut, United States
    Full-time
    Quick Apply
    Primary Skills : Broadcast-Engineering (Expert), Cloud-Infrastructure (Advanced), Video-Production (Expert), Networking (Advanced), Troubleshooting (Expert). Location : Bristol, CT (Onsite)(#LI-Onsite...Show more
    Last updated: 30+ days ago
    Quality Project Engineer

    Quality Project Engineer

    Gpac • Torrington, Connecticut, United States
    Full-time
    Quick Apply
    New Product Launch & Equipment Integration Support.Lead quality activities for new product introductions (NPI) and equipment integration projects. Support the integration of manufacturing equipm...Show more
    Last updated: 23 hours ago
    Sr. Data Engineer with Ads industry exp : 25-06168 (No C2C)

    Sr. Data Engineer with Ads industry exp : 25-06168 (No C2C)

    Akraya Inc • Bristol, Connecticut, United States
    Full-time
    Quick Apply
    Primary Skills : SQL (Expert), Data Visualization (Expert), Python (Intermediate), Data Warehousing (Advanced), Data Modelling (Proficient). Duration : 12 months with possible extension.Location : Bris...Show more
    Last updated: 30+ days ago
    Precision CNC Machine Operator

    Precision CNC Machine Operator

    Howmet • Winsted, CT, United States
    Full-time
    Winsted Indust Pk, 145 Price Rd, Winsted, CT, 06098, US.Remote Work Schedule Availability?.This position entails access to export-controlled items and employment offers are conditioned upon an appl...Show more
    Last updated: 30+ days ago • Promoted
    Process Engineer

    Process Engineer

    Howmet • Winsted, CT, United States
    Full-time
    Winsted Indust Pk, 145 Price Rd, Winsted, CT, 06098, US.This position entails access to export-controlled items and employment offers are conditioned upon an applicant's ability to lawfully obtain ...Show more
    Last updated: 3 days ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    C the Signs • CT, US
    Remote
    Full-time
    Quick Apply
    We are seeking a Lead Data Engineer to architect, build, and scale our next-generation healthcare data platform.In this role, you will lead the effort to design robust pipelines, modernize data arc...Show more
    Last updated: 15 days ago
    Academic Medical Center in New England Hiring Research-Focused Neurosurgeon

    Academic Medical Center in New England Hiring Research-Focused Neurosurgeon

    Rosman Search • CT, United States
    Full-time
    A prestigious University is looking to expand its neurosurgery faculty by hiring a research-focused PhD or MD / PhD physician. This is a primarily research position, however, some clinical time may be...Show more
    Last updated: 30+ days ago • Promoted
    Oncology Data Specialist (Hiring Immediately)

    Oncology Data Specialist (Hiring Immediately)

    Middlesex Health • Bristol, Connecticut, us
    Full-time
    Position Highlights Department : Cancer Center Hours : 40.Shift : Shift 1 Position Summary The Tumor Registrar (Oncology Data Specialist) assures thorough, accurate and quality data collection as requ...Show more
    Last updated: 10 hours ago • New!
    Remote Side Hustle Evaluator - Flexible Online Gig Work

    Remote Side Hustle Evaluator - Flexible Online Gig Work

    Finance Buzz • Winsted, Connecticut, US
    Remote
    Temporary
    Are you looking to earn extra income from the comfort of your home? We're seeking motivated individuals to explore and test a variety of remote side hustle opportunities featured on FinanceBuzz.Thi...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Integration Engineer

    Senior AI Integration Engineer

    Openkyber • CT, United States
    Full-time
    Quick Apply
    Job Title : IBM Watson Developer Location : Hartford, CT (Onsite ) Duration : Long-term Contract Type-#W2&l...Show more
    Last updated: 2 days ago
    Surgical Tech (Prospect)

    Surgical Tech (Prospect)

    Yale New Haven Health • Prospect, Connecticut, US
    Part-time
    To be part of our organization, every employee should understand and share in the YNHHS Vision, support our Mission, and live our Values. These values - integrity, patient-centered, respect, account...Show more
    Last updated: 14 hours ago • Promoted • New!
    Neurosurgery Research Faculty : Teaching, Genomics, Impact

    Neurosurgery Research Faculty : Teaching, Genomics, Impact

    Rosman Search • CT, United States
    Full-time
    An esteemed academic institution is seeking a dedicated neurosurgeon to join its faculty, focusing on research with opportunities for teaching and clinical practice. This role offers a unique blend ...Show more
    Last updated: 9 days ago • Promoted
    Flexible Online Market Research Participant (Hiring Immediately)

    Flexible Online Market Research Participant (Hiring Immediately)

    Earn Haus • Wolcott, Connecticut, US
    Full-time +2
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    Image Analyst I

    Image Analyst I

    Perceptive Group • CT, United States
    Full-time
    Image Analyst I page is loaded## Image Analyst Ilocations : US CT Decentralized : Remote (US)time type : Full timeposted on : Posted Yesterdayjob requisition id : JR104288 • •About Perceptive • •O...Show more
    Last updated: 6 days ago • Promoted