Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Troy, New York, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Troy, New York, US
23 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Troy, New York, US

    Related jobs
    Research Assistant

    Research Assistant

    TradeJobsWorkForce • 12866 Saratoga Springs, NY, US
    Full-time
    Research Assistant Job Duties : Generates hypotheses and designs and performs experiments to test ...Show more
    Last updated: 30+ days ago • Promoted
    Surgical Technologist - General Specialty PM Shift

    Surgical Technologist - General Specialty PM Shift

    Advocate Aurora • Kinderhook, NY, United States
    Full-time +1
    Aurora St Lukes Medical Center - 2900 W Oklahoma Ave, Milwaukee, WI 53215.At Advocate Health, we are dedicated to offering a comprehensive suite of Total Rewards including competitive compensation,...Show more
    Last updated: 22 hours ago • Promoted • New!
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Albany, New York
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Business Intelligence Engineer 2

    Business Intelligence Engineer 2

    CenterWell • Albany, NY, United States
    Full-time
    Become a part of our caring community and help us put health first.The Business Intelligence Engineer 2 solves complex business problems and issues using data from internal and external sources to ...Show more
    Last updated: 1 day ago • Promoted
    Senior Engineer, Workday AI Developer

    Senior Engineer, Workday AI Developer

    Cardinal Health • Albany, NY, United States
    Full-time
    What Application Development & Maintenance contributes to Cardinal Health.Information Technology oversees the effective development, delivery, and operation of computing and information services.Th...Show more
    Last updated: 4 days ago • Promoted
    AI First Software Engineer - Senior Associate

    AI First Software Engineer - Senior Associate

    PwC • Albany, NY, United States
    Full-time
    At PwC, our people in software and product innovation focus on developing cutting-edge software solutions and driving product innovation to meet the evolving needs of clients.These individuals comb...Show more
    Last updated: 3 days ago • Promoted
    Travel Surgical Tech - $2005 / Week

    Travel Surgical Tech - $2005 / Week

    Cynet Health • Albany, NY, US
    Full-time
    Cynet Health is seeking an experienced Surgical Tech for an exciting Travel Allied job in Albany, NY.Shift : 4x10 hr days Start Date : 01 / 05 / 2026 Duration : 8 weeks Pay : $2005 / Week.Ranked #5 Best Tr...Show more
    Last updated: 1 day ago • Promoted
    100 % Remote - AI Developer / LLM Engineering Specialist (Albany)

    100 % Remote - AI Developer / LLM Engineering Specialist (Albany)

    Beacon Hill • Albany, NY, US
    Remote
    Part-time
    NOTE : THIS IS A 100 % REMOTE POSITION.AI Developer / LLM Engineering Specialist.Seeking a highly skilled AI / LLM Developer with strong enterprise-level software engineering experience and hands-on e...Show more
    Last updated: less than 1 hour ago • Promoted • New!
    Travel Surgical Tech - CVOR - $1717 / Week

    Travel Surgical Tech - CVOR - $1717 / Week

    LRS Healthcare - Allied • Albany, NY, US
    Full-time
    LRS Healthcare - Allied is seeking an experienced Surgical Tech - CVOR for an exciting Travel Allied job in Albany, NY.Shift : Inquire Start Date : ASAP Duration : 13 weeks Pay : $1717 / Week.Ready to ...Show more
    Last updated: 2 days ago • Promoted
    Remote Market Research Participant (Hiring Immediately)

    Remote Market Research Participant (Hiring Immediately)

    Earn Haus • New Baltimore, New York, US
    Remote
    Full-time +2
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    Travel Certified Surgical Technologist

    Travel Certified Surgical Technologist

    Trinity Health FirstChoice • Albany, NY, US
    Part-time
    Trinity Health FirstChoice is seeking a travel Certified Surgical Technologist for a travel job in Albany, New York.Job Description & Requirements. Certified Surgical Technologist.Are you an exp...Show more
    Last updated: 30+ days ago • Promoted
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • Albany, New York
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 30+ days ago • Promoted
    Title Searcher

    Title Searcher

    Canacre • Alb, NY, US
    Full-time
    Quick Apply
    Canacre’s core services focus on leadership in Environment and Land services throughout the project lifecycle.At Canacre, we emphasize continuous development and growth.Our commitment to inve...Show more
    Last updated: 30+ days ago
    Freelance Luxury Brand Evaluator - New York City Metro Area

    Freelance Luxury Brand Evaluator - New York City Metro Area

    CXG • Mechanicville, NY, US
    Full-time
    Quick Apply
    Turn your passion for luxury into a career opportunity.Explore the world of premium brands and make a lasting impact in fashion, beauty, jewelry, or automobiles. Join CXG, the global leader in custo...Show more
    Last updated: 30+ days ago
    Data Integration Engineer

    Data Integration Engineer

    Eliassen Group • Albany, NY, United States
    Full-time +1
    We are currently seeking a talented Data Integration Principal to join us as we develop our data driven, analytic focused and predictive products to help revolutionize the law enforcement sector.Th...Show more
    Last updated: 30+ days ago • Promoted
    Travel Surgical Tech - CVOR - $2901.6 / Week

    Travel Surgical Tech - CVOR - $2901.6 / Week

    CrossMed Healthcare • Albany, NY, US
    Full-time
    CrossMed Healthcare is seeking an experienced Surgical Tech - CVOR for an exciting Travel Allied job in Albany, NY.Shift : 8 hr days Start Date : ASAP Duration : 13 weeks Pay : $2901.At CrossMed Health...Show more
    Last updated: 30+ days ago • Promoted
    Travel Surgical Tech - Certified - $1689.57 / Week

    Travel Surgical Tech - Certified - $1689.57 / Week

    Atlas MedStaff • Albany, NY, US
    Full-time
    Atlas MedStaff is seeking an experienced Surgical Tech - Certified for an exciting Travel Allied job in Albany, NY.Shift : 3x12 hr nights Start Date : 12 / 15 / 2025 Duration : 13 weeks Pay : $1689.Atlas M...Show more
    Last updated: 17 days ago • Promoted
    Endpoint Vulnerability Management Subject-Matter Expert / Technical Lead

    Endpoint Vulnerability Management Subject-Matter Expert / Technical Lead

    GovCIO • Albany, NY, United States
    Full-time
    GovCIO is currently hiring for Endpoint Vulnerability Management Subject-Matter Expert / Technical Lead for our NIH Proposal. The Technical Lead will support our client's contract needs.This position ...Show more
    Last updated: 10 days ago • Promoted