Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Philadelphia, Pennsylvania, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Philadelphia, Pennsylvania, US
19 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Philadelphia, Pennsylvania, US

    Related jobs
    Travel Mammography Technologist - $2,840 per week

    Travel Mammography Technologist - $2,840 per week

    Vibra Travels • Sewell, NJ, United States
    Full-time
    Vibra Travels is seeking a travel Mammography Technologist for a travel job in Sewell, New Jersey.Job Description & Requirements. VIBRA TRAVELS is seeking a CORPORATE TRAVEL MAMMOGRAPHY TECHNOLOGIST...Show more
    Last updated: 30+ days ago • Promoted
    Test Products from Home – $25-$45 / hr + Freebies

    Test Products from Home – $25-$45 / hr + Freebies

    OCPA • Gloucester, New Jersey, us
    Part-time +1
    Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies. We guarantee 15-25 hours per week with an hourly pay of bet...Show more
    Last updated: 30+ days ago • Promoted
    Travel CT Tech - $2006.34 / Week

    Travel CT Tech - $2006.34 / Week

    AMN Healthcare Allied • Langhorne, PA, US
    Full-time
    AMN Healthcare Allied is seeking an experienced CT Tech for an exciting Travel Allied job in Langhorne, PA.Shift : 12 hr nights Start Date : ASAP Duration : 13 weeks Pay : $2006.Job Description & R...Show more
    Last updated: 14 days ago • Promoted
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Philadelphia, PA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Executive Partner, Artificial Intelligence Advisory / AI Strategist

    Executive Partner, Artificial Intelligence Advisory / AI Strategist

    Gartner • Philadelphia, PA, United States
    Full-time
    Gartner Executive Programs (ExP) and Gartner Enterprise IT Leaders (EITL) are services within Gartner Executive Technology Services (ETS) and is the indispensable tool for digital leaders.It is an ...Show more
    Last updated: 30+ days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Philadelphia, PA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Director of Data Science & AI Innovation

    Director of Data Science & AI Innovation

    LexisNexis • Horsham, Pennsylvania, US
    Part-time
    At LexisNexis Intellectual Property Solutions, our mission is to bring clarity to innovation by delivering better outcomes to the innovation community. Every day, our teams support the development o...Show more
    Last updated: 17 hours ago • Promoted • New!
    Principal Product Manager

    Principal Product Manager

    Syneos Health / inVentiv Health Commercial LLC • Newtown, PA, United States
    Full-time
    Syneos Health is a leading fully integrated biopharmaceutical solutions organization built to accelerate customer success. We translate unique clinical, medical affairs and commercial insights into ...Show more
    Last updated: 30+ days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Philadelphia, Pennsylvania
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Senior AI Engineer

    Senior AI Engineer

    Reply • Philadelphia, Pennsylvania, United States
    Full-time
    Valorem Reply is an award-winning digital transformation firm focused on delivering data-driven enterprise, IT modernization, customer experience, product transformation and digital workplace.Throu...Show more
    Last updated: 30+ days ago • Promoted
    CRNA - Anesthesiology job available in Langhorne, Pennsylvania

    CRNA - Anesthesiology job available in Langhorne, Pennsylvania

    Vituity • Langhorne, PA, US
    Full-time +1
    Up to 50K Sign On Bonus! – Langhorne, PA – Seeking CRNAs.Become a Valued Member of Your Anesthesia Team.As a CRNA, you play a critical role in our mission to improve lives in Anes...Show more
    Last updated: 30+ days ago • Promoted
    Principal, Data Engineering (Remote)

    Principal, Data Engineering (Remote)

    Jazz Pharmaceuticals • Philadelphia, PA, United States
    Remote
    Full-time
    If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...Show more
    Last updated: 30+ days ago • Promoted
    Medical Screener

    Medical Screener

    Biolife Plasma Services Careers • DEPTFORD, New Jersey, US
    Part-time +1
    By clicking the “Apply” button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Tak...Show more
    Last updated: 30+ days ago • Promoted
    Sub Investigator (NP / PA-C)

    Sub Investigator (NP / PA-C)

    Hawthorne Health, Inc. • Clementon, NJ, US
    Full-time
    Assist the Principal Investigator (PI) in overseeing and managing clinical trials conducted at the research site, ensuring adherence to the study protocol, Good Clinical Practice (GCP), ICH guideli...Show more
    Last updated: 20 days ago • Promoted
    Research Associate I

    Research Associate I

    bioMerieux Inc. • Philadelphia, PA, United States
    Full-time
    We are looking for a R&D technician to join and grow within the 'Assay Development' team of the 'Food Molecular Diagnostic Franchise' from the 'Industrial Applications' department.The technician wi...Show more
    Last updated: 1 day ago • Promoted
    AI / ML Data Engineer

    AI / ML Data Engineer

    Chabez Tech • Berlin, New Jersey, United States
    Temporary
    We are seeking a highly skilled and motivated AI Engineer to join our data and analytics team.This role is ideal for someone passionate about building intelligent data-driven solutions using cuttin...Show more
    Last updated: 22 days ago • Promoted
    Machine Learning / Data Science Engineer

    Machine Learning / Data Science Engineer

    Captech Consulting • Philadelphia, Pennsylvania, United States
    Full-time
    CapTech is an award-winning consulting firm that collaborates with clients to achieve what’s possible through the power of technology. At CapTech, we’re passionate about the work we do and the resul...Show more
    Last updated: 30+ days ago • Promoted
    Innovation Cup 2026 - Team Synthetic Biology (all genders)

    Innovation Cup 2026 - Team Synthetic Biology (all genders)

    Merck KGaA • Philadelphia, PA, United States
    Full-time
    Ready to explore, break barriers, and discover more? We know you've got big plans - so do we! Our colleagues across the globe love innovating with science and technology to enrich people's lives wi...Show more
    Last updated: 16 days ago • Promoted