Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Burbank, California, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Burbank, California, US
2 days ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Burbank, California, US

    Related jobs
    Research Lead - AI Cyber Testing & Evaluation

    Research Lead - AI Cyber Testing & Evaluation

    RAND Corporation • Santa Monica, CA, United States
    Temporary
    Global and Emerging Risks (GER) division.As Research Lead - AI Cyber Testing & Evaluation, you'll direct a comprehensive research portfolio focused on assessing the offensive cyber capabilities of ...Show more
    Last updated: 30+ days ago • Promoted
    AI & Innovation Project Manager (Los Angeles)

    AI & Innovation Project Manager (Los Angeles)

    Liebert Cassidy Whitmore • Los Angeles, CA, US
    Part-time
    Were looking for a legal industry professional who can guide and accelerate our firm's AI adoption journey.This hands-on leadership role will report to the Executive Director and work closely with ...Show more
    Last updated: 1 day ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 3 days ago • Promoted
    Biosecurity Research Resident - AI-Enabled Biological Tools

    Biosecurity Research Resident - AI-Enabled Biological Tools

    RAND Corporation • Santa Monica, CA, United States
    Full-time +1
    This position will contribute to groundbreaking research on how advances in artificial intelligence intersect with biotechnology, national security, and innovation. RAND's reputation for excellence ...Show more
    Last updated: 28 days ago • Promoted
    Remote Biology Expert (PhD)

    Remote Biology Expert (PhD)

    Turing • Los Angeles, CA, United States
    Remote
    Full-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 18 hours ago • Promoted • New!
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 3 days ago • Promoted
    AI Cyber Testing & Evaluation Research Lead

    AI Cyber Testing & Evaluation Research Lead

    RAND Corporation • Santa Monica, California, United States
    Full-time
    A leading research organization is seeking a Research Lead for AI Cyber Testing & Evaluation in Santa Monica.The ideal candidate will have over 6 years of technical and management experience, leadi...Show more
    Last updated: 5 days ago • Promoted
    Senior Associate, AI Engineer

    Senior Associate, AI Engineer

    KPMG • Los Angeles, CA, United States
    Full-time
    KPMG Advisory practice is currently our fastest growing practice.We are seeing tremendous client demand, and looking forward we do not anticipate that slowing down. In this ever-changing market envi...Show more
    Last updated: 25 days ago • Promoted
    Remote STEM PHDs - Biology (Los Angeles)

    Remote STEM PHDs - Biology (Los Angeles)

    Turing • Los Angeles, CA, US
    Remote
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 18 hours ago • Promoted • New!
    Remote STEM PHDs - Biology

    Remote STEM PHDs - Biology

    Turing • Los Angeles, CA, United States
    Remote
    Full-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 12 hours ago • Promoted • New!
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more
    Last updated: 28 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more
    Last updated: 5 days ago • Promoted
    AI Implementation Specialist

    AI Implementation Specialist

    Partner Engineering and Science • Torrance, CA, United States
    Full-time
    PARTNER offers full-service engineering, environmental and energy consulting, and design services throughout the Americas, Europe, and around the globe. As a leading firm in the Commercial Real Esta...Show more
    Last updated: 25 days ago • Promoted
    AI & Innovation Project Manager

    AI & Innovation Project Manager

    Liebert Cassidy Whitmore • Los Angeles, CA, United States
    Full-time
    We’re looking for a legal industry professional who can guide and accelerate our firm's AI adoption journey.This hands-on leadership role will report to the Executive Director and work closely with...Show more
    Last updated: 1 day ago • Promoted
    Artificial Intelligence (AI) Technology Instructor - Pool - UCLA Extension

    Artificial Intelligence (AI) Technology Instructor - Pool - UCLA Extension

    University of California - Los Angeles (UCLA) • Los Angeles, CA, United States
    Full-time
    Wednesday, Dec 31, 2025 at 11 : 59pm (Pacific Time).Apply by this date to ensure full consideration by the committee.Tuesday, Jun 30, 2026 at 11 : 59pm (Pacific Time). Applications will continue to be a...Show more
    Last updated: 30+ days ago • Promoted
    Remote Biology Expert (PhD) (Los Angeles)

    Remote Biology Expert (PhD) (Los Angeles)

    Turing • Los Angeles, CA, US
    Remote
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 18 hours ago • Promoted • New!
    Research Lead - Securing Frontier AI

    Research Lead - Securing Frontier AI

    RAND Corporation • Santa Monica, CA, United States
    Temporary
    Global and Emerging Risks (GER) division.As Research Lead - Securing Frontier AI, you'll direct a comprehensive research portfolio focused on ensuring that the world's most important AI systems are...Show more
    Last updated: 30+ days ago • Promoted
    Senior Backend Engineer (Analytics / AI Personalization)

    Senior Backend Engineer (Analytics / AI Personalization)

    Tapcart • Santa Monica, California, United States
    Remote
    Full-time
    Tapcart is the leading mobile app platform for the world’s fastest-growing Shopify brands.We help marketers and eCommerce teams strengthen their brands and create differentiated customer experience...Show more
    Last updated: 30+ days ago • Promoted