Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Burbank, California, US
No longer accepting applications
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Burbank, California, US
1 day ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Burbank, California, US

    Related jobs
    Remote Biology Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

    Remote Biology Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

    Mercor • Burbank, California, US
    Remote
    Full-time
    Role Overview • • Mercor is collaborating with a leading AI research lab on a project to advance frontier biology problem-solving. We are looking for biology experts who hold a • •PhD or Master’s degre...Show more
    Last updated: 13 hours ago • Promoted • New!
    Principal AI Engineer

    Principal AI Engineer

    Kizen • Los Angeles, California, United States
    Full-time
    Hybrid (Minimum 4 days in office) | Full Time | https : / / kizen.At Kizen, we’re engineering a more humane future where AI drives universal healthcare, more impactful work, world-class education, ...Show more
    Last updated: 30+ days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    Biosecurity Research Resident - AI-Enabled Biological Tools

    Biosecurity Research Resident - AI-Enabled Biological Tools

    RAND Corporation • Santa Monica, CA, United States
    Full-time +1
    This position will contribute to groundbreaking research on how advances in artificial intelligence intersect with biotechnology, national security, and innovation. RAND's reputation for excellence ...Show more
    Last updated: 28 days ago • Promoted
    Remote Astronomy Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

    Remote Astronomy Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

    Mercor • Burbank, California, US
    Remote
    Full-time
    Role Overview Mercor is collaborating with a leading AI research lab on a project to advance • •frontier astronomy problem-solving • •. We are looking for astronomy experts who hold a • •PhD or Master’s...Show more
    Last updated: 13 hours ago • Promoted • New!
    Remote PhD-Level Computational Biologist — Design Data-Centric Benchmarks for AI Models - AI Trainer ($65-$85 per hour)

    Remote PhD-Level Computational Biologist — Design Data-Centric Benchmarks for AI Models - AI Trainer ($65-$85 per hour)

    Mercor • Burbank, California, US
    Remote
    Full-time
    Role Overview • • Mercor is seeking computational biology experts to contribute to a unique project with a top-tier AI research organization. This short-term initiative challenges AI models with hidde...Show more
    Last updated: 14 hours ago • Promoted • New!
    AI Engineer

    AI Engineer

    Open Roles At Outsmart • Culver City, California, United States
    Full-time
    In this role, you’ll be at the forefront of the machine learning that powers Outsmart’s applications.As an early member of the engineering team you’ll play a critical role in setting the tone for O...Show more
    Last updated: 18 days ago • Promoted
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 2 days ago • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Axle Mobility • Los Angeles, California, United States
    Remote
    Full-time
    Axle is rebuilding the back office of commercial vehicle operations with AI-native tools for the people who keep the world moving : drivers, technicians, service leaders, operators, and fleet manage...Show more
    Last updated: 30+ days ago • Promoted
    AI Cyber Testing & Evaluation Research Lead

    AI Cyber Testing & Evaluation Research Lead

    RAND Corporation • Santa Monica, California, United States
    Full-time
    A leading research organization is seeking a Research Lead for AI Cyber Testing & Evaluation in Santa Monica.The ideal candidate will have over 6 years of technical and management experience, leadi...Show more
    Last updated: 4 days ago • Promoted
    Senior Associate, AI Engineer

    Senior Associate, AI Engineer

    KPMG • Los Angeles, CA, United States
    Full-time
    KPMG Advisory practice is currently our fastest growing practice.We are seeing tremendous client demand, and looking forward we do not anticipate that slowing down. In this ever-changing market envi...Show more
    Last updated: 24 days ago • Promoted
    Experienced AI Prompt Specialist Needed USCanadian natives

    Experienced AI Prompt Specialist Needed USCanadian natives

    Summa Linguae Technologies • Los Angeles, California, USA
    Full-time
    Hello dear partner! I hope you are doing well : ).We have a new project for one of out top clients.Only native US / Canada candidates are accepted (US / Canadian citizens abroad also welcome).We have a ...Show more
    Last updated: 13 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more
    Last updated: 15 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more
    Last updated: 28 days ago • Promoted
    Remote Biology Expert (PhD) (Los Angeles)

    Remote Biology Expert (PhD) (Los Angeles)

    Turing • Los Angeles, CA, US
    Remote
    Part-time
    Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more
    Last updated: 2 hours ago • Promoted • New!
    Remote Expert Prompt Curators for Advanced AI Evaluation Dataset - AI Trainer ($50-$70 per hour)

    Remote Expert Prompt Curators for Advanced AI Evaluation Dataset - AI Trainer ($50-$70 per hour)

    Mercor • Burbank, California, US
    Remote
    Full-time
    Role Overview • • Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge a...Show more
    Last updated: 13 hours ago • Promoted • New!
    Lead Engineer, AI

    Lead Engineer, AI

    Absurd Ventures • Santa Monica, California, United States
    Full-time
    We are seeking a highly skilled and experienced.As the Lead AI Engineer, you will be responsible for overseeing the design, development, and optimization of game artificial intelligence systems- fr...Show more
    Last updated: 30+ days ago • Promoted
    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

    Mercor • Burbank, California, US
    Remote
    Full-time
    Mercor is seeking detail-oriented writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evalu...Show more
    Last updated: 13 hours ago • Promoted • New!