Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Burbank, California, US

No longer accepting applications

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Burbank, California, US

2 days ago

Job type

Full-time

Remote

Job description

1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

2\. Key Responsibilities

Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##

3\. Ideal Qualifications

Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##

4\. More About the Opportunity

Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##

5\. Compensation & Contract Terms

Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##

6\. Application Process

Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##

7\. About Mercor

Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.

Create a job alert for this search

Curator • Burbank, California, US

Related jobs

Research Lead - AI Cyber Testing & Evaluation

RAND Corporation • Santa Monica, CA, United States

Temporary

Global and Emerging Risks (GER) division.As Research Lead - AI Cyber Testing & Evaluation, you'll direct a comprehensive research portfolio focused on assessing the offensive cyber capabilities of ...Show more

Last updated: 30+ days ago • Promoted

AI & Innovation Project Manager (Los Angeles)

Liebert Cassidy Whitmore • Los Angeles, CA, US

Part-time

Were looking for a legal industry professional who can guide and accelerate our firm's AI adoption journey.This hands-on leadership role will report to the Executive Director and work closely with ...Show more

Last updated: 1 day ago • Promoted

Remote AI Writing Evaluator

Outlier • Remote, LA, United States

Remote

Full-time

Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more

Last updated: 3 days ago • Promoted

Biosecurity Research Resident - AI-Enabled Biological Tools

RAND Corporation • Santa Monica, CA, United States

Full-time +1

This position will contribute to groundbreaking research on how advances in artificial intelligence intersect with biotechnology, national security, and innovation. RAND's reputation for excellence ...Show more

Last updated: 28 days ago • Promoted

Remote Biology Expert (PhD)

Turing • Los Angeles, CA, United States

Remote

Full-time

Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more

Last updated: 18 hours ago • Promoted • New!

AI Trainer -Remote Content QA Reviewer

Outlier • Remote, LA, United States

Remote

Full-time

Last updated: 3 days ago • Promoted

AI Cyber Testing & Evaluation Research Lead

RAND Corporation • Santa Monica, California, United States

Full-time

A leading research organization is seeking a Research Lead for AI Cyber Testing & Evaluation in Santa Monica.The ideal candidate will have over 6 years of technical and management experience, leadi...Show more

Last updated: 5 days ago • Promoted

Senior Associate, AI Engineer

KPMG • Los Angeles, CA, United States

Full-time

KPMG Advisory practice is currently our fastest growing practice.We are seeing tremendous client demand, and looking forward we do not anticipate that slowing down. In this ever-changing market envi...Show more

Last updated: 25 days ago • Promoted

Remote STEM PHDs - Biology (Los Angeles)

Turing • Los Angeles, CA, US

Remote

Part-time

Last updated: 18 hours ago • Promoted • New!

Remote STEM PHDs - Biology

Turing • Los Angeles, CA, United States

Remote

Full-time

Last updated: 12 hours ago • Promoted • New!

Biosecurity Research Resident - AI x Bio Evaluations

RAND Corporation • Santa Monica, CA, United States

Full-time +1

RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more

Last updated: 28 days ago • Promoted

Biosecurity Research Resident - AI x Bio Evaluations

RAND • Santa Monica, CA, United States

Full-time +1

RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more

Last updated: 5 days ago • Promoted

AI Implementation Specialist

Partner Engineering and Science • Torrance, CA, United States

Full-time

PARTNER offers full-service engineering, environmental and energy consulting, and design services throughout the Americas, Europe, and around the globe. As a leading firm in the Commercial Real Esta...Show more

Last updated: 25 days ago • Promoted

AI & Innovation Project Manager

Liebert Cassidy Whitmore • Los Angeles, CA, United States

Full-time

We’re looking for a legal industry professional who can guide and accelerate our firm's AI adoption journey.This hands-on leadership role will report to the Executive Director and work closely with...Show more

Last updated: 1 day ago • Promoted

Artificial Intelligence (AI) Technology Instructor - Pool - UCLA Extension

University of California - Los Angeles (UCLA) • Los Angeles, CA, United States

Full-time

Wednesday, Dec 31, 2025 at 11 : 59pm (Pacific Time).Apply by this date to ensure full consideration by the committee.Tuesday, Jun 30, 2026 at 11 : 59pm (Pacific Time). Applications will continue to be a...Show more

Last updated: 30+ days ago • Promoted

Remote Biology Expert (PhD) (Los Angeles)

Turing • Los Angeles, CA, US

Remote

Part-time

Last updated: 18 hours ago • Promoted • New!

Research Lead - Securing Frontier AI

RAND Corporation • Santa Monica, CA, United States

Temporary

Global and Emerging Risks (GER) division.As Research Lead - Securing Frontier AI, you'll direct a comprehensive research portfolio focused on ensuring that the world's most important AI systems are...Show more

Last updated: 30+ days ago • Promoted

Senior Backend Engineer (Analytics / AI Personalization)

Tapcart • Santa Monica, California, United States

Remote

Full-time

Tapcart is the leading mobile app platform for the world’s fastest-growing Shopify brands.We help marketers and eCommerce teams strengthen their brands and create differentiated customer experience...Show more

Last updated: 30+ days ago • Promoted