Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Burbank, California, US

No longer accepting applications

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Burbank, California, US

1 day ago

Job type

Full-time

Remote

Job description

1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

2\. Key Responsibilities

Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##

3\. Ideal Qualifications

Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##

4\. More About the Opportunity

Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##

5\. Compensation & Contract Terms

Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##

6\. Application Process

Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##

7\. About Mercor

Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.

Create a job alert for this search

Curator • Burbank, California, US

Related jobs

Remote Biology Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

Mercor • Burbank, California, US

Remote

Full-time

Role Overview • • Mercor is collaborating with a leading AI research lab on a project to advance frontier biology problem-solving. We are looking for biology experts who hold a • •PhD or Master’s degre...Show more

Last updated: 13 hours ago • Promoted • New!

Principal AI Engineer

Kizen • Los Angeles, California, United States

Full-time

Hybrid (Minimum 4 days in office) | Full Time | https : / / kizen.At Kizen, we’re engineering a more humane future where AI drives universal healthcare, more impactful work, world-class education, ...Show more

Last updated: 30+ days ago • Promoted

Remote AI Writing Evaluator

Outlier • Remote, LA, United States

Remote

Full-time

Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more

Last updated: 2 days ago • Promoted

Biosecurity Research Resident - AI-Enabled Biological Tools

RAND Corporation • Santa Monica, CA, United States

Full-time +1

This position will contribute to groundbreaking research on how advances in artificial intelligence intersect with biotechnology, national security, and innovation. RAND's reputation for excellence ...Show more

Last updated: 28 days ago • Promoted

Remote Astronomy Expert (PhD, Master's, or Olympiad Participants) - AI Trainer ($60-$80 per hour)

Mercor • Burbank, California, US

Remote

Full-time

Role Overview Mercor is collaborating with a leading AI research lab on a project to advance • •frontier astronomy problem-solving • •. We are looking for astronomy experts who hold a • •PhD or Master’s...Show more

Last updated: 13 hours ago • Promoted • New!

Remote PhD-Level Computational Biologist — Design Data-Centric Benchmarks for AI Models - AI Trainer ($65-$85 per hour)

Mercor • Burbank, California, US

Remote

Full-time

Role Overview • • Mercor is seeking computational biology experts to contribute to a unique project with a top-tier AI research organization. This short-term initiative challenges AI models with hidde...Show more

Last updated: 14 hours ago • Promoted • New!

AI Engineer

Open Roles At Outsmart • Culver City, California, United States

Full-time

In this role, you’ll be at the forefront of the machine learning that powers Outsmart’s applications.As an early member of the engineering team you’ll play a critical role in setting the tone for O...Show more

Last updated: 18 days ago • Promoted

AI Trainer -Remote Content QA Reviewer

Outlier • Remote, LA, United States

Remote

Full-time

Last updated: 2 days ago • Promoted

Applied AI Engineer

Axle Mobility • Los Angeles, California, United States

Remote

Full-time

Axle is rebuilding the back office of commercial vehicle operations with AI-native tools for the people who keep the world moving : drivers, technicians, service leaders, operators, and fleet manage...Show more

Last updated: 30+ days ago • Promoted

AI Cyber Testing & Evaluation Research Lead

RAND Corporation • Santa Monica, California, United States

Full-time

A leading research organization is seeking a Research Lead for AI Cyber Testing & Evaluation in Santa Monica.The ideal candidate will have over 6 years of technical and management experience, leadi...Show more

Last updated: 4 days ago • Promoted

Senior Associate, AI Engineer

KPMG • Los Angeles, CA, United States

Full-time

KPMG Advisory practice is currently our fastest growing practice.We are seeing tremendous client demand, and looking forward we do not anticipate that slowing down. In this ever-changing market envi...Show more

Last updated: 24 days ago • Promoted

Experienced AI Prompt Specialist Needed USCanadian natives

Summa Linguae Technologies • Los Angeles, California, USA

Full-time

Hello dear partner! I hope you are doing well : ).We have a new project for one of out top clients.Only native US / Canada candidates are accepted (US / Canadian citizens abroad also welcome).We have a ...Show more

Last updated: 13 days ago • Promoted

Biosecurity Research Resident - AI x Bio Evaluations

RAND • Santa Monica, CA, United States

Full-time +1

RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more

Last updated: 15 days ago • Promoted

Biosecurity Research Resident - AI x Bio Evaluations

RAND Corporation • Santa Monica, CA, United States

Full-time +1

RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more

Last updated: 28 days ago • Promoted

Remote Biology Expert (PhD) (Los Angeles)

Turing • Los Angeles, CA, US

Remote

Part-time

Remote contract for PhDs in Biology, Biotechnology, Biochemistry, or related fields.Work on cutting-edge projects with top AI labs while earning up to $50+ / hour, fully remote, with flexible weekly ...Show more

Last updated: 2 hours ago • Promoted • New!

Remote Expert Prompt Curators for Advanced AI Evaluation Dataset - AI Trainer ($50-$70 per hour)

Mercor • Burbank, California, US

Remote

Full-time

Role Overview • • Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge a...Show more

Last updated: 13 hours ago • Promoted • New!

Lead Engineer, AI

Absurd Ventures • Santa Monica, California, United States

Full-time

We are seeking a highly skilled and experienced.As the Lead AI Engineer, you will be responsible for overseeing the design, development, and optimization of game artificial intelligence systems- fr...Show more

Last updated: 30+ days ago • Promoted

Remote Audio Generalist Evaluator Expert - AI Trainer ($35-$40 per hour)

Mercor • Burbank, California, US

Remote

Full-time

Mercor is seeking detail-oriented writing experts to contribute to a high-impact audio AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evalu...Show more

Last updated: 13 hours ago • Promoted • New!