Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Redwood City, California, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Redwood City, California, US
19 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Redwood City, California, US

    Related jobs
    AI Prompt Engineer

    AI Prompt Engineer

    Monograph • San Francisco, California, United States
    Remote
    Full-time
    Look around you today, every store, home, hospital, school, was made possible by the coordination of architects and a team of professionals. They are charged with the responsibility of creating our ...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer, Evaluation and Reliability

    AI Engineer, Evaluation and Reliability

    Mice Groups • Redwood City, CA, US
    Permanent
    Senior Engineer, AI Evaluation and Reliability / Contract-to-Hire or Direct Hire / Redwood City / Hybrid, onsite 3 days per week / This position pays $70-80 / hr. W2 for Contract, $140-190K annually u...Show more
    Last updated: 13 days ago • Promoted
    Production AI Evals Engineer — Build Reproducible Pipelines

    Production AI Evals Engineer — Build Reproducible Pipelines

    OpenAI • San Francisco, CA, United States
    Full-time
    A global AI research and deployment company is hiring product-minded engineers in San Francisco to design evals for advanced AI systems. You will define evaluation signals, prototype solutions, and ...Show more
    Last updated: 8 days ago • Promoted
    Prompt Engineer / AI Engineer

    Prompt Engineer / AI Engineer

    HyperFi • San Francisco, CA, United States
    Full-time
    We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...Show more
    Last updated: 17 days ago • Promoted
    Police Officer - New Recruit (Entry Level)

    Police Officer - New Recruit (Entry Level)

    City and County of San Francisco • Davenport, CA, United States
    Full-time +1
    Police Officer — New Recruit (Entry-Level).San Francisco Police Department (Q002) | .Full-time, Permanent Civil Service.Comprehensive City & County benefits. Protect life and property through proac...Show more
    Last updated: 21 days ago • Promoted
    Senior Applied AI Engineer

    Senior Applied AI Engineer

    Pulley • San Francisco, California, United States
    Full-time
    At Pulley, we are on a mission to help construction teams break ground faster.Meanwhile, retail and office real estate.Permitting requirements vary vastly by jurisdiction, making it challenging to ...Show more
    Last updated: 30+ days ago • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Mem0 • San Francisco Bay, California, United States
    Full-time
    You’ll turn vague customer use cases into working proofs-of-concept that showcase what Mem0 can do.This means rapid full-stack prototyping, stitching together AI tools, and aggressively experimenti...Show more
    Last updated: 17 days ago • Promoted
    Director, Product Ownership & Applied AI Evaluations

    Director, Product Ownership & Applied AI Evaluations

    BMO Financial Group • San Francisco, CA, United States
    Full-time
    Application Deadline : 11 / 29 / 2025.Job Family Group : Analytics & Reporting.Hybrid Work Model : 2-3 days / week in office (non‑negotiable). Must be a Senior Leader (people manager) with solid experience i...Show more
    Last updated: 16 days ago • Promoted
    AI Engineer

    AI Engineer

    Elicit • Oakland, California, United States
    Full-time
    Elicit is an AI research assistant that uses language models to help researchers figure out what’s true and make better decisions, starting with common research tasks like literature review.Elicit ...Show more
    Last updated: 30+ days ago • Promoted
    Founding AI Engineer

    Founding AI Engineer

    HartleyCo • San Francisco, CA, United States
    Full-time
    This range is provided by HartleyCo.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Direct message the job poster from HartleyCo.Headhunter & Di...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer

    AI Engineer

    Create • San Francisco, CA, United States
    Full-time
    We're hiring AI engineers to build the systems that make Anything intelligent.Often that involves staring down ambiguous research problems. And shipping clever ways of solving them.Prompt engineerin...Show more
    Last updated: 15 days ago • Promoted
    Computer Science Subject Matter Expert - AI Evaluation (US-Remote)

    Computer Science Subject Matter Expert - AI Evaluation (US-Remote)

    Braintrust • San Francisco, CA, United States
    Remote
    Full-time
    Computer Science Subject Matter Expert – AI Evaluation (US-Remote).Computer Science Subject Matter Expert – AI Evaluation (US-Remote). This range is provided by Braintrust.Your actual pay will be ba...Show more
    Last updated: 12 days ago • Promoted
    AI Strategy Consultant, Frontier Tech

    AI Strategy Consultant, Frontier Tech

    Scale AI • San Francisco, CA, United States
    Part-time
    As a member of our Frontier Tech Consultant team, you will be accountable for driving future revenue by ensuring that Scale AI successfully executes new product experiments in a timely manner while...Show more
    Last updated: 30+ days ago • Promoted
    Hybrid AI Research Engineer — Quality & Prompting

    Hybrid AI Research Engineer — Quality & Prompting

    gamma.app • San Francisco, CA, United States
    Full-time
    A leading AI-driven presentation tool company in San Francisco is seeking a Research Engineer to enhance the quality of AI outputs. You'll design evaluation frameworks, improve production prompts, a...Show more
    Last updated: 9 days ago • Promoted
    APPLIED AI ENGINEER (1042) - Department of Technology

    APPLIED AI ENGINEER (1042) - Department of Technology

    San Francisco • San Francisco, CA, United States
    Full-time +1
    Department : DT / Emergent Technology.Role type : Permanent Exempt (PEX), Full Time position is excluded by the Charter from the competitive civil service examination process and shall serve at the dis...Show more
    Last updated: 17 days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Redwood City, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Senior Applied AI Engineer

    Senior Applied AI Engineer

    Omada Health • South San Francisco, CA, United States
    Full-time
    Omada Health is on a mission to inspire and engage people in lifelong health, one step at a time.Omada Health is a digital care provider that empowers individuals to reach their health goals throug...Show more
    Last updated: 30+ days ago • Promoted
    AI Strategy Consultant, Frontier Tech

    AI Strategy Consultant, Frontier Tech

    Scale AI, Inc. • San Francisco, CA, United States
    Part-time
    As a member of our Frontier Tech Consultant team, you will play a critical role in advancing cutting-edge AI innovations by conducting high-impact experiments and ensuring seamless execution at the...Show more
    Last updated: 30+ days ago • Promoted