Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Burbank, California, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Burbank, California, US
10 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Burbank, California, US

    Related jobs
    Travel Physical Therapy Assistant (PTA) in West Lancaster, CA

    Travel Physical Therapy Assistant (PTA) in West Lancaster, CA

    AlliedTravelCareers • West Lancaster, CA, US
    Full-time
    AlliedTravelCareers is working with Aequor to find a qualified Physical Therapy Assistant (PTA) in West Lancaster, California, 93534!. Physical Therapist Assistant (PTA).With Aequor, you can enjoy t...Show more
    Last updated: 30+ days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Adjunct Counselor Non Instructor

    Adjunct Counselor Non Instructor

    InsideHigherEd • Del Sur, California, United States
    Part-time
    Antelope Valley College invites applications for our adjunct (Temporary, part-time) faculty applicant pool for the following discipline : Counseling. To provide academic advisement for students.Assis...Show more
    Last updated: 4 days ago • Promoted
    Adjunct Learning Disability Specialist

    Adjunct Learning Disability Specialist

    InsideHigherEd • Del Sur, California, United States
    Part-time
    Antelope Valley College invites applications for our adjunct (Temporary, part-time) faculty applicant pool for the following Disabled Student Services division. Provides assessment services to stude...Show more
    Last updated: 4 days ago • Promoted
    AI Engineer

    AI Engineer

    Open Roles At Outsmart • Culver City, California, United States
    Full-time
    In this role, you’ll be at the forefront of the machine learning that powers Outsmart’s applications.As an early member of the engineering team you’ll play a critical role in setting the tone for O...Show more
    Last updated: 17 days ago • Promoted
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • Burbank, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 30+ days ago • Promoted
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Axle Mobility • Los Angeles, California, United States
    Remote
    Full-time
    Axle is rebuilding the back office of commercial vehicle operations with AI-native tools for the people who keep the world moving : drivers, technicians, service leaders, operators, and fleet manage...Show more
    Last updated: 30+ days ago • Promoted
    Experienced AI Prompt Specialist Needed USCanadian natives

    Experienced AI Prompt Specialist Needed USCanadian natives

    Summa Linguae Technologies • Los Angeles, California, USA
    Full-time
    Hello dear partner! I hope you are doing well : ).We have a new project for one of out top clients.Only native US / Canada candidates are accepted (US / Canadian citizens abroad also welcome).We have a ...Show more
    Last updated: 11 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND’s Center on AI, Security, and Technology (CAST) to develop new approaches to evaluate and understand risk at the AI x Bio intersecti...Show more
    Last updated: 14 days ago • Promoted
    Biosecurity Research Resident - AI x Bio Evaluations

    Biosecurity Research Resident - AI x Bio Evaluations

    RAND Corporation • Santa Monica, CA, United States
    Full-time +1
    RAND is seeking a mission-driven.AI x Bio portfolio lead at RAND's.Center on AI, Security, and Technology (CAST).AI x Bio intersection through assessment, technical work, and policy analysis.RAND's...Show more
    Last updated: 26 days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Burbank, California
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Director, Machine Learning Engineering, Programmatic Ads

    Director, Machine Learning Engineering, Programmatic Ads

    Pinterest • Los Angeles, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show more
    Last updated: 30+ days ago • Promoted
    Adjunct Counselor AVC Bridge 3SP Non-Instructional

    Adjunct Counselor AVC Bridge 3SP Non-Instructional

    InsideHigherEd • Del Sur, California, United States
    Part-time
    Antelope Valley College invites applications for our adjunct (Temporary, part-time) faculty applicant pool for the following discipline : Counselor AVC Bridge 3SP. Faculty participate in curricular p...Show more
    Last updated: 4 days ago • Promoted
    Remote AI Content Reviewer

    Remote AI Content Reviewer

    Outlier • Remote, LA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Lead Engineer, AI

    Lead Engineer, AI

    Absurd Ventures • Santa Monica, California, United States
    Full-time
    We are seeking a highly skilled and experienced.As the Lead AI Engineer, you will be responsible for overseeing the design, development, and optimization of game artificial intelligence systems- fr...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer

    AI Engineer

    Capsule • Los Angeles, California, United States
    Full-time
    Capsule is building the world’s most intuitive AI-powered video editing tool, designed for modern teams.We’re a Series A startup led by two serial founders who have built and scaled creative platfo...Show more
    Last updated: 30+ days ago • Promoted
    Digital Innovation Division Lead of AI Operations

    Digital Innovation Division Lead of AI Operations

    The Aerospace Corporation • El Segundo, CA, United States
    Full-time
    The Aerospace Corporation is the trusted partner to the nation's space programs, solving the hardest problems and providing unmatched technical expertise. As the operator of a federally funded resea...Show more
    Last updated: 30+ days ago • Promoted