Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Fall River, Massachusetts, US
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Fall River, Massachusetts, US
21 hours ago
Job type
  • Full-time
  • Remote
Job description

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Create a job alert for this search

    Curator • Fall River, Massachusetts, US

    Related jobs
    Financial Analyst - AI Trainer ($150 per hour)

    Financial Analyst - AI Trainer ($150 per hour)

    Mercor • Fall River, Massachusetts, US
    Remote
    Full-time
    UK / Canada / Europe / Singapore / Dubai / Australia-based • •Investment Banking or Private Equity Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have •...Show more
    Last updated: 21 hours ago • Promoted • New!
    Cytotechnologist

    Cytotechnologist

    Southcoast Health • Dartmouth, MA, United States
    Full-time
    Join Southcoast Health, where your future is as promising as the care we provide.Our commitment to each other, our patients, and our community is more than a mission - it's our way of life, and you...Show more
    Last updated: 30+ days ago • Promoted
    Advanced Clinical Tech II-Emergency Dept (Paramedic)

    Advanced Clinical Tech II-Emergency Dept (Paramedic)

    Southcoast Health • New Bedford, MA, United States
    Full-time
    Find out for yourself why Southcoast Health has been voted 'Best Place to Work' for 6 years in a row!.Join Southcoast Health, where your future is as promising as the care we provide.Our commitment...Show more
    Last updated: 30+ days ago • Promoted
    English Writing and Content Reviewing Expertise Sought for AI Training

    English Writing and Content Reviewing Expertise Sought for AI Training

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Certified Medical Assistant-Oncology Practice-Lead

    Certified Medical Assistant-Oncology Practice-Lead

    Southcoast Health • Fall River, MA, United States
    Full-time
    Join Southcoast Health, where your future is as promising as the care we provide.Our commitment to each other, our patients, and our community is more than a mission - it's our way of life, and you...Show more
    Last updated: 8 days ago • Promoted
    AI Product Manager - AI Infrastructure & Cloud Platforms (Remote, East Coast US)

    AI Product Manager - AI Infrastructure & Cloud Platforms (Remote, East Coast US)

    Black Recruitment SL • New Bedford, MA, US
    Remote
    Permanent
    Product Manager – AI Scale-Up (East Coast : NYC, Boston, or Washington DC, etc.Are you a Product Manager with deep expertise in AI and infrastructure — and a passion for building products that will ...Show more
    Last updated: 17 days ago • Promoted
    Remote Market Research Contributor (Hiring Immediately)

    Remote Market Research Contributor (Hiring Immediately)

    Earn Haus • Stoughton, Massachusetts, US
    Remote
    Full-time +2
    We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...Show more
    Last updated: 30+ days ago • Promoted
    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Remote Financial Advising Expert - AI Trainer ($50-$60 / hour)

    Data Annotation • New Bedford, Massachusetts
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Remote Content QA Reviewer

    Remote Content QA Reviewer

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    AI Writing Reviewer - Remote

    AI Writing Reviewer - Remote

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Financial Expert - AI Trainer ($150 per hour)

    Financial Expert - AI Trainer ($150 per hour)

    Mercor • Taunton, Massachusetts, US
    Remote
    Full-time
    UK / Canada / Europe / Singapore / Dubai / Australia-based • •Investment Banking or Private Equity Experts • • for a research project with a leading foundational model AI lab. You are a good fit if you : - Have •...Show more
    Last updated: 21 hours ago • Promoted • New!
    EMI Coordinator - Surgical Technologist

    EMI Coordinator - Surgical Technologist

    Southcoast Health • Fall River, MA, United States
    Full-time
    Find out for yourself why Southcoast Health has been voted 'Best Place to Work' for 6 years in a row!.Join Southcoast Health, where your future is as promising as the care we provide.Our commitment...Show more
    Last updated: 17 days ago • Promoted
    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Remote FinTech Product Analyst - AI Trainer ($50-$60 / hour)

    Data Annotation • New Bedford, Massachusetts
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 30+ days ago • Promoted
    Remote AI Writing Trainer

    Remote AI Writing Trainer

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Travel Cath Lab Technologist - $3,552 per week

    Travel Cath Lab Technologist - $3,552 per week

    Advantage Staffing Services • Fall River, MA, United States
    Full-time
    Advantage Staffing Services is seeking a travel Cath Lab Technologist for a travel job in Fall River, Massachusetts.Job Description & Requirements. Open to both local and travel candidates.Advantage...Show more
    Last updated: 17 days ago • Promoted
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • New Bedford, Massachusetts
    Remote
    Full-time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Show more
    Last updated: 22 days ago • Promoted
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted
    Remote AI Writing Specialist

    Remote AI Writing Specialist

    Outlier • Brockton, MA, United States
    Remote
    Full-time
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Show more
    Last updated: 1 day ago • Promoted