Talent.com
Expert Prompt Curators for Advanced AI Evaluation Dataset
Expert Prompt Curators for Advanced AI Evaluation DatasetMercor • Lowell, Massachusetts, US
No se aceptan más aplicaciones
Expert Prompt Curators for Advanced AI Evaluation Dataset

Expert Prompt Curators for Advanced AI Evaluation Dataset

Mercor • Lowell, Massachusetts, US
Hace 1 día
Tipo de contrato
  • A tiempo completo
  • Teletrabajo
Descripción del trabajo

##

  • 1\. Role Overview

Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge across diverse fields to design extremely challenging prompts that cannot be solved by existing AI systems without internet search or browsing capabilities. The goal is to create a benchmark dataset that pushes the limits of current AI reasoning and retrieval. This is a short-term research engagement with significant impact on AI evaluation. ##

  • 2\. Key Responsibilities
  • Create original, expert-level prompts that require tool use (e.g., search, browse, or code execution). - Ensure prompts are objective, self-contained, and yield clear, unambiguous answers. - Test prompts against advanced AI models and document failures / successes. - Provide reasoning steps and solutions for each prompt. - Classify prompts into subject domains for dataset organization. - Collaborate with reviewers for expert validation and prompt refinement. ##
  • 3\. Ideal Qualifications
  • Advanced academic or professional expertise in a specialized subject (STEM, law, finance, history, cultural studies, etc.). - Strong ability to design precise, high-difficulty questions requiring deep knowledge and external references. - Experience in academic research, benchmarking, or test question design preferred. - Attention to detail and ability to provide concise reasoning explanations. - Familiarity with AI models and their limitations is a plus. ##
  • 4\. More About the Opportunity
  • Remote and asynchronous — set your own hours. - Expected commitment : ~10–20 hours / week. - Project duration : ~2 months, with possible extensions based on dataset needs. - Opportunity to contribute to high-impact AI safety and evaluation research. ##
  • 5\. Compensation & Contract Terms
  • Competitive hourly compensation based on expertise. - Independent contractor engagement. - Payments for services rendered processed weekly via Stripe Connect. ##
  • 6\. Application Process
  • Submit your resume or CV highlighting your subject matter expertise. - Complete a brief questionnaire about your background and areas of specialization. - Selected applicants may be asked to draft a short test prompt. - You’ll receive follow-up within a few days regarding next steps. ##
  • 7\. About Mercor
  • Mercor is a talent marketplace that connects top experts with leading AI labs and research organizations. - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey. - Thousands of professionals across domains like law, creatives, engineering, and research have joined Mercor to work on frontier projects shaping the next era of AI.
  • Crear una alerta de empleo para esta búsqueda

    Curator • Lowell, Massachusetts, US

    Ofertas relacionadas
    Mid-Level AI Engineer

    Mid-Level AI Engineer

    Credence • Bedford, Massachusetts, US
    A tiempo completo
    Overview Credence is one of the largest privately held technology services firms in the U.Top Workplace and featured on the Inc. Fastest-Growing Private Companies list for 12 consecutive years.We se...Mostrar más
    Última actualización: hace 13 días • Oferta promocionada
    AI Trainer -Remote Content QA Reviewer

    AI Trainer -Remote Content QA Reviewer

    Outlier • Nashua, NH, United States
    Teletrabajo
    A tiempo completo
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Product Complaints Engineer - Team Lead

    Product Complaints Engineer - Team Lead

    DEKA Research and Development • Manchester, NH, United States
    A tiempo completo
    DEKA R&D has an immediate opening for a Product Complaints Engineer - Team Lead to work in a dynamic Medical Device Research and Development environment. The position reports to the Product Complain...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Remote Bilingual STEM Expert (Spanish and English) - AI Trainer ($24.5-$45.5 per hour)

    Remote Bilingual STEM Expert (Spanish and English) - AI Trainer ($24.5-$45.5 per hour)

    Mercor • Methuen, Massachusetts, US
    Teletrabajo
    A tiempo completo
    Overview : • • We are looking for a Bilingual STEM Expert (Spanish and English) to review, evaluate, and refine AI-generated content involving scientific, mathematical, and technical reasoning across ...Mostrar más
    Última actualización: hace 15 horas • Oferta promocionada • Nueva oferta
    Remote STEM PhDs – Physics - AI Trainer ($65-$75 per hour)

    Remote STEM PhDs – Physics - AI Trainer ($65-$75 per hour)

    Mercor • Manchester, New Hampshire, US
    Teletrabajo
    A tiempo completo
    Mercor is seeking • •Physics PhDs • • for a premier project with one of the world's top AI labs.This role pays between • •$65-75 / hour. In this role, you will contribute your subject matter expertise to ...Mostrar más
    Última actualización: hace 15 horas • Oferta promocionada • Nueva oferta
    Senior AI Engineer Salesforce & Applied Intelligence

    Senior AI Engineer Salesforce & Applied Intelligence

    ZoomInfo Technologies • Waltham, Massachusetts, United States
    A tiempo completo
    We’re transforming how Go-To-Market (GTM) teams — across.Sales, Marketing, and Customer Experience.AI to drive smarter, faster, and more predictive decisions. Our mission is to build intelligent sys...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Principal Data Scientist in Epidemiology and Patient Data Products

    Principal Data Scientist in Epidemiology and Patient Data Products

    Valo Health • Lexington, Massachusetts, United States
    Teletrabajo
    A tiempo completo +1
    Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster.The company’s Opal Computational Platform transforms drug discovery and development th...Mostrar más
    Última actualización: hace 20 días • Oferta promocionada
    Senior Director, Software Engineering & AI

    Senior Director, Software Engineering & AI

    Entegris • Billerica, MA, United States
    A tiempo completo
    Senior Director, Software Engineering & AI.Here at Entegris, we use advanced science to enable technologies that transform the world, and we are seeking employees who have the drive to continue tha...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Travel Certified Surgical Technologist - $2,350 per week

    Travel Certified Surgical Technologist - $2,350 per week

    HealthTrust Workforce Solutions HCA • Manchester, NH, United States
    A tiempo completo
    HealthTrust Workforce Solutions HCA is seeking a travel Certified Surgical Technologist for a travel job in Manchester, New Hampshire. Job Description & Requirements.Certified Surgical Technologist....Mostrar más
    Última actualización: hace 9 días • Oferta promocionada
    Remote Expert Prompt Curators for Advanced AI Evaluation Dataset - AI Trainer ($50-$70 per hour)

    Remote Expert Prompt Curators for Advanced AI Evaluation Dataset - AI Trainer ($50-$70 per hour)

    Mercor • Nashua, New Hampshire, US
    Teletrabajo
    A tiempo completo
    Role Overview • • Mercor is collaborating with a leading AI research lab to develop a next-generation evaluation dataset for frontier AI models. We are seeking experts with advanced domain knowledge a...Mostrar más
    Última actualización: hace 15 horas • Oferta promocionada • Nueva oferta
    Remote AI Writing Evaluator

    Remote AI Writing Evaluator

    Outlier • Nashua, NH, United States
    Teletrabajo
    A tiempo completo
    Earn up to $15 / hour + performance bonuses.Outlier, a platform owned and operated by Scale AI, is looking for.If you're passionate about improving models and excited by the future of AI, this is you...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Staff Applied Scientist

    Staff Applied Scientist

    Relativity • Manchester, NH, United States
    A tiempo completo
    At Relativity, we're building a world-class Applied Science team to push the boundaries of intelligent systems in the legal domain. We're looking for a Staff Applied Scientist to join our team.Agent...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Automation Machine Vision Engineer

    Automation Machine Vision Engineer

    DEKA Research and Development • Manchester, NH, United States
    A tiempo completo
    DEKA Research & Development, located in Manchester, NH, is seeking an Automation Machine Vision Engineer to join our industrial automation team developing vision inspection systems for medical devi...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Travel Certified Surgical Technologist (CST) - $2,253 per week in Lebanon, NH

    Travel Certified Surgical Technologist (CST) - $2,253 per week in Lebanon, NH

    AlliedTravelCareers • Manchester, New Hampshire, US
    A tiempo completo
    AlliedTravelCareers is working with AHS Staffing to find a qualified CST in Lebanon, New Hampshire, 03756!.AHS Staffing is looking for a CST Certified Surgical Technologist in Lebanon, NH for a Lon...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Travel Surgical Tech - $1874 / Week

    Travel Surgical Tech - $1874 / Week

    Cynet Health • Manchester, NH, US
    A tiempo completo
    Cynet Health is seeking an experienced Surgical Tech for an exciting Travel Allied job in Manchester, NH.Shift : 4x10 hr days Start Date : 12 / 08 / 2025 Duration : 13 weeks Pay : $1874 / Week.Ranked #5 Be...Mostrar más
    Última actualización: hace 12 horas • Oferta promocionada • Nueva oferta
    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Remote M&A Associate - AI Trainer ($50-$60 / hour)

    Data Annotation • Methuen, Massachusetts
    Teletrabajo
    A tiempo completo +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the quality of ...Mostrar más
    Última actualización: hace 23 días • Oferta promocionada
    Travel Surgical Tech - Certified

    Travel Surgical Tech - Certified

    Axis Medical Staffing • Manchester, NH, US
    A tiempo completo
    Axis Medical Staffing is seeking an experienced Surgical Tech - Certified for an exciting Travel Allied job in Manchester, NH. Shift : 4x10 hr days Start Date : 12 / 08 / 2025 Duration : 13 weeks.TOP RANKE...Mostrar más
    Última actualización: hace 12 horas • Oferta promocionada • Nueva oferta
    Travel Surgical Tech - $1874 / Week

    Travel Surgical Tech - $1874 / Week

    Pulse Healthcare Services • Manchester, NH, US
    A tiempo completo
    Pulse Healthcare Services is seeking an experienced Surgical Tech for an exciting Travel Allied job in Manchester, NH.Shift : 4x10 hr days Start Date : 12 / 08 / 2025 Duration : 13 weeks Pay : $1874 / Week...Mostrar más
    Última actualización: hace menos de 1 hora • Oferta promocionada • Nueva oferta