Mercor is looking for a STEM Expert with experience in biology / physics / chemistry theory and problem-solving to develop and refine high-quality reasoning questions that evaluate AI models. Your knowledge in biology / physics / chemistry will ensure the accuracy, rigor, and instructional quality of these test items.
- ###
- Job Details :
- Design and Optimize STEM-Based Prompts
- : Create detailed prompts with multiple constraints and scientific instructions. -
- Define and Document Evaluation Standards
- : Establish high-level expectations for correct responses in STEM contexts, and develop comprehensive rubric. -
- Conduct Model Testing and Grading
- : Run prompts through models and assess preliminary outputs against expectations. -
- Support Benchmarking and Quality Assurance
- : Collaborate in QA review processes to ensure prompt tasks and rubrics meet scientific rigor, maintaining consistency and reliability before integration into official benchmarks. ###
- Minimum Qualifications :
- BS or BA in Biology / Physics / Chemistry - Familiarity with undergraduate-level biology / physics / chemistry topics - Strong writing and critical thinking skills. - Ability to work independently and meet deadlines. ###
- Preferred Qualifications :
- 2+ years of experience in teaching, research, or applied STEM discipline field. ###
- Application & Onboarding Process :
- Complete an AI-led interview, this should take around 15 minutes. - Complete a 45-minute written assessment that will guide you through writing rubrics. - If selected, you will be invited to work on the project. ###
- More Details About This Role :
- This is a
- remote and asynchronous
- role — work on your own schedule. - Expect to contribute at least
- 20 hours per week
- . - Expect a commitment of around 1 month. - You’ll be working in a structured project environment with clear goals and tools.
- ###
- About
- Mercor
- ](https : / / mercor.com / )
- Our team is based in San Francisco, CA - We [ in recruiting experts for top AI labs - Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey