Research and STEM Expert
Location
Denver
Salary
60,000 a year
Description
This role is with Mercor. Mercor uses RippleMatch to find top talent.
About the Role
Mercor is seeking a highly skilled Research and STEM Expert to join our AI evaluation and technical quality assurance team. In this role, you will analyze, evaluate, and fact-check AI-generated outputs across scientific, mathematical, and technical domains, holding them to high standards of factual accuracy, logical reasoning, and clarity.
You will help improve the reasoning and reliability of cutting-edge Large Language Models (LLMs) by providing structured feedback and expert judgment across diverse STEM fields. This position is ideal for individuals with strong academic training, analytical precision, and a passion for advancing AI alignment in research and science.
Key Responsibilities
- Evaluate and critique AI-generated responses in STEM-related subjects (e.g., computer science, mathematics, physics, biology, and engineering).
- Conduct fact-checking and research validation using reputable public and academic sources.
- Assess scientific explanations, calculations, and reasoning for correctness and clarity.
- Provide structured written feedback to improve the model’s understanding and communication of technical topics.
- Collaborate with the AI quality team to improve annotation guidelines and maintain consistency across evaluations.
Minimum Requirements
- BS, MS, or PhD in a STEM domain (e.g., Computer Science, Mathematics, Biology, Physics, or Engineering)
- Expert-level English with excellent comprehension and communication skills
- Excellent at high school–level math
- Expert at fact-checking information across multiple domains (e.g., medical, legal, financial, and technical) using trusted public sources
- Excellent writing skills and attention to detail
- Significant experience using Large Language Models
Job type
Remote job
Tags
- technical
- reliability