
PhD Mathematician AI Evaluation Specialist
Mercor, San Francisco, CA, United States
About the job
Mercor
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
Benchmark ,
General Catalyst ,
Peter Thiel ,
Adam D'Angelo ,
Larry Summers , and
Jack Dorsey . Position:
Mathematics AI Evaluator Type:
Full-time or Part-time Contract Work Compensation:
$73/hour Location:
USA, UK, Canada, EU Role Responsibilities Write and refine prompts to guide model behavior in mathematical contexts. Evaluate
LLM-generated responses
to mathematics-related queries for correctness, rigor, and logical coherence. Verify mathematical claims, derivations, and proofs using domain expertise. Conduct fact-checking using authoritative public sources and domain knowledge. Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies. Ensure model responses align with expected conversational behavior and system guidelines. Qualifications Must-Have PhD in Mathematics or a closely related field . Demonstrated experience in
Probability & Statistics . Significant experience using
large language models
(LLMs). Excellent writing skills
for explaining complex mathematical concepts. Strong attention to detail
with the ability to notice subtle issues. Experience reviewing or editing technical or academic writing. Preferred Prior experience with
RLHF , model evaluation, or data annotation work. Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences. Familiarity with evaluation rubrics, benchmarks, or structured review frameworks. Application Process (Takes 20–30 mins to complete) Upload resume AI interview based on your resume Submit form Resources & Support For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome For any help or support, reach out to: support@mercor.com PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
#J-18808-Ljbffr
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
Benchmark ,
General Catalyst ,
Peter Thiel ,
Adam D'Angelo ,
Larry Summers , and
Jack Dorsey . Position:
Mathematics AI Evaluator Type:
Full-time or Part-time Contract Work Compensation:
$73/hour Location:
USA, UK, Canada, EU Role Responsibilities Write and refine prompts to guide model behavior in mathematical contexts. Evaluate
LLM-generated responses
to mathematics-related queries for correctness, rigor, and logical coherence. Verify mathematical claims, derivations, and proofs using domain expertise. Conduct fact-checking using authoritative public sources and domain knowledge. Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies. Ensure model responses align with expected conversational behavior and system guidelines. Qualifications Must-Have PhD in Mathematics or a closely related field . Demonstrated experience in
Probability & Statistics . Significant experience using
large language models
(LLMs). Excellent writing skills
for explaining complex mathematical concepts. Strong attention to detail
with the ability to notice subtle issues. Experience reviewing or editing technical or academic writing. Preferred Prior experience with
RLHF , model evaluation, or data annotation work. Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences. Familiarity with evaluation rubrics, benchmarks, or structured review frameworks. Application Process (Takes 20–30 mins to complete) Upload resume AI interview based on your resume Submit form Resources & Support For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome For any help or support, reach out to: support@mercor.com PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
#J-18808-Ljbffr