Remote | Finance Model Prompt Evaluator — $50–$80/hour

24-MAG, San Francisco, CA, United States


We are sharing a specialised part-time consulting opportunity for expert finance and economics professionals with strong backgrounds in financial analysis, quantitative methods, regulatory reasoning, and high‑quality technical writing. This role supports an exciting collaboration with a leading frontier AI research laboratory focused on improving financial reasoning and model evaluation through rigorous, high‑quality prompt authoring and verification workflows.

Selected professionals will author and verify open‑ended financial analysis prompts across core subdomains such as quantitative finance, macroeconomics, risk management, banking, regulation, tax, and asset‑focused finance. The goal is to help advanced AI systems produce higher‑quality reasoning in complex financial contexts by building challenging, unambiguous evaluation tasks and applying expert judgment to assess prompt quality, scope, and difficulty.

Key Responsibilities

Prompt Authoring

Create original, open‑ended prompts within an assigned financial subdomain across varying difficulty levels, including undergraduate, advanced undergraduate, and graduate or professional levels.
Design prompts that require human judgment to evaluate the quality of the AI's response, including tasks involving quantitative analysis, risk modeling, or regulatory reasoning.
Ensure prompts are clear, well‑scoped, and sufficiently challenging for meaningful model evaluation.

Prompt Verification & Quality Review

Review authored prompts for clarity, uniqueness, scope alignment, and difficulty accuracy.
Edit prompts and difficulty assignments where standards are not met.
Ensure that prompts within each task are sufficiently distinct from one another and aligned with project expectations.

Financial Reasoning Evaluation Support

Apply expert judgment to assess the depth and quality of financial reasoning required by each prompt.
Help establish rigorous evaluation standards for frontier language models operating in financial and economic domains.
Support high‑quality task design across a broad set of financial analysis areas.

Ideal Profile

A Master's degree or higher in Finance, Economics, Financial Engineering, or a closely related field.
2–6 years of professional experience in financial services, investment banking, asset management, or a related field.
Strong command of financial modeling, quantitative methods, and domain‑specific regulatory frameworks.
Excellent written English and the ability to craft precise, well‑scoped technical questions.
Comfort working across structured evaluation tasks requiring depth, clarity, and domain judgment.

Preferred Qualifications

CFA, FRM, CPA, or equivalent professional certification.
Experience across one or more of the following areas: quantitative finance, derivatives and trading, macroeconomics, rates, FX, banking, lending, risk management, insurance, wealth management, tax, compliance, or cross‑border structuring.
Ability to design open‑ended financial questions that require nuanced reasoning rather than simple factual recall.
Strong editorial judgment when reviewing scope, clarity, and difficulty calibration.

Why This Opportunity

Contribute specialised finance and economics expertise to a cutting‑edge AI collaboration.
Help improve how advanced AI systems reason through financial analysis, regulation, and complex quantitative tasks.
Work on high‑impact evaluation workflows that shape financial model benchmarking standards.
Flexible remote work with structured expectations and competitive hourly compensation.

Contract Details

Independent contractor role.
Fully remote with flexible scheduling.
Hourly compensation of $50–$80 per hour.
Expected commitment of 10+ hours per week.
Work is fully asynchronous.
Projects may be extended, shortened, or concluded early depending on project needs and performance.
Weekly payments via Stripe or Wise.
Work will not involve access to confidential or proprietary information from any employer, client, or institution.

Please note: We are unable to support H‑1B or STEM OPT candidates at this time.

Start date: Immediate.
