Mindrift

Remote AI Evaluation Scenario Architect & Agent Testing

Mindrift, Denver, Colorado, United States

A leading AI company is seeking a professional to design structured evaluation scenarios for LLM-based agents. You will create test cases that simulate human tasks, define gold-standard behaviors, and analyze decision paths. Ideal candidates have a relevant degree and expertise in testing and data analysis. Contribute flexibly, with rates up to $80/hour based on skills and experience. This remote role allows you to shape AI technology while managing your own schedule. #J-18808-Ljbffr