Mindrift
Remote AI Evaluation Scenario Architect & Agent Testing
Mindrift, Denver, Colorado, United States
A leading AI company is seeking a professional to design structured evaluation scenarios for LLM-based agents. You will create test cases that simulate human tasks, define gold-standard behaviors, and analyze decision paths. Ideal candidates have a relevant degree and expertise in testing and data analysis. Contribute flexibly, with rates up to $80/hour based on skills and experience. This remote role allows you to shape AI technology while managing your own schedule.
#J-18808-Ljbffr