Senior Reward Models Scientist - RLHF

Anthropic, San Francisco, CA, United States

A leading AI research organization in San Francisco is seeking a Senior Research Scientist focused on reward modeling for large language models. The successful candidate will lead research to improve how AI systems learn human preferences and stay aligned with human values. Responsibilities include developing training methodologies and collaborating across teams to improve the reliability and safety of AI systems. A Bachelor's degree and a strong research background in a related field are required.