Mediabistro logo
job logo

LLM Dataset Engineer: High-Scale Data for Frontier Models

Sciforium, San Francisco, CA, United States


Sciforium, based in San Francisco, is looking for a visionary LLM Dataset Engineer to lead the strategy and creation of datasets for their multimodal AI models. The ideal candidate will have over 5 years of industry experience, deep proficiency in Python, and experience with petabyte-scale datasets. You will be responsible for crafting pre-training datasets, developing data pipelines, and ensuring data quality. Benefits include insurance, a 401k plan, and a flexible time-off policy.
#J-18808-Ljbffr