
TonicAI is hiring: Machine Learning Engineer (NLP) in San Francisco
TonicAI, San Francisco, CA, United States
Tonic.ai is looking for a hands-on Machine Learning Engineer to help build production-grade NLP systems that power our data privacy and information extraction products. You'll join a small, experienced team working at the intersection of LLMs, data privacy, and applied AI — developing and fine-tuning models that detect and redact sensitive information across diverse datasets.
What You’ll Do
Build and ship models. Fine-tune and evaluate transformer-based models (e.g., RoBERTa, Gemma, LLaMA) to support PII redaction, entity extraction, and synthetic data generation.
Own the ML lifecycle. From dataset curation and experiment tracking to model deployment and monitoring — you’ll own the full path from prototype to production.
Collaborate cross-functionally. Partner with Product and Design to shape how ML models drive user-facing features, and work with the broader engineering team to integrate them into scalable systems.
Experiment responsibly. Document your experiments, evaluate results rigorously, and help push the frontier of safe and explainable AI for data privacy.
What You’ll Bring
3+ years of professional experience in applied ML or data science with a focus on NLP
Proficiency in Python and deep learning frameworks such as PyTorch and Hugging Face Transformers
Hands-on experience with experiment tracking (e.g., Weights & Biases), distributed training (e.g., Accelerate), and model serving (e.g., vLLM)
Comfort working independently and iterating quickly — you enjoy the mix of research, engineering, and product thinking
Strong communication and collaboration skills
Bonus Points For:
Experience with supervised and reinforcement learning fine-tuning (e.g., TRL)
Familiarity with data privacy, PII redaction, or healthcare data
A public portfolio, blog, or open-source contributions that demonstrate your technical depth and curiosity
Why You’ll Love It Here
High autonomy and meaningful ownership: your models will ship to production, not sit in a notebook
Small, collaborative team with deep expertise in NLP and privacy
Opportunity to work with real-world, high-impact data in domains like healthcare and financial services
About Tonic.ai
Tonic.ai empowers developers while protecting customer privacy by enabling companies to create safe, synthetic versions of their data for use in software development, model training, and AI implementation. Founded in 2018, with offices in San Francisco, Atlanta, New York, and London, the company is pioneering enterprise tools for data transformation, de-identification, synthesis, and subsetting, in pursuit of its mission to make data usable. Thousands of developers use data generated with Tonic.ai on a daily basis to build their products faster in industries as wide ranging as healthcare, financial services, logistics, edtech, and e-commerce. Working with customers like eBay, Cigna, American Express, and Volvo, Tonic.ai innovates to advance its goal of advocating for the privacy of individuals while enabling companies to do their best work. For more information, visit https://www.tonic.ai or follow /tonicfakedata on LinkedIn.
Benefits we offer
Competitive salary and equity
Unlimited paid time off
401k plan with employer contribution
Medical, dental, and vision insurance
Generous parental leave policy
Remote-friendly work environment
Generous comp plan with uncapped commission/earning potential
Computer of choice and stipend to purchase office equipment, etc.