
Yobi AI is hiring: Machine Learning Engineer - Inference / Serving in New York
Yobi AI, New York, NY, United States
Machine Learning Engineer - Inference / Serving
Join to apply for the Machine Learning Engineer - Inference / Serving role at Yobi AI
Overview
Yobi is a rapidly growing Behavioral AI company on a mission to ethically democratize the benefits of data and AI. Since 2019, we have built one of the largest consented behavioral datasets in the United States, extending far beyond the walled gardens of Big Tech. Unlike traditional LLM companies, Yobi builds foundation models of human behavior grounded in real‑world actions such as purchases and store visits. Our private‑by‑design modeling enables state‑of‑the‑art personalization and decisioning for leading brands and agencies while protecting privacy, safety, and ethics.
Today, we are focused on bringing the performance of closed‑web user acquisition to the open web and connected TV, giving brands walled‑garden results without the walls. At our core, Yobi is building the behavioral intelligence layer for any system that makes a personalization decision.
Working at Yobi
We’re at an inflection point—customer adoption is accelerating, but there’s still room to shape the architecture and culture from the ground up. Engineers here own major surface areas, build 0→1 systems in large‑scale data and model infrastructure, and help define how Behavioral AI scales ethically and effectively.
Highlights
Well‑funded with 5+ years of runway. We are scaling revenue quickly and project to be breakeven in 2026.
Partnerships with Microsoft and Databricks.
Fully remote or hybrid from hubs in SF Bay Area, Seattle, NYC.
World‑class team of Machine Learning experts with experience at Amazon, Uber, Twitter, Meta, etc.
Product and Go‑to‑Market teams that have taken ideas from concept to nine‑figure revenue streams.
Benefits
Competitive base salary.
Meaningful equity and financial upside.
Annual bonus target based on personal and company performance.
Health, dental, vision plans with low out‑of‑pocket costs.
Unlimited PTO.
401(k) with company match.
About the Role
As a Machine Learning Engineer focused on inference and serving at Yobi, you’ll design, optimize, and operate the systems that bring our Behavioral AI models to life in real time. You’ll work at the core of our production environment, turning trained models into performant, reliable, and continuously improving services that power our open‑web and CTV products.
This is an applied ML systems role—equal parts engineering depth, deployment craft, and model intuition. You’ll shape how models are packaged, versioned, rolled out, and observed across environments, ensuring every prediction is fast, accurate, and accountable.
Responsibilities & Expectations
Build and scale production ML serving systems—handle versioning, rollouts, rollback strategies, and live experimentation.
Ensure low‑latency inference by optimizing model graphs, quantizing, batching, caching, and efficient feature retrieval.
Write robust, high‑performance code in Go, Rust, C++, or Java and bridge to Python for model integration and analysis.
Treat inference as a living system—monitor drift, track model lineage, and ensure observability from input to outcome.
Make serving systems reproducible and portable without over‑engineering—for instance, custom runtime design, model registries, or lightweight orchestration.
Reason about model performance and trade‑offs, and work with researchers to deploy more practical models.
Qualifications
Deep expertise in model deployment and production ML serving.
Strong low‑latency mindset and knowledge of inference optimization techniques.
Systems fluency: comfortable writing high‑performance code and bridging to Python.
Operational maturity: experienced with monitoring, drift detection, and observability.
Infrastructure intuition: understanding of custom runtimes, registries, and orchestration.
Applied ML understanding: can interpret performance, reasoning about trade‑offs, and collaborate with researchers.
Seniority Level
Mid‑Senior level.
Employment Type
Full‑time.
Job Function
Engineering and Information Technology. Software Development industry.
#J-18808-Ljbffr
In Summary: Yobi is a rapidly growing Behavioral AI company on a mission to ethically democratize the benefits of data and AI . Yobi builds foundation models of human behavior grounded in real‑world actions such as purchases and store visits . Our private‑by‑design modeling enables state‑of‑the‑art personalization and decisioning for leading brands .
En Español:
Ingeniero de aprendizaje automático - Inferencia / Servicio Unirse para solicitar el ingenio de Machine Learning - Enferencia/Servicio en Yobo AI Overview Yobi es una compañía de IA comportamental que crece rápidamente y con la misión de democratizar éticamente los beneficios de datos e inteligencia artificial. Desde 2019, hemos construido uno de los conjuntos de datos conductuales consentidos más grandes en Estados Unidos, extendiéndonos mucho más allá de los jardines amurados de Big Tech. A diferencia de las compañías tradicionales de LLM, Yobi construye modelos básicos del comportamiento humano basados en acciones reales como compras y visitas a tiendas. Nuestro modelo de diseño privado permite personalización y toma de decisiones para marcas y agencias líderes al tiempo que protege la privacidad, seguridad y ética. Estamos aumentando los ingresos rápidamente y proyectamos equilibrarlos en 2026. Alianzas con Microsoft y Databricks. Planeos totalmente remotos o híbridos desde centros de SF Bay Area, Seattle, NYC. Un equipo de expertos en aprendizaje automático de clase mundial con experiencia en Amazon, Uber, Twitter, Meta, etc. Equipos de producto y Go-to-Market que han llevado las ideas del concepto a flujos de ingresos de nueve cifras. Beneficios Salario base competitivo. Equidad significativa y ventaja financiera. Objetivo de bonificación anual basado en la visión personal y empresarial. Planes de salud, odontología, rendimiento con bajos costos fuera de bolsillo. PTO ilimitada. 401k) con el juego de empresa. Sobre el papel Como ingeniero de aprendizaje automatico enfocado en inferir y servir en Jobit, diseñará, optimizará y operará los sistemas que llevan a la profundidad en tiempo real. Desarrollará la forma en que los modelos se empaquetan, versionan, despliegan y observan a través de entornos, asegurando que cada predicción sea rápida, precisa y responsable. Responsabilidades y expectativas Construir y escalar sistemas de producción ML servidores manejar versión, lanzamientos, estrategias de retroceso y experimentación en vivo. Asegurar inferencia de baja latencia mediante la optimización de gráficos de modelo, cuantificación, lotes, almacenamiento caché y recuperación eficiente de características. Escribir código robusto y de alto rendimiento en Go, Rust, C++, o Java y puente a Python para la integración y análisis del modelo. Tratar las inferencias como un monitoreo de sistema vivo, el nivel de derivación del modelo y garantizar la observabilidad desde la entrada hasta la salida. Reproducir datos prácticos sin exceso de ingeniería de tiempo, empleaciones personalizadas, escritura por cuenta ajustada, codificación de códigos, y una comprensión más amplia sobre el funcionamiento y la implementación de modelos de trabajo con tecnología avanzada. Industria del desarrollo de software.