Distributed LLM Inference Engineer (vLLM)

Red Hat, Inc., Boston, MA, United States

Red Hat, Inc. is seeking a Machine Learning Engineer to build distributed vLLM infrastructure and improve AI deployments on Kubernetes. This full-time role involves developing scalable systems, contributing to design discussions, and collaborating with engineering teams. The ideal candidate has strong Python or Go skills, experience with cloud-native technologies, and the ability to work independently in a fast-paced environment. Benefits include comprehensive healthcare and retirement plans.