TETRAMEM INC is hiring: Senior Machine Learning Engineer in San Jose

TETRAMEM INC, San Jose, CA, United States

At TetraMem, we are redefining the future of AI with our groundbreaking innovations in In-Memory Computing. Leveraging world-record multi-level RRAM technology, we deliver highly efficient solutions for AI computations, enabling superior performance and energy efficiency across applications ranging from edge devices to data centers. Our talented team of engineers and industry-leading executives drives this progress, making TetraMem a leader in advanced memory technologies.

If you are passionate about cutting-edge technology and thrive in a fast-paced, collaborative environment, TetraMem is the place for you. Join our global team to shape the future of AI computations and sustainable technology solutions while working at the forefront of innovation. Together, we can make a lasting impact.

Are you ready for new challenges and new opportunities?

Join our team!

Current job opportunities are posted here as they become available.

Subscribe to our RSS feeds to receive instant updates as new positions become available.

Develop, optimize, and deploy lightweight machine learning models for edge AI applications, particularly for audio processing.

Implement and optimize ML models on embedded platforms, including FPGA and custom ASIC solutions.

Work closely with hardware and software teams to integrate ML models into production systems.

Research and implement state-of-the-art ML techniques to enhance model efficiency, latency, and power consumption for embedded AI applications.

Improve inference efficiency and model compression techniques, including quantization, pruning, and knowledge distillation.

Collaborate with cross-functional teams to drive innovation and contribute to the overall system architecture.

Provide technical leadership and mentorship to junior engineers.

Publish research findings, present at conferences, and contribute to open-source projects when applicable.

Requirements

5+ years of experience or PhD in Computer Science, Electrical Engineering, or related fields.

Strong experience in machine learning, with a focus on edge AI and lightweight model deployment.

Expertise in ML frameworks such as PyTorch, TensorFlow, JAX.

Proficiency in programming languages such as C/C++, Python, and experience with ML model optimization.

Ability to work independently and collaboratively in a fast-paced startup environment.

Experience in one or more of the following areas considered a strong plus

Understanding of ML compiler and runtime design.

Experience working with tools such as Optimum, ONNX, TensorRT, TFLite/LiteRT, ncnn, or CoreML.

Familiarity with hardware acceleration techniques.

Experience in embedded system development.

Salary Range: $110,000 - $300,000 / year

#J-18808-Ljbffr

In Summary: TetraMem is redefining the future of AI with its groundbreaking innovations in In-Memory Computing . We deliver highly efficient solutions for AI computations, enabling superior performance and energy efficiency across applications ranging from edge devices to data centers . We need a strong team of engineers and industry-leading executives to drive this progress .

En Español: En TetraMem, estamos redefiniendo el futuro de la IA con nuestras innovaciones innovadoras en Computación In-Memory. Aprovechando la tecnología RRAM multinivel del récord mundial, ofrecemos soluciones altamente eficientes para los cálculos AI, permitiendo un rendimiento y una eficiencia energética superiores a través de aplicaciones que van desde dispositivos vanguardistas hasta centros de datos. Nuestro talentoso equipo de ingenieros y ejecutivos líderes de la industria impulsa este progreso, haciendo de Tetra Mem un líder en tecnologías avanzadas de memoria. Juntos, podemos hacer un impacto duradero. ¿Estás listo para nuevos desafíos y nuevas oportunidades? Únete a nuestro equipo! Las oportunidades de trabajo actuales se publican aquí cuando estén disponibles. Suscríbete a nuestros feeds RSS para recibir actualizaciones instantáneas a medida que aparezcan nuevas posiciones. Desarrolle, optimice e implemente modelos de aprendizaje automático ligeros para aplicaciones AI extremas, especialmente para procesamiento de audio. Implementar y optimizar los modelos ML en plataformas embebidas, incluidos FPGA y soluciones ASIC personalizadas. Trabaja junto con equipos de hardware y software para integrar modelos en sistemas de producción. Investigue e implique técnicas ML de última generación para mejorar la eficiencia del modelo, latencia y potencia para el consumo de las aplicaciones embedded.