Mediabistro logo
job logo

Software Development Engineer, Annapurna Labs, Elastic Collectives at Annapurna

Itlearn360, Cupertino, CA, United States


Job Description
We are seeking an experienced engineer to work on distributed AI/ML systems. The role focuses on implementing collective operations that enable AI to scale across multiple accelerators and servers. Most of the stack is C/C++ and low‑level, so solid knowledge of Linux, kernel internals, and high‑performance code is essential. Experience with embedded systems and high‑speed networking or HPC interconnects is highly valued. If you enjoy solving hard problems, iterating fast, and delivering meaningful solutions at scale, this is a unique opportunity to contribute to the largest AI models and hardware‑accelerated clusters.

Responsibilities

Design, implement, and optimize collective operations for scalable AI/ML workloads.

Write efficient C/C++ code with deep Linux kernel and networking knowledge.

Collaborate with infrastructure experts, hardware engineers, RTL engineers, and scientists on architecture and design discussions.

Mentor junior engineers and participate in thorough, constructive code reviews.

Drive fast iteration cycles and deliver high‑quality solutions for large‑scale AI customers.

Qualifications

3+ years of professional software development experience (non‑internship).

2+ years of design or architecture experience for new and existing systems, focusing on reliability and scaling.

Proficiency in at least one programming language, preferably C/C++.

Strong foundation in Linux kernel internals and high‑performance networking or HPC interconnects.

Experience with embedded systems is a plus.

Team & Culture
Join a globally distributed team that values mentorship, collaboration, and continuous learning. You will work closely with senior engineers, architects, and directors, while also guiding junior team members. The environment encourages rapid innovation, thoughtful code reviews, and ongoing career growth.

#J-18808-Ljbffr