AMD

Principal GPU Network Software Engineer - Communications Library, MPI/ SHMEM, Cu

AMD, Santa Clara, CA, US, 95053

Salary min: $18,375.00

Salary max: $30,000.00

Duration: Full Time

Principal GPU Network Software Engineer - Communications Library

This range is provided by AMD. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

The Role

As a GPU Network Software Engineer in our Network Technology Solutions Group (NTSG), you will design, implement, and test features in communication libraries, middleware, and frameworks to provide best-in-class support for GPU applications running high-performance computing and machine learning workloads at scale. You will work with technical experts within AMD, our partners, and the open-source community to implement these features as part of AMD’s open-source ROCm stack for GPU computation.

The Person

You are accustomed to working in a dynamic, geographically distributed agile team, where partnership and collaboration are paramount. You possess excellent written and verbal communication skills, and strong attention to detail. You are results-oriented and accustomed to tight deadlines and changing priorities. Most importantly, you are constantly thinking of ways to improve performance of multi-node GPU applications.

Key Responsibilities

  • Design, implement, and test features to enhance GPU support in communication libraries, middleware and frameworks
  • Benchmark, profile and optimize code to maximize performance of multi-node GPU applications
  • Deliver high-quality code and documentation following best practices for open-source software development
  • Work with key technical experts at our customers, across AMD, and with our industry partners in the Ultra Ethernet Consortium and Ultra Accelerator Link Consortium to advance scale-out and scale-up software and hardware solutions


Preferred Experience

  • Strong background developing system software in C/C++
  • Experience with at least one of the following:
    • Implementing communication middleware like MPI/SHMEM
    • Implementing lower-level communication frameworks like UCX and libfabric, or development using RDMA APIs
    • Development and optimization of communication collective algorithms (e.g. AllReduce)
  • Familiarity with GPU programming in HIP or CUDA
  • In-depth knowledge of best practices in software development, including testing, profiling, debugging, documentation, version control, issue tracking, and planning
  • Proven track record contributing to open-source projects


Academic Credentials

  • B.Sc. or B.Eng. degree in Computer Science, Electrical Engineering, or equivalent


Advanced degrees, such as M.Sc., M.Eng., or Ph.D., are preferred.

LOCATION:

Santa Clara, CA

Benefits offered are described in AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

Seniority level: Mid-Senior level

Employment type: Full-time

Job function: Engineering and Information Technology

Industries: Semiconductor Manufacturing
