Mediabistro logo
job logo

Senior Deep Learning Software Engineer, LLM Performance

NVIDIA Gruppe, Santa Clara, CA, USA

Pay: $184,000-$287,500/yr

Job type: Full Time


We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference. This role focuses on designing and optimizing GPU-accelerated software for large language model deployment and serving.
What you'll be doing:

Optimize performance, analysis, and tuning of LLM, VLM, and GenAI models for DL inference, serving, and deployment in NVIDIA/OSS LLM frameworks.
Scale performance of LLM models across different architectures and NVIDIA accelerators from datacenter GPUs to edge SoCs.
Achieve maximum throughput and minimum latency, meeting throughput under latency constraints.
Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.
Collaborate with cross‑functional teams in generative AI, automotive, image understanding, and speech understanding to develop innovative solutions.
What we need to see:

Bachelor’s, Master’s, PhD, or equivalent experience in Computer Engineering, Computer Science, EECS, AI.
At least 8 years of relevant software development experience.
Excellent Python, C, and C++ programming, software design, and engineering skills.
Experience with a deep learning framework such as PyTorch, JAX, or TensorFlow.
Ways to stand out from the crowd:

Prior experience with an LLM framework or a deep learning compiler in inference, deployment, algorithms, or implementation.
Prior experience with performance modeling, profiling, debugging, and code optimization of a deep learning, HPC, or high‑performance application.
Architectural knowledge of CPU and GPU systems.
GPU programming experience (CUDA or OpenCL).
Compensation & Benefits

Base salary determined by location, experience, and comparable roles: $184,000 – $287,500 for Level4; $224,000 – $356,500 for Level5.
Eligible for equity and benefits.
EEO Statement

NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

#J-18808-Ljbffr