Senior Inference Technical Product Marketing Manager - Accelerated Computing
NVIDIA, Santa Clara
Senior Inference Technical Product Marketing Manager – Accelerated Computing
4 days ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Pay
$144,000.00/yr – $287,500.00/yr
What You’ll Be Doing
- Help drive NVIDIA’s inference platform technical go‑to‑market efforts
- Work closely with engineering and product management teams to understand key technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (e.g. parallelisms, configurations, etc.)
- Diligently review and remain up to date on model architectures, frameworks, arxiv papers, whitepapers and deployment techniques (e.g. disaggregated serving, KV cache implementations) and identify intersection points between the latest AI models and NVIDIA’s platform to maximize performance and minimize TCO
- Develop crisp, clear positioning, messaging and assets to highlight NVIDIA’s leadership position in inference. Assets include blogs, whitepapers, presentations, analyst briefings and seminars at developer conferences
- Closely follow competitive inference announcements and prepare appropriate responses for business and technical/developer audiences
- Assist on building keynote slides for executives in areas where you are a subject matter expert
What We Need To See
- A BS Degree in Computer Science, Engineering or related field or equivalent experience in a technical product marketing role; a master’s degree preferred
- 6+ years of experience in LLM, AI/ML development in an engineering role followed by 5+ years of experience in product management or technical product marketing of AI/ML products
- Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX) and inference‑specific frameworks & optimizations (Dynamo, Triton Inference Server, TensorRT‑LLM, vLLM, SGLang)
- Market awareness – experience conducting technical competitive analysis and synthesizing key insights
- Collaboration & influence – proven ability to work cross‑functionally across engineering, product management, sales and marketing teams
- Strong communication, asset creation & storytelling – ability to translate sophisticated technical concepts into clear, compelling narratives for both technical and business audiences
- Ability to present to executive audiences
Ways To Stand Out From The Crowd
- Hands‑on experience with AI inferencing workflows using NVIDIA or open‑source serving frameworks running on accelerated computing in the data center
- Experience developing LLM models
- Experience working with hyperscale cloud providers
- Hands‑on technical competence – background in software development, AI infrastructure, data center silicon
- Demonstrated ability to engage with executive leadership and external partners
- Published technical content or speaking experience at industry events
- Have a portfolio of published marketing/launch assets
NVIDIA is widely considered to be one of high technology's most desirable employers. We have some of the most forward‑thinking and hardworking people in the world working for us. Our goal is to craft an environment where you can do your life’s best work. If you’re creative, self‑motivated and autonomous, we want to hear from you!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is $144,000 – $230,000 for Level 4 and $184,000 – $287,500 for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until September 28, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-Ljbffr