Expoint – all jobs in one place
The point where experts and best companies meet
Limitless High-tech career opportunities - Expoint

Nvidia Senior Solutions Architect HPC Generative AI Deployment 
United States, Texas 
239541918

28.07.2025
US, CA, Santa Clara
US, IL, Remote
US, CA, Remote
US, NY, Remote
US, MA, Remote
time type
Full time
posted on
Posted 2 Days Ago
job requisition id

What You Will Be Doing

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, and data scientists, gaining experience across a range of technical areas

  • Strategically partnering with lighthouse customers and researchers to help them adopt and build creative solutions using NVIDIA technology

  • Analyzing performance and power efficiency of AI workloads on Kubernetes

  • Some travel to conferences and customers is required for this role

What We Need To See

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 8+ years of hands-on experience with accelerated computing and deep Learning frameworks such as PyTorch

  • Experience porting and/or optimizing scientific applications targeting GPUs

  • Strong fundamentals in programming and software design, especially in Python and C++

  • Experience with containerization and orchestration technologies, monitoring, and observability solutions for AI deployments

  • Excellent knowledge of theory and practice of AI at scale

  • Excellent presentation, communication and collaboration skills

Ways To Stand Out From The Crowd

  • Experience with NVIDIA GPUs and parallel programming libraries, such as CUDA, OpenMP, OpenACC, communication libraries and runtime (MPI, NCCL, UCX, NVSHMEM)

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Experience working with academic research community supporting HPC or AI

  • Familiarity with distributed computing platforms, containers and scheduling tools

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

You will also be eligible for equity and .