NVIDIA Senior Solutions Architect, Generative AI
United States, Texas 
741758951

28.07.2025
US, CA, Santa Clara
US, CA, Remote
Time type: Full time
Posted on: Posted 2 Days Ago
Job requisition ID:

What you’ll be doing:

  • Collaborating closely with customers to improve their workload performance and reduce infrastructure costs.

  • Leading and developing proof-of-concepts for AI solutions applied to the Consumer Internet industry, including areas like LLMs and recommenders, and building collateral (notebook/code) as needed.

  • Developing and debugging software for NVIDIA and open-source AI frameworks and libraries.

  • Partnering with NVIDIA’s software engineering, product, and sales teams to secure design wins and drive the development of innovative solutions based on customer feedback.

What we need to see:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields, or equivalent experience.

  • 8+ years of experience as an AI/Software Engineer with a proven track record of coding in Python and/or C++ with popular AI software libraries and GPUs.

  • Experience with profiling and optimizing model training/inference performance on GPUs.

  • Experience developing and optimizing GPU kernels for deep learning, with a focus on GEMM and attention kernels.

  • Strong communication skills, with the ability to clearly convey ideas and code through GitHub, documentation, and presentations.

  • A great teammate who enjoys collaborating with cross-functional teams including Engineering, Research, Sales, Product, and Marketing.

  • Self-starter with a passion for growth, continuous learning, and sharing insights with the team.

Ways to stand out from the crowd:

  • Full-stack experience from the DL framework level (such as PyTorch/JAX) to lower levels (such as CUDA/CUTLASS/cuDNN/NCCL).

  • Experience working with enterprise developers and strong customer-facing skills.

  • Familiarity with MLOps technologies such as containers, Kubernetes, and data center deployments.

  • Experience with large-scale production data pipelines and AI model training/deployment.

  • Creative problem-solving skills for debugging and resolving complex issues.

You will also be eligible for equity and benefits.