
Nvidia Senior Director Enterprise AI Factories Deployment 
United States, California 
707318513

Yesterday
US, CA, Santa Clara
Time type: Full time
Posted: 7 days ago

What you'll be doing:

  • Work with NVIDIA Enterprise customers on large data center GPU server and networking system deployments as a Solution Architect leader. Guide customer discussions on network design and compute/storage, and support bring-up of server/network/cluster deployments. You will need to visit customer data centers during the bring-up phase.

  • Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical advisor to NVIDIA's strategic customers. Bring customer-specific requirements to product teams to guide product roadmap features.

  • Identify new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the GPU/Network Systems Engineering, Product Management, and Sales teams.

  • Act as a trusted customer advisor, conducting regular technical customer meetings covering product roadmap, cluster issue debugging, feature discussions, and introductions to new technology solutions.

  • Help the team build custom product demonstrations and POCs for solutions that address critical business needs of our customers.

  • Hire and build an elite team of AI Infrastructure architects.

  • We make extensive use of conferencing tools, but travel is required for on-site visits to customers and industry events.

What we need to see:

  • This role is for a leader with the motivation and skills to drive the data center engineering process. The ideal candidate has 18+ years of overall Systems/Solution Engineering (or similar engineering role) experience and is well versed in data center design and operations. Strong hands-on experience is required.

  • 8+ years of management experience.

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • System-level expertise in CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers

  • Experience with networking switches for Ethernet/InfiniBand, and Data Center infrastructure (power/cooling)

  • Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes

  • Effective time management and the ability to balance multiple tasks

  • Strong verbal/written communication skills, able to share your ideas and code clearly through documents, presentations, etc.

  • Strong operational understanding of large-scale data center infrastructure including servers, networking, power, cooling systems, and high-performance computing environments.

  • Hands-on leadership style — capable of both high-level program management and direct problem-solving at the operational level.

  • Experience managing multi-functional, multi-regional, and supplier-aligned teams with strong influence across technical and operational groups.

Ways to stand out from the crowd:

  • Knowledge of and hands-on experience with NVIDIA hardware platforms (DGX, HGX, GB200, GB300, NVLink, InfiniBand). Experience with bring-up and deployment of large clusters.

  • Experience scaling hardware operations in hyperscale, AI, or high-performance computing (HPC) environments.

  • External customer-facing background

You will also be eligible for equity and .