Share
What you'll be doing:
Working with NVIDIA Enterprises customers on large data center GPU server and networking system deployments as Solution Architect leader. Guide customer discussions on network design, compute/storage and support bring up ofserver/network/clusterdeployments. You will need to visit customer data center during bring up phase.
Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical advisor to NVIDIA's strategic customers. Bring customer-specific requirements to product teams to guide product roadmap features.
Identify new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the GPU/Network Systems Engineering, Product management and Sales teams
Work as customer trusted advisor conducting regular technical customer meetings for product roadmap, cluster issues debug, feature discussions and introduction to new technology solutions
Help team in building custom product demonstrations and POCs for solutions that address critical business needs of our customers
Hiring and building an elite team of AI Infrastructure architects.
We make extensive use of conferencing tools, but travel is required for on-site visit to customers and industry events.
What we need to see:
This role is for a leader with the motivation and skills to drive the data center engineering process. Ideal candidate has 18+ overall years of Systems/Solution Engineering (or similar Engineering roles) experience, well versed with data center design and operations. Strong hands-on experience is required.
8+ years of management experience.
Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).
System level expertise of CPU/GPU server architecture, NICs, Linux, system software and kernel drivers
Experience with networking switches for Ethernet/Infiniband, and Data Center infrastructure (power/cooling)
Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes
Effective time management and capable of balancing multiple tasks
Strong verbal/written communication skills, able to share your ideas and code clearly through documents, presentations, etc
Strong operational understanding of large-scale data center infrastructure including servers, networking, power, cooling systems, and high-performance computing environments.
Hands-on leadership style — capable of both high-level program management and direct problem-solving at the operational level.
Experience managing multi-functional, multi-regional, and supplier-aligned teams with strong influence across technical and operational groups.
Ways to stand out from the crowd:
Knowledge and hands-on experience with NVIDIA hardware platforms (DGX, HGX, GB200, GB300, NVLink, InfiniBand). Experience with bringup and deployment of large clusters
Experience scaling hardware operations in hyperscale, AI, or high-performance computing (HPC) environments.
External customer facing background
You will also be eligible for equity and .
These jobs might be a good fit