Expoint – all jobs in one place

NVIDIA Solutions Architect, Generative AI
United States, California 
50533908

Today
Location: US, CA, Santa Clara
Time type: Full time
Posted on: Posted Yesterday
Job requisition ID:


What you will be doing:

  • Serve as the primary pre- and post-sales technical domain expert for partners, embedding deeply with them to design and deploy Generative AI solutions. Maintain strong relationships with leadership and technical teams to drive adoption and successful utilization of NVIDIA GenAI platforms.

  • Accelerate partner and customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on standard methodologies for scaling solutions to production.

  • Define the scope, success metrics, and evaluation criteria for partner-led customer projects, ensuring they are built on standardized and reproducible GPU-accelerated workflows.

  • Enable strategic partners to launch their own Professional Services and platforms by tailoring NVIDIA agentic AI blueprints for high-impact customer workloads. You will proactively find opportunities to drive deeper adoption and utilization of NVIDIA's Generative AI products.

  • Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.

What we need to see:

  • MSc or PhD in Computer Science, Electrical Engineering, Software Engineering, ML Engineering, or a related field (or equivalent experience).

  • 5+ years of relevant work experience developing and deploying AI models at scale as a software engineer or deep learning engineer.

  • Consistent track record of building enterprise-grade agentic AI systems using open-source models, and a solid foundation in deep learning, with a particular emphasis on generative models.

  • Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, CrewAI, AutoGen) and with evaluation and observability platforms. Comfortable building prototypes or proofs of concept.

  • Strong software development skills and proficiency in Python, C++, and deep learning frameworks (PyTorch or TensorFlow).

  • Excellent communication and presentation skills to collaborate effectively with internal executives, partners, and customers.

Ways to stand out from the crowd:

  • Demonstrated expertise and hands-on experience with NVIDIA AI platforms.

  • Understanding of different advanced agent architectures and emerging communication protocols (MCP or Google A2A).

  • Excellent practical knowledge of Generative AI and LLM development, including the ability to train GPT and Megatron models.

  • Understanding of MLOps life cycle management and experience with LLMOps workflows.

  • Experience with CUDA programming and with benchmarking and analyzing the performance of foundation models.

You will also be eligible for equity and benefits.