Job responsibilities
- Leads technology and process implementations to achieve functional technology objectives
- Demonstrates expertise in site reliability principles and balances features, efficiency, and stability.
- Effectively negotiates with peers and executive partners to ensure optimal outcomes for all.
- Drives the adoption of site reliability practices throughout the organization.
- Ensures teams demonstrate site reliability best practices, empirically proven through stability and reliability metrics.
- Drives a culture of continual improvement and solicits real-time feedback to enhance the customer experience.
- Delivers technical solutions that can be leveraged across multiple businesses and domains
- Provides personalized coaching for entry to mid-level team members.
- Ensures team documents and shares knowledge and innovations via internal forums, communities of practice, guilds, and conferences.
- Employs AI-driven solutions to streamline processes and enhance operational efficiency.
- Leverages AI tools to enhance operational effectiveness and automate processes, ensuring high-quality customer service.
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 10+ years applied experience. In addition, 5+ years of experience leading technologists to manage, anticipate and solve complex technical items within your domain of expertise
- Experience developing or leading cross-functional teams of technologists
- Experience with hiring, developing, and recognizing talent
- Experience leading a product as a Product Owner or Product Manager
- Practical cloud native experience
- Expertise in Computer Science, Computer Engineering, Mathematics, or a related technical field
- Provides personalized coaching for entry to mid-level team members.
- Ensures team documents and shares knowledge and innovations via internal forums, communities of practice, guilds, and conferences.
Preferred qualifications, capabilities, and skills
- Demonstrates an understanding of the fine balance between features, efficiency, and stability.
- Active experience or deep curiosity in applying AI to operations—such as LLM-based copilots, anomaly detection, automated runbooks, autonomous agents (e.g. CrewAI, LangGraph), or Retrieval-Augmented Generation (RAG) workflows for support.