Software Engineering Manager (Inference/Token/Gen AI)
Nutanix
- Vancouver, BC
- $249,600 per year
- Permanent
- Full-time
- Lead the design and implementation of scalable AI Factory solutions across diverse enterprise environments.
- Collaborate with cross-functional engineering and product teams to define and enhance the product roadmap.
- Create and maintain reference architectures and operational models to ensure continuous delivery and operational excellence.
- Drive hands-on development of production-grade platforms utilizing Kubernetes for container orchestration.
- Establish high-performance inference pipelines and AI Gateways, ensuring optimal performance and scalability.
- Translate market trends into actionable engineering objectives, prioritizing real-world enterprise applications.
- Mentor and guide engineering teams, fostering a culture of innovation, collaboration, and knowledge sharing.
- Set strategic goals for the first year, focusing on building foundational AI capabilities and driving customer satisfaction.
- Proven experience in building and scaling production-grade platforms on Kubernetes (EKS, AKS, On-Prem).
- Direct experience in developing high-performance inference pipelines and AI Gateways.
- Strong understanding of Distributed Systems Architecture, including multi-cluster and hybrid-cloud management.
- Expertise in creating solutions tailored for AI practitioners with a focus on UX and security.
- Strategic mindset with the ability to navigate market shifts and translate them into actionable engineering roadmaps.
- Advanced knowledge of GPU resource optimization and automated fine-tuning techniques.
- MS or PhD in Computer Science with a focus on Distributed Systems or Machine Learning preferred.
- Exceptional communication skills and experience interacting with C-level executives and cross-functional teams.