Principal Engineer - AI Framework Integration
Advanced Micro Devices
- Toronto, ON; Waterloo, ON
- Permanent
- Full-time
- ROCm & Framework Integration Leadership: Define and own integration strategies between ROCm libraries (HIP, CK, MIOpen) and AI frameworks. Serve as the primary technical point of contact for framework teams on library behavior, performance characteristics, and integration constraints.
- Performance & Quality Standards: Establish performance baselines, regression criteria, and validation expectations for framework‑facing library functionality. Partner with performance, tooling, and QA teams to ensure issues are detected early and addressed at the correct layer.
- Cross‑Team Technical Alignment: Drive alignment across Libraries, Frameworks, Compiler, and DevTools teams on integration priorities and sequencing. Translate framework requirements into actionable library‑level work and provide clear technical feedback upstream.
- Debugging & Escalation: Act as a senior technical escalation point for complex cross‑layer issues involving kernels, runtimes, compilers, and frameworks. Guide root‑cause analysis across organizational boundaries and ensure issues are resolved holistically.
- Developer Experience & Adoption: Advocate for integration patterns that improve usability, debuggability, and out‑of‑the‑box performance for ROCm users. Capture and promote best practices through documentation, design reviews, and mentoring.
- AI‑Assisted Engineering Practices: Leverage AI‑assisted development tools to accelerate analysis, integration validation, and documentation, while ensuring correctness and traceability.
- Deep experience with GPU software stacks, including libraries, runtimes, and performance‑critical code paths. Hands‑on familiarity with ROCm and its core components (HIP, Composable Kernel, MIOpen, hipDNN) or equivalent GPU platforms.
- Practical experience working with AI frameworks such as PyTorch, Triton, or JAX, including operator integration or backend enablement. Proven ability to debug and resolve cross‑layer issues spanning libraries, compilers, runtimes, and frameworks.
- Strong programming experience in C/C++ and Python, with comfort reading and reasoning about complex codebases. Experience defining integration standards, performance criteria, and validation strategies across teams.
- Demonstrated technical leadership in large, cross‑functional engineering environments.
- Advanced degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience.
- Advanced degrees, such as M.Sc., M.Eng., or Ph.D., are preferred.