Software Development Engineer – Software Dev Ops & Continuous Integration Team
Advanced Micro Devices View all jobs
- Markham, ON
- Permanent
- Full-time
- CI/CD Pipeline Development: Design, implement, and manage efficient continuous integration and delivery pipelines using Buildkite, GitHub Actions, and Jenkins to enable rapid and reliable software deployment.
- Kubernetes Infrastructure Management: Deploy and maintain robust Kubernetes-based environments across both on-premise and cloud platforms to support scalable service orchestration.
- Infrastructure Automation: Automate provisioning, configuration, and management of infrastructure using Ansible, Python, and Bash to improve system consistency and reduce manual intervention.
- Service Deployment with Helm: Administer application and service deployment in Kubernetes using Helm charts for consistent and repeatable release processes.
- GPU Server Support: Configure, manage, and maintain GPU-based compute environments including lifecycle automation and hardware-level test integration.
- Database and Observability Integration: Interact with MySQL databases to support dynamic data updates and integrate data sources into Grafana dashboards for monitoring and insights.
- Cross-Functional Collaboration: Work closely with development teams, SREs, and project stakeholders to ensure system-level alignment and high-impact delivery.
- Quality Assurance Enablement: Integrate automated testing frameworks into CI pipelines to ensure code quality, stability, and performance across development cycles.
- DevOps Tools & Automation: Proficient with Buildkite, GitHub Actions, Jenkins, Ansible, and scripting languages like Python and Bash for streamlining DevOps workflows.
- Containerization & Orchestration: Strong experience with Docker, Kubernetes, and Helm for deploying and managing scalable, containerized applications.
- Infrastructure as Code (IaC): Hands-on experience automating infrastructure provisioning and configuration to ensure reproducibility and scalability across environments.
- GPU-Based Compute Environments: Familiarity with GPU server lifecycle management and integration of GPU resources into CI test workflows for performance-critical applications.
- Monitoring & Observability: Experience using tools like Checkmk, Prometheus, and Grafana to monitor infrastructure health and application performance.
- Version Control & Collaboration: Advanced knowledge of Git-based version control, including branching strategies and CI/CD integration for collaborative development.
- Linux & System Administration: Solid background in Linux environments, including shell scripting and system-level troubleshooting across distributed systems.
- Agile & Cross-Disciplinary Collaboration: Comfort working in Agile teams and partnering with software, infrastructure, and product teams to drive consistent delivery and innovation.
- Bachelor's or master's degree in computer/software engineering, Computer Science, or related technical discipline