
Software System Design Eng
- Markham, ON
- Permanent
- Full-time
- Develop and implement rigorous QA methodologies for virtualization platforms, hypervisors, GPU virtualization, SR-IOV, and pass-through technology implementations.
- Design test plans to verify virtualization features across compute, host drivers, networking, and GPU-accelerated workloads.
- Validate GPU virtualization capabilities on AMD GPUs and ensure seamless integration within virtual desktop infrastructure (VDI) and AI/ML applications.
- Validate SR-IOV and pass-through technologies, ensuring efficient I/O performance and robust isolation for critical network and peripheral resources.
- Collaborate with engineering, product, architecture, and program management teams to synchronize QA activities with product roadmaps and to align testing with emerging AMD GPUs and advanced I/O features.
- Lead validation and automation initiatives using both established tools and custom frameworks to cover functional, regression, and performance tests, with a dedicated focus on GPU virtualization test cases, SR-IOV, and pass-through scenarios.
- Oversee the detection, triaging, debugging, and resolution of defects spanning virtualization, GPU acceleration, and advanced I/O components, ensuring prompt and effective problem resolution.
- Partner with development, architecture, and product teams to integrate virtualization and GPU features, while providing proactive insights on SR-IOV and pass-through.
- Continually research emerging trends in virtualization, GPU acceleration, and I/O technologies to propose refined testing strategies and improvements in CI/CD pipelines.
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field, with 8+ years of related experience.
- Demonstrated understanding of platforms such as VMware vSphere, Hyper-V, KVM, Xen, and GPU virtualization technologies.
- Hands-on experience with Linux platforms and with virtualization platforms such as any KVM-based hypervisor, OpenStack, or ESXi.
- Good understanding of and experience with virtualization technologies such as KVM, QEMU, libvirt, and VirtIO.
- Solid background in leading quality assurance initiatives and guiding teams in complex, technical environments.
- Advanced skills in Python, PowerShell, or Bash to design custom test scripts and automation solutions.
- Experience in incorporating QA processes within CI/CD pipelines using tools like Jenkins, GitLab CI/CD, or their equivalents.
- Competence in diagnosing and resolving technical issues across diverse virtualized environments, including those involving advanced I/O technologies.
- Familiarity with Docker, Kubernetes, and cloud-native infrastructures, as well as virtualization services from AWS, Azure, or GCP.
- Prior experience with GPU performance tuning, virtualization configuration, and benchmarking, along with in-depth exposure to SR-IOV setups and pass-through configurations.
- Familiarity with various embedded and GPU hardware platforms and a wide variety of operating system variants (Linux and Windows).
- Familiarity with AI/ML frameworks, models, and data center tools.
- Strong analytical and problem-solving skills, with an ability to debug and resolve complex issues in software systems.
- Familiarity with one or more scripting languages (Bash, Perl, Python, etc.).