Member of Technical Staff, AI Models Research
Advanced Micro Devices
- Markham, ON
- Permanent
- Full-time
1. Drive Innovation: Work on cutting-edge projects and collaborate with a team of highly skilled industry specialists. Push the boundaries of AI model optimization and application and contribute to the advancement of AI technology.
2. Optimize AI Models: Leverage quantization, sparsity, and architecture search methods to optimize and enhance the performance, efficiency, and accuracy of Generative AI models. Unlock the full potential of AI technology and enable groundbreaking solutions.
3. Collaborate with Experts: Collaborate closely with software engineers, data scientists, and researchers to integrate AI models into software applications and platforms. Learn from the best in the field and work together to achieve seamless integration and optimal performance.
4. Stay Ahead of the Curve: Stay updated with the latest advancements in AI algorithms, frameworks, and technologies. Continuously explore new techniques and approaches to stay at the forefront of AI model development and optimization.
By joining our team, you will be part of a collaborative environment that fosters innovation and encourages professional growth. You will have access to cutting-edge technology, training programs, mentorship opportunities, and a clear career progression path.
If you are passionate about AI models and possess the skills to drive innovation and optimization, we invite you to join our team at AMD. Apply now and be at the forefront of the AI revolution!
KEY RESPONSIBILITIES:
- Research, design, and implement novel methods for efficient Generative AI training and inference
- Design algorithmic model optimization and training methods, including quantization, sparsity, and innovative parallelism strategies.
- Innovate and publish your work.
- Collaborate with other team members and teams.
- Project experience with generative tasks. Familiarity with deep learning frameworks, e.g., PyTorch/JAX.
- Experience working with linear scaling of large language models and training at scale
- Project experience in model compression, quantization, and end-to-end inference optimization.
- Strong knowledge of artificial intelligence, deep learning or machine learning.
- Strong coding skills in Python are required.
- Experience with inference frameworks such as vLLM or SGLang is a plus.
- Additional plus if recent publications include conferences such as NeurIPS, CVPR, ECCV/ICCV, ICML, ICLR, etc.
- Experience with any of the following is also a plus: LLMs, Large Multi-modal models, 3D World Models, VLA, Agentic workflows.
- A PhD or master’s degree in artificial intelligence, machine learning, or a related field.
- Markham, Ontario (Hybrid)