Senior Software Engineer, Machine Learning Infrastructure
Thumbtack
- Toronto, ON
- Permanent
- Full-time
- Collaborate with engineers, applied scientists and product managers to identify shared ML infrastructure needs across areas like feature engineering, model experimentation, model inference & CI/CD, generative AI, and model monitoring.
- Build, maintain, and communicate our roadmap.
- Centralize and standardize ML infrastructure & associated best practices for product teams across engineering.
- When appropriate, experiment with and introduce next-generation ML infrastructure capabilities and frameworks so product teams can continue to harness the power of open source / vendor-driven advances in machine learning.
- Drive projects to completion with a tenacious focus on the business impact of those projects.
- Solve tough technical problems and stay up-to-date with the latest advances in this constantly evolving problem space.
- 5+ years of industry experience in engineering.
- 2+ years of industry experience working on machine learning modeling or infrastructure.
- You're fluent in at least one major programming language and would be able to switch between multiple languages. In our stack, we use Go and Python most heavily.
- You have experience building software on top of relational databases such as Postgres or MySQL.
- You can break down complex problems rigorously and understand the tradeoffs necessary to deliver great, impactful products.
- You're curious, you're data-driven, you love to ask questions, and you think critically about problems.
- You love delivering value to your users and your teammates through your work.
- You have experience building and evolving machine learning infrastructure.
- You have worked with frameworks like PyTorch, Tensorflow, Scikit-learn, and Airflow
- You have experience with generative AI infrastructure and vendors
- You have experience building and maintaining reliable, performant distributed systems.
- You're familiar working with major cloud providers and/or the big data ecosystem (Amazon Web Services, Google Cloud Platform, Spark, etc).
- You've demonstrated your ability to thrive in a fast-paced startup environment.