
Intermediate Software Engineer SRE - AI
- Mississauga, ON
- $109,000-118,000 per year
- Permanent
- Full-time
- Build ML-based anomaly detection and pattern recognition systems.
- Enhance telemetry with smart tagging and metadata for better AI insights.
- Develop event-driven workflows and self-healing systems using AI triggers.
- Automate incident response with generative AI and custom AI agent orchestration.
- Use time-series forecasting and predictive modelling to anticipate failures.
- Optimise infrastructure with AI-powered autoscaling and cost-aware resource allocation.
- Build scalable, fault-tolerant systems in a cloud-native environment.
- Participate in on-call rotations and lead incident response for critical systems.
- Skilled in API integration for streamlined data exchange and system connectivity.
- Run internal AIOps workshops and help teams adopt AI maturity models.
- Champion responsible AI practices and ethical automation.
- Languages: Python, Java, Bash, Terraform
- Platforms: Azure, Kubernetes, Docker
- Tools: Datadog, Prometheus, AppDynamics, ELK, GitHub Actions
- ML/AI: MCP framework, AI agents, Vector store, Agent orchestration (LangChain), RAG
- CI/CD: Jenkins, ArgoCD, Spinnaker
- Databases: SQL Server, PostgreSQL, MySQL
- 5+ years experience in software engineering.
- Experience with SRE principles.
- Experience with AI/ML in production environments
- A passion for automation, intelligent systems, and operational excellence
- Strong debugging, problem-solving, and system design skills
- Experience with AIOps platforms.
- Contributions to open-source or AI communities
- Familiarity with Responsible AI frameworks
- Participation in AI hackathons or conferences