
Staff Software Engineer, Agentic Applications - Grafana Ops, AI/ML | Canada | Remote
- Canada
- Permanent
- Full-time
Support incident response by querying telemetry and other analysis tools to investigate alerts and suggest actions.
Evaluate overall infrastructure and/or observability quality and help automate meaningful improvements.
Expand the capabilities of agents across the observability stack.What You'll Be Doing:
- Design and evolve systems that use intelligent agents that assist users in detecting, triaging, and resolving incidents using observability data and tools.
- Build end-to-end workflows for incident lifecycle management, powered by LLMs or agentic technologies.
- Implement agentic systems to automate bulk workloads (such as large-scale analyses and report generation), suggest improvements, and surface likely root causes of failures.
- Integrate agentic components with internal tools, runbooks, alerting systems, incident platforms, and development environments.
- Experience working in environments with rapid iteration and experimental development.
- A pragmatic mindset that values developer experience, reproducibility, and thoughtful trade-offs when scaling GenAI systems.
- A passion for minimizing human toil and building AI systems that actively support engineers.
- Build APIs and orchestration logic for chaining agent actions into multi-step workflows.
- Integration of agentic applications with existing platforms and third-party tools.
- Experience with multi-agent systems is a plus.
- 100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
- Scaling Organization - Tackle meaningful work in a high-growth, ever-evolving environment.
- Transparent Communication - Expect open decision-making and regular company-wide updates.
- Innovation-Driven - Autonomy and support to ship great work and try new things.
- Open Source Roots - Built on community-driven values that shape how we work.
- Empowered Teams - High trust, low ego culture that values outcomes over optics.
- Career Growth Pathways - Defined opportunities to grow and develop your career.
- Approachable Leadership - Transparent execs who are involved, visible, and human.
- Passionate People - Join a team of smart, supportive folks who care deeply about what they do.
- In-Person onboarding - We want you to thrive from day 1 with your fellow new 'Grafanistas' to learn all about what we do and how we do it.
- Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. *We will comply with local legislation where applicable.