
Senior Software Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML | Canada | Remote
- Canada
- Permanent
- Full-time
- Design and implement robust evaluation frameworks for GenAI and LLM-based systems, including golden test sets, regression tracking, LLM-as-judge methods, and structured output verification.
- Develop tooling to enable automated, low-friction evaluation of model outputs, prompts, and agent behaviors.
- Define and refine metrics for both structure and semantics, ensuring alignment with realistic use cases and operational constraints.
- Lead the development of dataset management processes and guide teams across Grafana in best practices for GenAI evaluation.
- Experience designing and implementing evaluation frameworks for AI/ML systems.
- Familiarity with prompt engineering, structured output evaluation, and context-window management in LLM systems.
- High autonomy to collaborate and translate team goals into clear, testable criteria supported by effective tooling.
- Experience working in environments with rapid iteration and experimental development.
- A pragmatic mindset that values reproducibility, developer experience, and thoughtful trade-offs when scaling GenAI systems.
- A passion for minimizing human toil and building AI systems that actively support engineers.
- Scaling Organization - Tackle meaningful work in a high-growth, ever-evolving environment.
- Remote-First, Global Culture - Work from anywhere with collaboration at the heart of everything we do.
- Transparent Communication - Expect open decision-making and regular company-wide updates.
- Innovation-Driven - Autonomy and support to ship great work and try new things.
- Open Source Roots - Built on community-driven values that shape how we work.
- Empowered Teams - High trust, low ego culture that values outcomes over optics.
- Career Growth Pathways - Defined opportunities to grow and develop your career.
- Approachable Leadership - Transparent execs who are involved, visible, and human.
- Passionate People - Join a team of smart, supportive folks who care deeply about what they do.
- In-Person onboarding - We want you to thrive from day 1 with your fellow new 'Grafanistas' to learn all about what we do and how we do it.
- Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect. *We will comply with local legislation where applicable.