Head of Infrastructure
BettingJobs View all jobs
- Canada
- Permanent
- Full-time
- Design, build, and operate Azure cloud infrastructure supporting low-latency integrations with operator systems and external exchanges.
- Run Kubernetes (cluster sizing, autoscaling, multi-environment deployments).
- Stand up and optimize Kafka (event streaming), Redis (caching), and Postgres (OLTP).
- Implement CI/CD, secrets management, environment isolation, and infrastructure-as-code (Terraform/Bicep).
- Own observability (Prometheus, Grafana, OpenTelemetry), incident response, SLOs/SLIs, and on-call practices.
- Design and manage secure cloud networking (VNets, peering, private endpoints, DNS, firewall/NSGs) and connectivity with the operator.
- Drive security and compliance across identity, segmentation, secrets, and OS baselines in Azure.
- Lead performance engineering for market data ingestion and order routing.
- Partner with engineering on service boundaries, data contracts, and platform primitives.
- Manage cost, capacity, reliability, and availability while scaling the infrastructure function.
- 8+ years operating production infrastructure at scale.
- Deep cloud expertise (Azure preferred; AWS/GCP backgrounds welcome).
- Hands-on production experience with Kubernetes, Kafka, Redis, and Postgres.
- Strong networking, security, identity (e.g., Azure AD), and infrastructure-as-code skills.
- Proven ability to build reliable, observable platforms with strong CI/CD; experience with Prometheus, Grafana, OpenTelemetry, or similar.
- Cloud networking fundamentals (VNet design, private endpoints, DNS, firewall rules).
- Performance tuning for latency- and throughput-sensitive systems.
- Strong collaboration skills translating product/trading needs into platform capabilities.
- Active, hands-on use of AI tooling (infra-as-code generation, log analysis, automation, incident triage).
- Experience in trading, sports betting, exchanges, financial markets, or other real-time systems.
- Event-driven architecture or stream processing experience.
- SRE leadership or reliability program experience.