Site Reliability Engineer
McCain Foods Voir toutes les offres
- Toronto, ON
- 102.700-137.000 $ par an
- Permanent
- Temps-plein
- Architect, design, and implement reliable and scalable systems in Azure cloud.
- Instrument distributed systems using OpenTelemetry for logs, metrics, and traces — embedding observability into the codebase across microservices and critical applications.
- Drive SAP observability strategy as we migrate to SAP RISE — integrating New Relic with SAP to provide full-stack visibility, performance insights, and business-critical alerting.
- Automate infrastructure and operations at scale using Infrastructure as Code (Terraform or Bicep), CI/CD pipelines, and self-healing systems to reduce manual toil.
- Collaborate with developers and platform teams to define and implement SLOs, SLIs, and Error Budgets, and embed SRE practices across product teams.
- Lead incident response and root cause analysis, building a blameless culture while hardening systems against future failures.
- Contribute to the evolution of McCain’s SRE playbooks, tooling, and engineering standards as a founding member of the global SRE practice.
- Bachelor's Degree in related field, such as Computer Science or related technical field
- 7+ years of software engineering experience, including at least 5 year working experience as a Site Reliability Engineer accountable for SLOs.
- Experience with deployment and development on Azure
- Experience in Continuous Delivery methodologies and tools.
- Good knowledge on resiliency patterns and cloud security
- Experience troubleshooting issues with users and ability to collaborate effectively with cross-functional teams.
- Any certifications on Azure presferred
- Resilience: Services meet or exceed availability targets across mission-critical systems (including SAP).
- Observability: End-to-end visibility with actionable dashboards and alerts.
- Automation: Manual tasks are eliminated via tooling and scripts; infrastructure is code-first.
- Influence: SRE principles are embedded into our engineering culture, with you as a key driver.
- Trust: Stakeholders see you as the go-to expert for system reliability and design best practices.
Division: Global Digital Technology
Department: Dev SecOps and SR
Location(s): CA - Canada : Ontario : Toronto || CA - Canada : New Brunswick : Florenceville-BristolCompany: McCain Foods (Canada)