Senior Site Reliability Engineer
Semios
- Vancouver, BC
- $120,000-$140,000 per year
- Permanent
- Full-time
- AgTech Breakthrough – Smart Irrigation Company & Pest Management Solution of the Year
- Thrive Top 50
- Google for Startups Accelerator Cohort
- Global Cleantech Top 100
- Lead the delivery of infrastructure projects.
- Plan and perform higher-risk maintenance.
- Contribute to resolving incidents and participate in an on-call roster.
- Work with product and software development colleagues to improve the resiliency and reliability of our products.
- Mentor team members in all aspects of SRE work.
- Manage your productivity and workload in a work-from-home environment.
- Use a data-driven approach to identify changes to the product architecture to improve reliability, performance, and availability.
- Fully understand production environments and the end-to-end delivery process.
- Identify parts of the system that do not scale and drive solutions for these problem areas.
- Maintain and improve Service Level Indicators (SLI) that align with availability and performance targets.
- Build quality into the team's work by encouraging refactoring, testing, and breaking up the team’s work into small, releasable pieces.
- Have good knowledge of Linux and bash or similar.
- Be versed in the delivery of a SaaS product on AWS, GCP, or Azure.
- Have strong programming skills (Ruby, Python, Go, etc.).
- Be competent with Terraform or similar Infrastructure as Code (IaC) tools.
- Have experience with Docker, Kubernetes, EKS, or similar technologies.
- Have experience with CI/CD pipelines on Buildkite or similar platforms.
- Be familiar with building delivery pipelines with Buildkite or similar.
- Have strong version control skills with Git.
- Be experienced with increasing monitoring and observability using Datadog or similar tools (New Relic, Splunk, etc.).
- Have the desire to document and/or automate to reduce repetitive tasks.
- Enjoy delivering quickly and iterating fast.
- 5+ years of relevant experience in DevOps, SRE, or infrastructure engineering roles.
- At least 2–3 years in a senior or lead capacity, with demonstrated ownership of critical systems and mentoring responsibilities.
- Hands-on experience with modern cloud environments (AWS, GCP, or Azure), including deployment, scaling, monitoring, and cost optimization of SaaS applications.
- Proven experience implementing and managing observability stacks (e.g., Datadog, Prometheus, New Relic, Splunk) and driving improvements to SLIs/SLOs.
- Experience in incident management, including participation in on-call rotations and leading post-incident reviews with a focus on continuous improvement.
- Purposeful Work: Make a global impact by advancing sustainable food production.
- Our People: Work with a fun, collaborative, and supportive team.
- Recharge: Generous vacation policy, company-paid holidays and year-end winter break.
- Work Flexibility: Hybrid working arrangements and strong work-life balance culture.
- Prioritize Your Well-Being: Access comprehensive health plans designed to support your physical and mental health.
- Group RRSP, which includes a 3% company-paid match after three months of employment.
- Office location that is convenient via transit and bike paths.
We are sorry, but this recruiter does not accept applications from abroad.