
Kubernetes/DevOps System Administrator
- Vancouver, BC
- $65,000-$130,000 per year
- Permanent
- Full-time
- Deploy, manage, and maintain highly available Kubernetes clusters (on-prem and/or cloud).
- Automate cluster provisioning, upgrades, patching, and scaling.
- Configure and manage networking, storage, ingress, and namespaces within Kubernetes.
- Implement and maintain monitoring, logging, and alerting solutions for clusters and workloads.
- Troubleshoot and resolve issues related to pods, nodes, control plane components, and application deployments.
- Document infrastructure, configurations, and operational processes.
- Participate in on-call rotation and incident response processes.
- "Automate all the things"
- Assist in the design (major or incremental) of Kubernetes infrastructure
- Assist in the provisioning of Kubernetes infrastructure
- Identify, troubleshoot, diagnose, and correct system-related issues as they arise
- Monitor and review dashboards, logs, and critical alerts to ensure system issues are identified, escalated, and addressed as quickly as possible
- Provide ongoing operational support and work closely with team members on issue resolution
- Manage and administer incident, problem, and service request queues
- Develop and maintain system use guides, run books, and other documentation as needed
- Evaluate and determine opportunities for automation and implementation
- 3+ years of hands-on experience managing Kubernetes clusters in production
- Strong understanding of Kubernetes internals (control plane, scheduler, kubelet, etc.)
- Experience with on-premises Kubernetes (e.g., kubeadm, RKE, OpenShift) and/or managed services (EKS, GKE, AKS)
- Proficiency in Linux systems administration and containerization (Docker)
- Enjoys automating away manual tasks using scripting (Bash, Python, etc.) and configuration management tools such as Ansible
- Comfortable digging through open source code to find answers
- Ability to analyze complex systems and problems and express them in simple terms
- Experience in troubleshooting networking issues
- Experience working in Agile-based teams
- A problem solver who takes initiative and is proactive
- Effortlessly self-motivates while working on team-based projects
- A well-organized, thorough, and detail-oriented person
- Able to keep the "bigger picture" in mind while prioritizing conflicting demands and tasks
- Confident enough to voice your opinion, ask questions, and suggest a better solution without being abrasive
- Enjoys collaborating with others, including other functional teams, to implement automated, scalable, stable, and efficient infrastructure
- Completion of the Certified Kubernetes Administrator (CKA) exam
- Experience working with on-premises infrastructure
- Understanding of software delivery practices such as feature toggling and zero-downtime deployment
- Experience researching and advising on new technology implementations