
Senior Infrastructure Engineer
- Toronto, ON
- Permanent
- Temps-plein
- Manage infrastructure for software, data, and ML platforms, both in the cloud and our on-premises GPU clusters.
- Design and implement integrations between infrastructure components (containing internal and 3rd party systems) to ensure seamless flow of data in a robust, reliable, and secure manner.
- Streamline and/or automate operational tasks such as infrastructure provisioning, configuration management, and application deployment.
- Implement and manage robust monitoring, logging, and alerting for key infrastructure components.
- Collaborate closely with engineers, scientists, security, and compliance teams to implement and promote DevSecOps principles across the organization.
- 5+ years of experience working as an Infrastructure Engineer, DevOps/MLOps Engineer, or SRE.
- Proficient in architecting and managing infrastructure using Infrastructure as Code tools (e.g. Terraform and Helm).
- Deep expertise in containerization and orchestration technologies like Docker and Kubernetes.
- Strong understanding of identity management and security best practices.
- Extensive experience designing, implementing, and maintaining CI/CD pipelines (e.g. CircleCI).
- Demonstrated experience with mentoring and elevating other team members' skills to adhere to engineering and DevOps best practices.
- Experience with Python/Shell scripting and automation tools.
- Experience managing infrastructure on Google Cloud Platform (GCP).
- Hands-on experience with modern ML platforms and frameworks (e.g. Weights & Biases, Metaflow, MLflow, Ray) and familiarity with the operational challenges of scaling ML workloads.
- Experience designing and operating hybrid-cloud architectures that span on-premises and cloud environments, with an emphasis on resilience, observability, and cost optimization.
- Familiarity with secrets management, zero-trust architectures, and secure-by-default design patterns in regulated or privacy-sensitive environments.
- A collaborative and innovative environment at the frontier of computational biology, machine learning, and drug discovery.
- Highly competitive compensation, including meaningful stock ownership.
- Comprehensive benefits - including health, vision, and dental coverage for employees and families, employee and family assistance program.
- Flexible work environment - including flexible hours, extended long weekends, holiday shutdown, unlimited personal days.
- Maternity and parental leave top-up coverage, as well as new parent paid time off.
- Focus on learning and growth for all employees - learning and development budget & lunch and learns.
- Facilities located in the heart of Toronto - the epicenter of machine learning and AI research and development, and in Kendall Square, Cambridge, Mass. - a global center of biotechnology and life sciences.