AI Platform Engineer II
Braze
- Toronto, ON
- Permanent
- Part-time
- Build and maintain critical services and subsystems on our AI platform, balancing performance with cost-effective operations
- Implement cloud-native solutions that ensure reliability, scalability, and fault tolerance
- Troubleshoot production incidents end-to-end, going deep to identify root causes and implement durable fixes
- Contribute to observability practices using Sentry and Datadog to proactively detect issues and minimize downtime
- Collaborate with data scientists, ML engineers, and product teams to translate real-world use cases into platform capabilities
- Improve developer experience by streamlining workflows, enhancing tooling, and supporting MLOps best practices
- Core Data & ML: Python, Ibis, FastAPI, Dataproc (Spark), SQL, BigQuery, MLflow, Streamlit
- Platform & Infrastructure: Google Cloud Platform, AWS, Kubernetes, Helm, Terraform
- Workflows & Orchestration: Airflow, RabbitMQ, Celery
- CI/CD: GitHub Actions, Jenkins
- Observability: Sentry, Datadog
- Production ML at scale: no toy datasets or notebook demos; you're building infrastructure that powers real AI workloads
- Engineering rigor: unit and integration tests, modular design, CI/CD, pair programming, and code reviews are how we work, not aspirations
- Learn continuously: deep exposure to ML system architecture, end-to-end ML workflows, and reinforcement learning systems
- 2-4 years of experience in platform engineering, infrastructure, or a related backend role
- Solid understanding of platform architecture, particularly in ML or data-intensive environments
- Hands-on experience with Kubernetes and cloud infrastructure (GCP preferred)
- Ability to troubleshoot complex distributed systems under pressure
- Writes clean, modular code with a focus on testable APIs and maintainable design
- Experience working with AI coding assistants. Understands effective prompting strategies and can articulate when these tools add value versus when they're not appropriate
- Clear communicator who can work across technical and non-technical stakeholders
- Proactive problem solver who identifies issues and works around obstacles without waiting for direction
- Competitive compensation that may include equity
- Retirement and Employee Stock Purchase Plans
- Flexible paid time off
- Comprehensive benefit plans covering medical, dental, vision, life, and disability
- Family services that include fertility benefits and equal paid parental leave
- Professional development supported by formal career pathing, learning platforms, and a yearly learning stipend
- A curated in-office employee experience, designed to foster community, team connections, and innovation
- Opportunities to give back to your community, including an annual company-wide Volunteer Week and donation matching
- Employee Resource Groups that provide supportive communities within Braze
- Collaborative, transparent, and fun culture recognized as a Great Place to Work®