Staff Software Developer, AI Infrastructure, Applied AI
Google Voir toutes les offres
- Waterloo, ON
- Permanent
- Temps-plein
- Bachelor’s degree or equivalent practical experience.
- 8 years of experience in software development focusing on infrastructure development in C++.
- 5 years of experience with one or more of the following: speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), or specialization in another ML field.
- 5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
- 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture.
- Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
- 8 years of experience with data structures and algorithms.
- 3 years of experience in a technical leadership role leading project teams and setting technical direction.
- 3 years of experience working in an organization involving cross-functional, or cross-business projects.
- Develop and build high-performance, distributed infrastructure to support agentic AI workflows, leveraging C++ to ensure low-latency agentic systems for real-world enterprise loads.
- Take full ownership of the technical stack, transitioning experimental models into production services while ensuring system reliability, observability, and fault tolerance in multi-agent environments.
- Drive inference cost optimization and system efficiency by implementing efficient connectors, optimizing kernels, managing memory usage, and reducing latency to ensure AI solutions are not just powerful, but economically viable and at scale.
- Provide technical guidance on system architecture and code quality, fostering a culture of developing excellence through design reviews, code audits, and the adoption of best practices.
- Maintain a tight loop between hypothesis and deployment by quickly prototyping new capabilities and seamlessly hardening them for production release while focusing on customer needs.