Montréal, Quebec H1A 0A1 Posted February 20th, 2026
Looking for more job opportunities? Click here
Job Type: Full Time
Job Category: IT
Job Description
AI SRE / AI Ops engineer
Montreal, QC - Hybrid
Skills Required :
Production experience in SRE / Infrastructure / ops for large-scale systemsStrong programming/scripting skills (Python, Go, Java, or equivalent)Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architecturesExperience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)Production experience in SRE / Infrastructure / ops for large-scale systemsStrong programming/scripting skills (Python, Go, Java, or equivalent)Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architecturesExperience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)Networking & systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)Solid experience in capacity planning, performance tuning, scaling, and incident responseDemonstrated ability to lead RCAs, deploy fixes, and drive reliability improvementsExperience in regulated environments (financial services, compliance, audit, security) is a strong plusExcellent communication, documentation, and cross-team collaboration skillsProven track record of reducing operational toil via automationRequired Skills
DEVOPS ENGINEER
SENIOR EMAIL SECURITY ENGINEER