AI SRE

Montreal 6 days agoFull-time External
Negotiable
Montréal, Quebec H1A 0A1 Posted February 20th, 2026 Looking for more job opportunities? Click here Job Type: Full Time Job Category: IT Job Description AI SRE / AI Ops engineer Montreal, QC - Hybrid Skills Required : Production experience in SRE / Infrastructure / ops for large-scale systemsStrong programming/scripting skills (Python, Go, Java, or equivalent)Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architecturesExperience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)Production experience in SRE / Infrastructure / ops for large-scale systemsStrong programming/scripting skills (Python, Go, Java, or equivalent)Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architecturesExperience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)Networking & systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)Solid experience in capacity planning, performance tuning, scaling, and incident responseDemonstrated ability to lead RCAs, deploy fixes, and drive reliability improvementsExperience in regulated environments (financial services, compliance, audit, security) is a strong plusExcellent communication, documentation, and cross-team collaboration skillsProven track record of reducing operational toil via automationRequired Skills DEVOPS ENGINEER SENIOR EMAIL SECURITY ENGINEER