Senior Site Reliability Engineer – Cloud Infrastructure

Ottawa 5 days agoFull-time External
409.1k - 715.9k / yr
Senior Site Reliability Engineer – Cloud Infrastructure In this role, you will support cloud and container-based platforms within a global follow-the-sun operations model. You will work closely with internal engineering teams to test and certify platform components, contributing to the reliability, performance, and scalability of private cloud environments. Your Responsibilities • Provide L3 support for private cloud environments, including participation in an on-call rotation. • Collaborate closely with engineering teams to test and validate new component releases and infrastructure upgrades. • Contribute to performance, capacity, and monitoring improvements. • Create and enhance support processes including documentation, automation, scripting, customer engagement, and incident/problem/change management. • Work jointly with L2 teams and other L3 colleagues across multiple regions. Your Profile • 5 to 10 years of relevant experience. • 3 to 5 years of Linux experience. • Experience with front-end and back-end development using Golang. • Strong knowledge of server infrastructure, virtualization, and cloud computing. • Proven experience with Kubernetes and Docker. • Solid understanding of networking and internet protocols (TCP/IP, HTTP/HTTPS). • Strong knowledge of security protocols (SSL/TLS, Kerberos). • Ability to manage multiple tasks and work under pressure during outages. • Experience with Agile and DevOps/SRE methodologies. • Administrative competence in at least one scripting language (e.g., Python). • Excellent communication skills with diverse user groups and distributed teams. • Willingness to participate in an on-call rotation every 5 weeks.