Senior Site Reliability Engineer – Cloud Infrastructure
In this role, you will support cloud and container-based platforms within a global follow-the-sun operations model. You will work closely with internal engineering teams to test and certify platform components, contributing to the reliability, performance, and scalability of private cloud environments.
Your Responsibilities
• Provide L3 support for private cloud environments, including participation in an on-call rotation.
• Collaborate closely with engineering teams to test and validate new component releases and infrastructure upgrades.
• Contribute to performance, capacity, and monitoring improvements.
• Create and enhance support processes including documentation, automation, scripting, customer engagement, and incident/problem/change management.
• Work jointly with L2 teams and other L3 colleagues across multiple regions.
Your Profile
• 5 to 10 years of relevant experience.
• 3 to 5 years of Linux experience.
• Experience with front-end and back-end development using Golang.
• Strong knowledge of server infrastructure, virtualization, and cloud computing.
• Proven experience with Kubernetes and Docker.
• Solid understanding of networking and internet protocols (TCP/IP, HTTP/HTTPS).
• Strong knowledge of security protocols (SSL/TLS, Kerberos).
• Ability to manage multiple tasks and work under pressure during outages.
• Experience with Agile and DevOps/SRE methodologies.
• Administrative competence in at least one scripting language (e.g., Python).
• Excellent communication skills with diverse user groups and distributed teams.
• Willingness to participate in an on-call rotation every 5 weeks.