Lead Cloud Site Reliability Engineer (SRE)

London 3 days agoFull-time External
Negotiable
Job Description - We're looking for a Lead Cloud Site Reliability Engineer (SRE) with strong expertise in Azure, Kubernetes, Terraform, and GitHub to lead large-scale projects and mentor a growing team. Key Responsibilities • Lead SRE activities for large-scale cloud projects, providing technical guidance to engineers. • Deliver solutions across VMs and Kubernetes, ensuring efficient deployment, scaling, and management. • Implement CI/CD pipelines using GitHub Actions or similar tools. • Design and manage Infrastructure as Code (IaC) using Terraform (preferred), Ansible, Jenkins, etc. • Assess networking requirements and design secure solutions (load balancing, firewalls, routing). • Troubleshoot and resolve complex cloud infrastructure and application issues. • Mentor junior engineers and promote knowledge sharing within the team. • Collaborate with stakeholders, vendors, and cross-functional teams (Cyber Security, Testing, Application). • Support cloud migration initiatives using frameworks like CAF, AzureRM, Google Cloud. • Represent the team during project delivery and ensure adherence to change control processes. • Participate in 24/7 on-call support rota and occasional support for previous adoption work. What We're Looking For • Strong DevOps background with automation-first mindset • Expertise in Azure, Kubernetes, Terraform, GitHub • Experience in cloud migration and networking solutions • Ability to lead projects and communicate effectively • Familiarity with change control processes Nice to Have • Cloud certifications (Azure, GCP, etc.) • Experience with multi-Tenant solutions • Passion for continuous learning and innovation