Staff Site Reliability Engineer | SRE DevOps

Singapore | 4 days ago | Full-time | External
82.0k - 120.2k / mo
about company
I am currently working with a highly regulated financial services platform that specializes in digital assets and offers custody services. Salary budget wide open! 4 rounds of interviews to offer. 1 day in office, 4 days WFH.

about job
● Spearheading primary operational support and engineering for various platform services.
● Driving improvements in reliability, quality, and time-to-market across all system offerings.
● Developing, building, and maintaining robust operational tooling and automation to streamline workflows.
● Defining and tracking key performance indicators (SLIs/SLOs) in collaboration with development teams.
● Creating "Production-ready Scorecards" to formally evaluate system health before deployment.
● Providing education and mentorship to engineering teams on resiliency principles, including chaos testing and blue/green deployments.

skills and requirements
● Minimum 10 years of experience.
● Experience using monitoring, alerting, and automation tools to resolve performance issues in systems at scale.
● Expert proficiency in developing automated solutions using Infrastructure as Code (Terraform).
● Expert-level knowledge of containerization and orchestration technologies such as EKS (k8s), Nomad, and Docker.
● Expertise in Configuration Management tools like Ansible, Chef, or Puppet.
● Proficiency in writing scripts or CLI tools in high-level languages like Python or Go to enhance developer productivity.
● Proven experience as a Technical Leader, contributing to technical decision-making and architectural recommendations.

To apply online, please use the 'apply' function. Alternatively, you may contact Stella at 96554170 (EA: 94C3609 / R1875382).

skills
no additional skills required

qualifications
no additional qualifications required

education
Bachelor's degree