Cloud Reliability and Support Engineer (AWS)

Hong Kong 4 days agoFull-time External
106k - 141.5k / yr
As a leading recruitment partner, Pinpoint Asia is mandated by a Top-Tier Global Quantitative Investment Manager to identify a high-caliber Cloud Reliability & Support Engineer to join their high-frequency, technology-driven environment. Our client is a global leader in systematic trading, where technology is the heart of the business. This is not a "standard" cloud role—it is a front-line mission for a technical problem-solver who thrives on resolving complex AWS infrastructure challenges in a fast-paced, high-stakes environment. Unlike typical DevOps roles focused purely on pipeline builds, this role is designed for the true infrastructure expert who enjoys the "puzzle" of troubleshooting. You will be the guardian of a highly available, scalable, and secure cloud ecosystem, ensuring that the firm’s scientific approach to investing is never hindered by technical downtime. Why this role? Direct Impact: You are critical to the "Keep the Lights On" (BAU) operations of a global trading giant.Technical Depth: Work across a sophisticated stack including AWS (EC2, S3, VPC), Kubernetes, Docker, Terraform, and Ansible.Global Collaboration: Partner with world-class developers and SRE teams across APAC, EMEA, and the US.What You’ll Do: Firefighting & Resolution: Take full ownership of troubleshooting and resolving complex AWS infrastructure incidents.Optimization: Monitor performance and cost-effectiveness, ensuring the cloud environment is lean and high-performing.Scripting & Automation: Use Python, Bash, and Terraform to automate repetitive tasks and improve system reliability.Hybrid Administration: Navigate and troubleshoot across both Linux and Windows OS environments—a unique challenge in the quant space.Continuous Improvement: Evaluate and recommend new technologies to bolster reliability and security.The Ideal Profile: Experience: 3 to 10 years of professional experience.Mindset: You genuinely enjoy Support and Site Reliability. You are someone who loves to "dive in" without context and find the root cause of an error.Cloud Fluency: Extensive hands-on experience with core AWS services (VPC, EC2, S3) and container orchestration (Docker/K8s).Automation Chops: Proficient in Python/Bash and experienced with IaC tools like Terraform or Ansible.Resilience: You maintain extreme composure under pressure and can communicate complex technical solutions clearly to stakeholders.