Cloud Platform Engineer (Azure & AWS)

Houston | 2 days ago | Full-time | External
557 - 626 / hr
Location: On-site – Houston, TX (ZIP Code 77040)

You will play a key role in the development, maturation, and operational excellence of our Azure and AWS cloud platforms, supporting tenants, subscriptions, landing zones, networking, security, and core infrastructure through best practices and Infrastructure as Code (IaC). This role is 100% on-site and also requires hands-on support for on-premises Kubernetes clusters.

The ideal candidate brings a strong blend of cloud subject-matter expertise, hands-on engineering skills, leadership capability, and strong written and verbal communication skills, enabling close collaboration across engineering and operations teams.

Key Responsibilities
• Design, implement, and maintain scalable, secure, and reliable cloud infrastructure aligned with Site Reliability, Observability, and Scalability best practices
• Collaborate with Product, DBAs, Developers, DevOps, SRE, and Data Engineering teams to embed IaC, automation, and cost optimization early in solution design
• Architect and deliver technical solutions that improve reliability and service levels across cloud and on-prem platforms
• Support and operate Kubernetes clusters, including provisioning, performance tuning, and security configuration
• Monitor platforms, investigate incidents, perform root cause analysis, and drive permanent remediation
• Identify and resolve bottlenecks, manage operational issues, and coordinate support efforts
• Ensure continuous availability of data platform and infrastructure services
• Document architecture, standards, and operational procedures
• Proactively identify platform improvement opportunities and present recommendations to leadership

Qualifications
• 5+ years of hands-on experience with AWS and Azure (certifications or equivalent experience)
• 5+ years of experience with DevOps practices and deployment automation
• 5+ years administering and maintaining Linux-based systems
• 3+ years of experience with infrastructure automation tools such as Terraform, Ansible, or Salt
• Strong expertise in Azure DevOps
• Experience designing and operating highly available, fault-tolerant systems, including backup, recovery, load balancing, and disaster recovery
• Proven experience with large-scale distributed systems
• Excellent written and verbal communication skills, with the ability to convey technical concepts to both engineering and management audiences
• Solid understanding of core IT infrastructure including virtualization, networking, and storage