We are looking for a Senior Dev Ops Engineer to design, build, and operate scalable, secure, and highly available platforms. This role focuses on automation, cloud infrastructure, CI/CD, observability, and reliability, working closely with development, security, and operations teams. The ideal candidate combines strong hands on technical expertise with a Dev Ops mindset, taking ownership of platform stability, delivery efficiency, and continuous improvement.
Key Responsibilities Platform & Infrastructure Design, build, and manage cloud infrastructure across AWS, Azure, GCP, or Alibaba Cloud Implement Infrastructure as Code (IaC) using Terraform and related tools Ensure high availability, scalability, resilience, and disaster recovery Optimize cloud cost, performance, and resource utilization CI/CD & Automation Design and maintain CI/CD pipelines using Jenkins, Git Lab CI, Git Hub Actions, or Azure Dev Ops Automate build, test, security scanning, and deployment processes Implement deployment strategies such as Blue/Green, Canary, and Rolling deployments Improve release frequency while reducing deployment risk Containers & Kubernetes Build and manage containerized applications using Docker Operate Kubernetes clusters (on-prem or managed services) Manage Helm charts, Kubernetes manifests, secrets, and configurations Troubleshoot Kubernetes networking, scaling, and performance issues Observability & Reliability Implement monitoring, logging, and tracing solutions (Prometheus, Dynatrace, Datadog, ELK, Open Telemetry) Define and track SLIs, SLOs, and SLAsParticipate in incident management, on-call rotations, and root cause analysis Drive reliability improvements and reduce operational toil Security & Dev Sec Ops Integrate security controls into CI/CD pipelines Implement secrets management and IAM best practices Ensure security across containers, infrastructure, and pipelines Support compliance and security audits when required Collaboration & Leadership Work closely with development, QA, security, and architecture teams Mentor junior engineers and promote Dev Ops best practices Contribute to architectural decisions and technical standards Document systems, runbooks, and operational procedures
Required Skills & Experience Technical Skills5 years of experience in Dev Ops, SRE, or Cloud Engineering Strong Linux system administration and troubleshooting skills Hands-on experience with: (Cloud platforms (AWS / Azure / GCP / Alibaba Cloud) Terraform and Infrastructure as Code CI/CD tools and pipeline design Docker and Kubernetes Proficiency in scripting and automation (Bash, Python) Experience with monitoring, logging, and alerting tools Solid understanding of networking (DNS, HTTP, load balancing) Architecture & Engineering Experience supporting microservices architectures Understanding of application performance and scalability Familiarity with SQL and NoSQL databases from an operations perspective Experience with high-availability and fault-tolerant systems
Nice to Have Skills SRE practices (error budgets, capacity planning) Git Ops tools (Argo CD, Flux) Service mesh technologies Performance testing and tuning Platform engineering or internal developer platforms Cloud cost optimization (Fin Ops)