[1 year contract, renewable]
About the Role
The Core Engineering Product (CEP) Team aims to spearhead the digital transformation of government. CEP operates key platforms within the Singapore Government Technology Stack that allow government agencies to create high-quality, reliable government services for citizens using commercial cloud services. The CEP Engineering Team develops the platform automation, processes, toolchains and policies/best practices that enable government agencies to develop, deploy and operate government services on the cloud in an agile yet secure manner.
What to Expect
• Participate in an in-house engineering squad that drives engineering excellence in the design, development and operation of various engineering productivity products.
• Participate in and lead development communities within the Singapore Government.
• Learn and implement large-scale product design and automation in a government context.
• Design, develop and maintain engineering products and tooling.
• Be the guiding subject matter expert for DevOps methodologies, and contribute to automation, availability, scalability and resiliency for the team and the development communities within government.
• This is a senior engineering role focused on designing, deploying, and operating Red Hat OpenShift Container Platform on OpenStack in our on-premises data centers. You’ll play a pivotal role in ensuring our container platforms are resilient, secure, and highly automated to support mission-critical workloads.
________________________________________
Key Responsibilities
• Platform Design & Deployment
o Work closely with CEP customers on technical requirements gathering and
o Architect and implement production-grade OpenShift clusters on OpenStack, including control plane, compute nodes, storage integrations, and networking.
o Adapt typical OpenShift and OpenStack designs to government security and governance compliance requirements.
o Provide deep technical advisory and design decision rationales to internal and external stakeholders.
o Define and automate infrastructure provisioning as Infrastructure as Code (IaC) using tools such as Terraform, Ansible, or Red Hat Ansible Tower.
• Operational Excellence
o Develop and maintain monitoring, alerting, and logging pipelines (Prometheus, Grafana, EFK/ELK, Alertmanager).
o Lead capacity planning, performance tuning, and day-to-day cluster health management.
o Implement robust backup, disaster recovery, and upgrade strategies.
• Automation & CI/CD
o Build and manage CI/CD pipelines (Jenkins, GitLab CI, Argo CD) for platform updates, operator deployments, and application rollouts.
o Author scripts and operators to automate routine maintenance, scaling, and self-healing tasks.
• Security & Compliance
o Enforce security best practices: RBAC, network policies, SELinux, secrets management (Vault, OpenShift Secrets).
o Collaborate with security teams to implement vulnerability scanning, baseline hardening, and compliance audits.
• Collaboration & Documentation
o Partner with development, QA, and networking teams to onboard new applications and troubleshoot platform issues.
o Produce runbooks, run-charts, design docs, and knowledge-base articles.
________________________________________
Required Qualifications
• Experience
o 5+ years in Linux system administration (RHEL) and virtualization (KVM/QEMU). Experience with VMware would be an added advantage.
o 3+ years deploying and operating OpenShift in production environments.
o Strong understanding of network and storage virtualisation.
o Hands-on experience with OpenStack (Ansible-based or OpenStack SDK): Nova, Neutron, Cinder, Keystone, Glance.
o An understanding of basic infrastructure security and policies in government would be an added advantage.
• Technical Skills
o Infrastructure as Code: Terraform, Ansible, or equivalent.
o Physical, virtual and container-based networking & storage: Calico, OVN, Ceph, Portworx.
o Monitoring/Logging: Prometheus, Grafana, ELK/EFK stacks.
o Scripting: Bash, Python, or Go.
o Networking fundamentals: VLANs, SDN, L3 routing, load balancing (HAProxy, OVN LB).
• Soft Skills
o Strong problem-solving and troubleshooting aptitude in complex distributed systems.
o Excellent verbal and written communication; able to produce clear operational documentation.
o Proactive, self-driven, and comfortable leading cross-functional initiatives.
________________________________________
Preferred Qualifications
• Red Hat Certified Specialist in OpenShift Administration or OpenStack (RHOS-CL310).
• Familiarity with VMware stacks.
• Experience with GitOps tools (Argo CD, Flux).
• Familiarity with service mesh (Istio, OpenShift Service Mesh) and serverless frameworks.
• Exposure to hybrid-cloud or multi-cloud OpenShift deployments.