Position Title: DevOps / SRE
Location: Stamford, CT (Onsite)
Job Description
Performance Engineering
• Design and develop performance test scripts using Apache JMeter or equivalent tools.
• Create test scenarios including load, stress, spike, endurance, and scalability.
• Collaborate with developers, architects, and business stakeholders to understand performance goals.
• Experience with Dynatrace, Grafana, or Kibana.
DevOps / SRE / Platform Engineering
• 10+ years in DevOps, SRE, or Platform Engineering.
• 6+ years of Kubernetes experience.
• 3+ years hands-on OpenShift 4.x operations at scale.
• Own enterprise-grade OpenShift 4.x platforms across on-prem and cloud.
• Deliver reliable, secure, automated clusters; enable developer self-service through GitOps and golden patterns.
• Lead upgrades, scaling, and multi-cluster operations.
Core Skills
• Strong Linux (RHEL), networking (TCP/IP, DNS, TLS, routing), and storage fundamentals.
• Terraform/Ansible (IaC).
• Argo CD, Tekton, Operators, Helm/Kustomize.
• Experience with ODF/OCS and cluster upgrades.
• Security: RBAC, SCC/PSA, Network Policies, supply-chain controls, vulnerability remediation.
• Excellent troubleshooting across all platform layers.
OpenShift Expertise
• Design and implement OpenShift clusters (IPI/UPI).
• Day-2 operations: Machine Configs, upgrades, node pools.
• Multi-cluster governance with ACM/OCM.
• Networking/Ingress: OVN-Kubernetes, Multus, Ingress Controllers/Routes, L4/L7 load balancing, DNS/TLS.
• Storage: ODF/OCS/Ceph/Portworx, PVCs/PVs and storage classes, performance tuning, backup/restore (Velero/OADP).
GitOps / CI-CD / Security
• GitOps-first enablement: Argo CD (app-of-apps), Helm/Kustomize, Operators.
• Build reusable templates for namespaces, quotas, RBAC, policies.
• Tekton/OpenShift Pipelines, Quay/Harbor, image signing/promotion, SBOM, vulnerability scanning (RHACS/StackRox/Trivy).
• SCC/PSA, Network Policies, secrets management, compliance operator/OpenSCAP, Gatekeeper/Kyverno.
• Security posture: CVE remediation, audit compliance, policy conformance.
Additional Skills
• Red Hat certifications (EX280/EX288, RHCSA, RHCE).
• Service Mesh (Istio/Red Hat Service Mesh).
• Keycloak/SSO.
• External Secrets / Vault.
• AWS/Azure/GCP integration (LB, DNS, IAM, secrets), cost management/showback.