DevOps/SRE

New York · 13 days ago · Full-time · External
Negotiable
Position Title: DevOps / SRE
Location: Stamford, CT (Onsite)

Job Description

Performance Engineering
• Design and develop performance test scripts using Apache JMeter or equivalent tools.
• Create test scenarios including load, stress, spike, endurance, and scalability.
• Collaborate with developers, architects, and business stakeholders to understand performance goals.
• Experience with Dynatrace, Grafana, or Kibana.

DevOps / SRE / Platform Engineering
• 10+ years in DevOps, SRE, or Platform Engineering.
• 6+ years Kubernetes.
• 3+ years hands-on OpenShift 4.x operations at scale.
• Own enterprise-grade OpenShift 4.x platforms across on-prem and cloud.
• Deliver reliable, secure, automated clusters; enable developer self-service through GitOps and golden patterns.
• Lead upgrades, scaling, and multi-cluster operations.

Core Skills
• Strong Linux (RHEL), networking (TCP/IP, DNS, TLS, routing), storage concepts.
• Terraform/Ansible (IaC).
• Argo CD, Tekton, Operators, Helm/Kustomize.
• Experience with ODF/OCS and cluster upgrades.
• Security: RBAC, SCC/PSA, Network Policies, supply-chain controls, vulnerability remediation.
• Excellent troubleshooting across all platform layers.

OpenShift Expertise
• Design and implement OpenShift clusters (IPI/UPI).
• Day-2 operations: Machine Configs, upgrades, node pools.
• Multi-cluster governance with ACM/OCM.
• Networking/Ingress: OVN-Kubernetes, Multus, Ingress Controllers/Routes, L4/L7 load balancing, DNS/TLS.
• Storage: ODF/OCS/Ceph/Portworx, PVC/PV classes, performance tuning, backup/restore (Velero/OADP).

GitOps / CI-CD / Security
• GitOps-first enablement: Argo CD (app-of-apps), Helm/Kustomize, Operators.
• Build reusable templates for namespaces, quotas, RBAC, policies.
• Tekton/OpenShift Pipelines, Quay/Harbor, image signing/promotion, SBOM, vulnerability scanning (RHACS/StackRox/Trivy).
• SCC/PSA, Network Policies, secrets management, compliance operator/OpenSCAP, Gatekeeper/Kyverno.
• Security posture: CVE remediation, audit compliance, policy conformance.

Additional Skills
• Red Hat certifications (EX280/EX288, RHCSA, RHCE).
• Service Mesh (Istio/Red Hat Service Mesh).
• Keycloak/SSO.
• External Secrets / Vault.
• AWS/Azure/GCP integration (LB, DNS, IAM, secrets), cost management/showback.