Observability Engineer (Cloud & Kubernetes)

Montreal 4 days agoContractor External
Negotiable
Observability Engineer (Cloud & Kubernetes) Location: Montreal, QC Hybrid role: 3 days onsite and 2 days remote. Employment Type: 1 Year Contract (FTC) About the Role We are looking for a Senior Observability Engineer to help design, build, and scale modern observability platforms across private and public cloud environments. This role is ideal for someone passionate about monitoring, reliability, and performance engineering for cloud-native and hybrid applications. You will work closely with DevOps, SRE, platform, and security teams to deliver scalable telemetry solutions that improve system visibility, uptime, and performance. 🛠️ Key Responsibilities • Design and implement monitoring, logging, and tracing solutions for cloud and Kubernetes environments • Build dashboards, alerts, and automated observability workflows • Integrate observability into CI/CD pipelines • Support modernization of observability platforms in multi-cloud environments (AWS/Azure/GCP) • Enable teams to use telemetry data for reliability and performance improvements • Share best practices and drive adoption of observability standards • Mentor teams and support knowledge transfer • Participate in on-call rotation when required Required Skills • 5+ years in Observability, Monitoring, or SRE roles • Experience with at least one cloud platform (AWS, Azure, or GCP) • Strong hands-on experience with tools like: • Grafana, Prometheus, Datadog, Splunk, Elastic, Loki, Tempo, or similar • Kubernetes and container monitoring experience (EKS/AKS/GKE) • Dashboard creation, alerting, and telemetry configuration • CI/CD integration and automation (Terraform, YAML, Python, or Bash) • Linux system knowledge • Understanding of application and infrastructure architecture • Strong communication and problem-solving skills Nice to Have • Distributed tracing & application instrumentation experience • APM tools experience • Go programming knowledge • DevOps automation background • Knowledge of HA/DR architectures • Networking fundamentals (TCP/IP, HTTP, Load Balancers)