Observability Engineer-- CHODC5728259

Montreal 2 days agoContractor External
Negotiable
Title : Observability Engineer Location: Montreal, QC (Hybrid Role) Direct Client Description: We are seeking an experienced and motivated engineer to join the Observability fleet which focuses on delivering tools in private and public cloud environments. The role focuses on developing and modernizing Observability platforms for cloud-native and hybrid applications. This role involves designing, integrating, and maintaining solutions for collecting, transporting, and visualizing telemetry (tracing, metrics, and logging) to improve the reliability and uptime of our applications. You will closely collaborate with software developers, SRE, infrastructure, and security teams to drive automation and implement best-in-class observability solutions supporting both development and operations in a hybrid cloud environment. Required Skills: • Experience with any one of the public cloud providers (AWS, Azure, Google) • At least 5 years of relevant experience in Observability, Logging, and Monitoring in enterprise environments. • Hands-on experience with observability tools such as Grafana, Prometheus, Loki, Cortex, Tempo, ElasticSearch, Datadog, Splunk, or equivalents. • Experience working with container technologies (Docker, Kubernetes) and orchestration platforms (GKE or similar). • Proficiency in setting up and configuring dashboards, alerts, and alarms on telemetry data. • Skilled in configuring and establishing monitoring for applications deployed in public cloud environments. • Experience in integrating observability tools with CI/CD pipelines and automating through scripting (Python, Bash, JSON, YAML, Terraform or similar). • Excellent communication, presentation, and problem-solving skills. • Proficiency with Linux operating systems and databases (MySQL, DB2, MSSQL, or similar). • Solid understanding of how enterprise service delivery components interact (web servers, application servers, databases, web services, storage, security) • Open to work in on call rotation (+- every 6 weeks) Nice to have: • Experience with application instrumentation for distributed tracing, metric and log collection. • Experience with Go programming language is a plus. • Experience with DevOps tooling and automation. • Prior experience with Application Performance Management (APM) solutions. • Experience integrating end-user applications with monitoring and APM tools. • Understanding of enterprise-architecture concepts: 3-tier architecture, high-availability/disaster recovery, active-active data centers, etc. • Familiarity with networking concepts and protocols (OSI model, TCP/IP, HTTP, firewalls, load balancers).