Note : We don t want any SRE/ Operations /Developers purely looking for Platform Engineers who manage configure, Authentication, Authorizations, Admin tasks, Product Expansions dynamically based on Volumes.
Kafka JD:
Key Responsibilities:
• Deploy, configure and maintain Confluent Kafka clusters in a highly available environment.
• Perform Kafka upgrades and patching ensuring minimal downtime and seamless transitions.
• Administer security protocols and access controls to protect the platform.
• Monitor cluster health and performance using our established monitoring stack(Prometheus, Grafana)
• Troubleshoot and resolve cluster-related incidents such as broker outages, replication lag.
• Develop and enhance automation scripts to streamline operational tasks and reduce manual intervention.
• Assist with planning and managing cluster resources to support future growth.
Qualifications:
• Bachelor's degree in Computer science, Information technology or equivalent professional experience.
• 5+ years of Kafka administration experience, ideally with confluent platform.
• Demonstrated expertise in Kafka upgrades and production troubleshooting.
• Strong background in Linux system administration.
• Proficiency in Python and/or Bash scripting for automation.
• Good understanding of Prometheus and Grafana for monitoring and observability.
• Certification on Confluent Kafka is a plus.