Job Description
We are seeking a seasoned Big Data Engineer with extensive experience in designing and implementing large-scale data lakes using the Cloudera ecosystem, particularly within the telecom domain.
The ideal candidate will have expertise in developing batch and real-time pipelines, optimizing distributed data systems, and ensuring secure, high-performance data infrastructure.
• Design and maintain pipelines using Spark, Hive, and Python on Cloudera.
• Develop real-time ingestion workflows with Kafka (e.g., CDRs, usage logs).
• Manage orchestration with Oozie, access control with Ranger, and APIs for downstream integration.
• Ensure security compliance (Kerberos, Ranger), performance tuning, and resource optimization.
• Collaborate with cross-functional teams to deliver telecom-focused data solutions.
• Experience with cloud migration/integration is an added advantage.
Requirements:
• 12–15 years of experience in Big Data Engineering with strong Cloudera expertise.
• Deep understanding of the telecom domain, including CDRs, OSS/BSS, and network data.
• Hands-on experience with Kafka, Python, Hive, HDFS, HBase, Oozie, Impala.
• Strong skills in data security, orchestration, and performance optimization.
• Cloud experience is an added advantage.