Role responsibilities;
• Responsible for implementation and ongoing administration of Hadoop infrastructure.
• Responsible for Cluster maintenance, trouble shooting, Monitoring and followed proper backup & Recovery strategies.
• Provisioning and managing the life cycle of multiple clusters like EMR & EKS. Infrastructure monitoring, logging & alerting with PrometheGrafana/Splunk.
• Performance tuning of Hadoop clusters and Hadoop workloads and capacity planning at application/queue level. Responsible for Memory management, Queue allocation, distribution experience in Hadoop/Cloud era environments.
The ideal candidate will have experience with the following;
• Experience in Linux / Unix OS Services, Administration, Shell, awk scripting
• Strong knowledge of anyone programming language Python/Scala/Java/R with Debugging skills.
• Experience in Hadoop (Map Reduce, Hive, Pig, Spark, Kafka, HBase, HDFS, H-catalog, Zookeeper and Oozie/Airflow)
• Experience in Hadoop security (Kerberos Knox, TLS