Role responsibilities;
- Responsible for implementation and ongoing administration of Hadoop infrastructure.
- Responsible for Cluster maintenance, trouble shooting, Monitoring and followed proper backup & Recovery strategies.
- Provisioning and managing the life cycle of multiple clusters like EMR & EKS. Infrastructure monitoring, logging & alerting with PrometheGrafana/Splunk.
- Performance tuning of Hadoop clusters and Hadoop workloads and capacity planning at application/queue level. Responsible for Memory management, Queue allocation, distribution experience in Hadoop/Cloud era environments.
The ideal candidate will have experience with the following;
- Experience in Linux / Unix OS Services, Administration, Shell, awk scripting
- Strong knowledge of anyone programming language Python/Scala/Java/R with Debugging skills.
- Experience in Hadoop (Map Reduce, Hive, Pig, Spark, Kafka, HBase, HDFS, H-catalog, Zookeeper and Oozie/Airflow)
- Experience in Hadoop security (Kerberos, Knox, TLS).