Expert in Big Data technologies: Spark, HDFS, MapReduce/YARN, Hive, HBase, Pig, and Sqoop
Experience in performance tuning of MapReduce, Hive, and Spark jobs, including optimization using different file formats
Experience with Hadoop cluster setup, integrating Spark with YARN, and setting up Hue to access Hive and Spark; hands-on experience in Scala
Strong end-to-end work experience in Big Data analytics (bringing Apache log-level information into the working environment, processing the logs using Hadoop MapReduce, data warehousing on top of Hadoop, high-level data summarization to an RDBMS, and cost-optimized cluster management in the Amazon cloud environment)
Well versed in automating the day-to-day OPs/dev...