1. Focus: Specialist in conceptualising and designing solutions for big-data analytics engagements, either on the Azure platform or on the open-source Apache Spark ecosystem (Spark Core, MLlib, Streaming and GraphX).
Expertise in using the Spark context in conjunction with Hive or Cassandra to provide RDD capability, running classification and regression tasks 10-100 times faster than Hadoop MapReduce jobs.
2. Ambidextrous Programmer: Ability to work in multiple programming languages (Microsoft R Server, Scala, Open-R, Python, CSQL, HQL, Pig Latin and SAS) helps in selecting the best language for each analytical sub-task: data munging, modelling, run optimisation and storage/persisting of datas...