- Experienced with Machine Learning and Data Mining algorithms.
- Experienced with building ETL data pipeline.
- Experienced with building API for data products.
- Interested in how to compute machine learning algorithm on large dataset.
- Heavy use of Map/Reduce, Mahout, Giraph and Graphlab/Graphchi to build dataset.
- Heavy use of HBase, Play, Scala, Akka to provide OLTP API for data products.
- Ruby on Rails to visualize dataset.
- Participate Kaggle occationally for fun.
- Passionate on learning from Coursera(ML, PGM, NLP), Udacity(AI)
Specialties: Large scale data mining, Firm problem solving ability, HBase, Play, Scala, Spark, Hadoop, Giraph