Research Staff Member at IBM T. J. Watson Research Center
Industry
Technology / Software / Internet
About
I currently work on big data and distributed systems. Specifically, I accelerate machine learning algorithms using scale-up (e.g., GPU) and scale-out (e.g., Spark) systems. As an example, I built cuMF (https://github.com/wei-tan/CuMF/), a scalable matrix factorization (MF) library for GPUs. To my knowledge, cuMF is the fastest MF library and can handle the largest MF problems reported to date. cuMF can be used in recommender systems, embedding layers in deep learning, and topic models.
I have also worked on NoSQL systems (e.g., HBase) and services computing.
My work and code have been incorporated into IBM's patent portfolio and into products such as BigInsights and Cognos. I am also a very hands-on researcher (see my GitHub p...