I am a professional data scientist with extensive exposure to large-scale structured and unstructured datasets from different domains including but not limited to online retailing, legal documents, media articles and medical devices.
Supervised Learning(classification): Deep Learning, Logistic Regression, SVM, Random Forest
Unsupervised Learning(clustering): K-mean
Natural Language Processing: POS tagging,Noun phrases extraction, tf-idf, LDA
Time Series Forecasting: SARIMA, Holt Winters, SVR, Random Forest Regression
Skills: Python, SQL, R, MongoDB/NoSQL, Redis, Linux/Unix, SEMrush, Google Trends/Correlate
Python packages: numpy, pandas, multiprocessing,sklearn, nltk, lda, gensim, c...