This document provides an overview of machine learning from a developer's perspective. It begins by clarifying what ML is not, such as a silver bullet or a rule-based expert system, and then gives two working definitions: learning from data without explicit programming, and developing algorithms that teach themselves a task from examples. It lists the mathematics behind ML (statistics, probability, linear algebra, and optimization), describes the main types of learning (supervised, unsupervised, and reinforcement learning), and mentions Python tools for ML. It recommends best practices such as splitting data properly and tuning hyperparameters, and closes with online resources: notebooks, tutorials, books, and blogs.
3. What ML is not?
A silver bullet that solves all your automation needs.
Rule-based expert systems (classic business analysts).
Dark magic that can infinitely generalize from a small dataset.
A Skynet-style algorithm with superhuman intelligence.
4. So what's ML anyway?
"Machine learning is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it." - Dan Ariely
Machine Learning is the art and science of developing algorithms that teach computers how to do a task without being explicitly programmed for it (i.e., without exposing them to all possible patterns).
5. The Math Behind ML
Statistics
Linear Algebra
Probability
Multivariate Calculus
Optimization Theory
Linear/Integer Programming
All of these feed into Machine Learning.
11. Continued
Despite the assumption of a mutually independent feature set, it works remarkably well most of the time.
Theoretically the best possible classifier (lowest achievable error rate) when the independence assumption holds and enough data is available.
Good for quickly establishing fairly strong baselines (see the sketch below).
Not good at handling noise, missing feature values, or correlated features.
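A minimal baseline sketch, assuming scikit-learn (one of the Python ML tools the deck points to); the iris dataset and the Gaussian variant of Naive Bayes are illustrative choices, not prescribed by the slides:

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

# Illustrative dataset; any labelled feature matrix works the same way.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# Gaussian Naive Bayes: each feature is modelled as an independent
# Gaussian conditioned on the class label (the "naive" assumption).
clf = GaussianNB().fit(X_train, y_train)
print("Naive Bayes baseline accuracy:",
      accuracy_score(y_test, clf.predict(X_test)))
```

A few lines like this are often enough to establish the kind of strong baseline the slide describes, against which more complex models can then be judged.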
13. Continued
A highly non-linear, lazy, online, instance-based learner.
Theoretically (asymptotically), the 1-NN error rate is at most twice the Bayes error rate.
The optimal value of K has to be tuned (e.g., by cross-validation) to balance the bias-variance trade-off; see the sketch below.
Fails in high-dimensional spaces due to the curse of dimensionality.
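A sketch of picking K by cross-validation, again assuming scikit-learn; the dataset and the candidate values of K are illustrative only:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)

# Treat K as a hyperparameter and choose it by cross-validation:
# small K -> low bias / high variance, large K -> high bias / low variance.
search = GridSearchCV(
    KNeighborsClassifier(),
    param_grid={"n_neighbors": [1, 3, 5, 7, 9, 11, 15]},
    cv=5,
)
search.fit(X, y)
print("best K:", search.best_params_["n_neighbors"])
print("cross-validated accuracy:", round(search.best_score_, 3))
```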
18. Best Books to Read
Pattern Classification - Duda, Hart, Stork
Machine Learning - Tom Mitchell
The Elements of Statistical Learning - Hastie, Tibshirani, Friedman
Machine Learning: A Probabilistic Perspective - Kevin Murphy
Introduction to Statistical Pattern Recognition - K. Fukunaga
19. Best Practices
Always split your data into training, validation, and test sets.
Keep the bias-variance trade-off in mind.
Tune your hyperparameters for optimal performance (on the validation data, never the test set).
Use domain- and problem-specific evaluation metrics.
Combine different classifiers/regressors to form an ensemble (a sketch tying these practices together follows below).
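A sketch combining these practices, assuming scikit-learn; the dataset, the chosen models, and the hyperparameter grid are illustrative, and GridSearchCV's internal cross-validation stands in for a separate validation split:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier, VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

# Hold out a final test set; cross-validation inside GridSearchCV
# plays the role of the validation split for hyperparameter tuning.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Tune a hyperparameter on the training data only.
forest = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid={"n_estimators": [50, 100, 200]},
    cv=5,
).fit(X_train, y_train).best_estimator_

# Combine different classifiers into a simple soft-voting ensemble.
ensemble = VotingClassifier(
    estimators=[
        ("rf", forest),
        ("lr", make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))),
        ("nb", GaussianNB()),
    ],
    voting="soft",
).fit(X_train, y_train)

print("held-out test accuracy:", round(ensemble.score(X_test, y_test), 3))
```

The accuracy reported here is just a stand-in; in practice the evaluation metric should be the domain-specific one the slide recommends.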