46. ???? J ????? ??? ?? ????? @ ??????,
???? == ??? ??
??????? ??(???)? ????? ??/??/??/?
?/??? ???? ?.
¨¤ Learning from Data
46
Inductive Learning (learning by examples)
automatic discovery of regularities in data through the use
of computer algorithms and generalizing those into new but
similar data
56. ???? J ????? ??? ?? ????? @ ??????,
Statistics vs Machine Learning
56https://www.datasciencecentral.com/profiles/blogs/machine-learning-vs-traditional-statistics-different-philosophi-1
https://svds.com/machine-learning-vs-statistics/
¡°Machine learning is for Computer
Science majors who couldn¡¯t pass a
Statistics course.¡±
57. ???? J ????? ??? ?? ????? @ ??????,
Statistics vs Machine Learning
57https://www.datasciencecentral.com/profiles/blogs/machine-learning-vs-traditional-statistics-different-philosophi-1
https://svds.com/machine-learning-vs-statistics/
¡°Machine learning is essentially
a form of applied statistics¡±
58. ???? J ????? ??? ?? ????? @ ??????,
Statistics vs Machine Learning
58https://www.datasciencecentral.com/profiles/blogs/machine-learning-vs-traditional-statistics-different-philosophi-1
https://svds.com/machine-learning-vs-statistics/
¡°Machine learning is Statistics minus any
checking of models and assumptions.¡±
60. ???? J ????? ??? ?? ????? @ ??????,
Machine Learning (ML) Traditional statistics (TS)
Goal: ¡°learning¡± from data of all sorts Goal: Analyzing and summarizing data
No rigid pre-assumptions about the problem a
nd data distributions in general
Tight assumptions about the problem and data distri
butions
More liberal in the techniques and approaches Conservative in techniques and approaches
Generalization is pursued empirically through t
raining, validation and test datasets
Generalization is pursued using statistical tests on the
training dataset
Not shy of using heuristics in approaches in se
arch of a ¡°good solution¡±
Using tight initial assumptions about data and the pr
oblem, typically in search of an optimal solution unde
r those assumptions
Redundancy in features (variables) is okay, and
often helpful. Preferable to use algorithms desi
gned to handle large number of features
Often requires independent features. Preferable to us
e less number of input features
Does not promote data reduction prior to learn
ing. Promotes a culture of abundance: ¡°the mo
re data, the better¡±
Promotes data reduction as much as possible before
modeling (sampling, less inputs, ¡)
Has faced with solving more complex problems
in learning, reasoning, perception, knowledge
presentation, ¡
Mainly focused on traditional data analysis
60