This document discusses the VacAdvisor tool, which helps users plan vacations based on their budget. It analyzes vacation destination data using clustering algorithms to group similar cities together. It evaluates different clustering methods and selects k-means clustering as the best approach. VacAdvisor then uses the clusters to recommend vacation options to users based on their budget.
12. Cluster Validation
Table: (Cluster Validation)
Number of Clusters WSS(103) City Cities Closest to Centroid
2 10.32 Seattle Detroit, Charlotte, South Bend
3 9.93 Seattle Boston, Phoenix, Detroit
4 9.91 Seattle Charlotte, South Bend, Minneapolis
5 8.40 Seattle Detroit, Charlotte, South Bend
7 9.62 Seattle Detroit, Charlotte, South Bend
10 9.63 Seattle Sacrameto, San Jose, Colombus
12 9.52 Seattle Sacrameto, San Jose, Colombus
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor
22. Feature Engineering
Number of Images: 50,000 and 60 feature vector for each image
Perimeter
Moment of Inertia [4 features]
Elongation
Jaggedness
Circularity
Moment features [9 features]
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor