This document describes a vacation advisor tool that provides vacation recommendations and cost estimates based on a user's budget. It clusters over 720 US cities based on flight costs, hotel rates, daily expenses, and location attributes to identify similar destinations. Different clustering algorithms like k-means, VQ, and mean shift are evaluated using metrics like within-cluster similarity. The goal is to match users with appealing vacation options within their budget.
7. Features
Flight cost from New York to 720 cities in the US
Average cost of hotels for 3143 counties
Average daily expenses including car
Location of city [east, west, central, south, north]
City speci鍖cs like beaches, museum, national parks
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor
13. Cluster Validation
Table: (Cluster Validation)
Number of Clusters WSS(105) City Similarity
2 10.32 Seattle Detroit, Charlotte, South Bend
3 9.93 Seattle Boston, Phoenix, Detroit
4 9.91 Seattle Charlotte, South Bend, Minneapolis
5 8.40 Seattle Detroit, Charlotte, South Bend
7 9.62 Seattle Detroit, Charlotte, South Bend
10 9.63 Seattle Sacrameto, San Jose, Colombus
12 9.52 Seattle Sacrameto, San Jose, Colombus
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor
14. Cluster Initialization and Validation
Table: (Cluster Initialization and Validation)
Alg Time(s) homo compl v-meas ARI AMI silhouette
k-means 0.03 0.971 0.971 0.971 0.988 0.970 0.389
VQ 0.04 1.000 1.000 1.000 1.000 1.000 0.388
After PCA 0.00 1.000 1.000 1.000 1.000 1.000 0.388
Mean Shift 0.24 1.000 0.970 0.972 0.980 0.972 0.386
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor