The document discusses using geographic distance and other factors to improve venue recommendation systems. It analyzes Foursquare data on venues and users in London to determine which categories of venues users tend to visit locations farther from home. It then develops a probabilistic model to calculate the likelihood a user will visit a venue based on their past likes, proximity, and visit frequency. The model is tested using naive Bayesian, Bayesian, and linear regression approaches. Results show geographic closeness strongly influences recommendations and incorporating domain knowledge significantly improves accuracy over basic collaborative filtering. The discussion notes the importance of understanding one's recommendation domain and how performance depends on venue category characteristics.
1 of 23
Download to read offline
More Related Content
Ads and the City
1. Ads and the City:
Considering Geographic Distance Goes a
Long Way
Diego Saez-Trumper1 Daniele Quercia 2 Jon Crowcroft 2
1
Universitat Pompeu Fabra, Barcelona
2
Computer Laboratory, University of Cambridge
Dublin, September, 2012
6. On people mobility (from the literature)
distance matters
likes might matter
¡°power users¡± are special
p(go|like, close) ¡Ø pgo ¡¤ pclose ¡¤ plike
7. plike
#venues visited by user u with rating lui
p(like = lui |go) =
total #venues visited by user u
lui is ranking obtained from item-based CF algorithm.
8. pgo
#venues visited by user u
pgo =
total #venues
21. Final Remarks
results depend on venue category (different ¦Á and predictability)
geographic closeness plays a very important role.
domain knowledge signi?cantly improves recommendations
results.
22. ¡°Understanding the speci?cs of your domain
is critical to building a good recommender¡±
Paul Lamere @ recsys¡¯12