�ݺ�ߣ

DISCOVER REAL TIME
KNOWLEDGE CLUSTERS
DevsNearMe

Priorities
 Separate signal from noise
 Can we at least predict better than others
 Ideally a probability distribution model giving us
ideas for best-case, expected value or worst-case
behavior

Model/Algorithm
 Prediction based on
 Absolute number + Prediction (Context –based
information + Learning from User past behavior)

Absolute numbers
 Foursquare check-ins are fairly reliable, as are MTA
and TSA swipes
 This just gets added to the prediction, no weighing
applied currently but may be modify if there is a
trend of fake data being generated

Context based Prediction
 Context based – Use Decision Tree Learning to
generate weights to apply to event rsvp counts for
eventbrite, meetup, facebook.
 E.g A meetup event rsvp has a higher weight if it is a
paid event, has free giveaways and if the weather is
nice
 Weights are in range 0-1 and we multiply each event
rsvp count by their weight and divide by 3 to get the
weighted average rsvp count.
 Events of similar nature in a nearby radius will
downgrade the potential attendance

User Learning based prediction
 A persons likelihood of attending an event can be
modelled in a Bayesian manner
 Past event attendance/rsvp ratio , history of
attending a series of events of a particular nature
 Item based classification is another factor e.g if
person a,b,c,d attend events X and Y and we know
that b,c, and d are attending event Z, there is a
higher chance for a to attend event Z

Age based classification

 Sharing peaks at teenage, early adulthood and then falls
down
 Influence of social data needs inversely weighing to infer
total count of people at an event

Gender based classification
 Social sharing can weigh in gender for better
classification

Miscelleneous
 Chart data sources : appdata, beevolve, appdata,
quora, statista

�ݺ�ߣ

DevsNearMe

Convert to study guideBETA

More Related Content

DevsNearMe