At Wattpad, we have observed that a majority of the reading activities happen in sequence and they vastly vary by time-of-the-day or namely: user session type. To address this, we designed a recommender system which can model users temporal dynamics and predict what story a user is going to read next based on users sequential behaviour in a particular session.
3. Why Session based ?
Wattpad users read different content based on
different context. Context examples:
Morning/Bed time reads
Commute reads
Vacation reads
etc.
A lot of reading activities happen in sequence:
Story series, prequels/sequels, serialized stories
etc.
8. Training Dataset
Number of users: 20M
Number of stories: 4M
Number of sequences: 100M+
Max sequence length: 30
Min sequence length: 3
Minibatch size: 1024
Keras with TF backend
Trained on 8-GPU for 2 days
9. Online Experiment Results
Experiment sample size:
~ 1 M users
Traffic split: 50/50
Eval metric:
normalized mean reading time
Experienced Users: 39%
New Users (signed up in last 2 weeks): 82%
10. Learnings
Deep Learning opens the door to a whole bunch of new
possibilities for recommender systems:
Powerful representation
Flexibility
However, sequential recs is not a solution for every
application
Intensive pre and post processing required to make it
work properly
Training is expensive (try with a small dataset with a
simple architecture at first)
It is highly rewarding (when it works)