Machine learning is being used at Carousell in several ways to improve the user experience. Category and title suggestions using convolutional neural networks help users list items faster and improve listing quality. Chat reply suggestions apply dot product ranking and recurrent neural networks to previous messages to suggest relevant responses. These models are trained on Google Cloud and deployed using TensorFlow and Kubernetes to provide fast, efficient predictions at scale.
2. THE CAROUSELL APP
SNAP, LIST, SELL
List your item for sale in 30 seconds
IN-APP CHAT
Chat directly with sellers without revealing
personal information
IMAGE-CENTRIC
Main focus of the app on images
SOCIAL FEATURES
Share listings on social media channels &
join groups of people with similar interests
3. 110
million
listings
15+
Avg Mins on the
app for active
users
43
million Sold
19
Cities
7
Countries
Largest and fastest growing
mobile classifieds in the region
12. Title Suggestion: Ranking of whitelisted titles
Samsung galaxy edge
Red dress
Forever 21 skirt
PS3 Slim 500GB
iPhone X
Wine cooler
iPhone case
Toyota Wish
Samsung galaxy edge
Red dress
Forever 21 skirt
PS3 Slim 500GB
iPhone X
Wine cooler
iPhone case
Toyota Wish
23. Training and Deployment
Write ML models using
TensorFlow
Train models using
Google Cloud ML Engine
Deploy models on Google
Kubernetes Engine
24. Category & Title Suggestions
Fast training
iteration
Trained with
Google Cloud
ML for 2 days
No manual
labelling
User-written
titles as labels
Performance
Train on tens of
millions of
images
Efficient
inference
Single pass
with one matrix
operation
25. Category & Title Suggestions - Results
~10s faster in time to list on average
Improved listing completion rate
Less wrongly categorized listings
29. Chat Reply Suggestions
Whitelist of
messages
x
scores = x . Y
precom
puted
vectors
Y
Previous
Messages
N-gram embeddings
Extension of
Dot-Product model
with Recurrent Neural
Network
RNN
30. Question Answer Model
Whitelist of
messages
x
scores = x . Y
precom
puted
vectors
Y
Previous
Messages
- Sequence Examples
- Bi-directional RNNs
- Hash Embeddings
- Attention layers
N-gram embeddings
RNN
31. References
(1) Efficient Natural Language Response Suggestion for Smart Reply
https://arxiv.org/abs/1705.00652
Authors:
Matthew Henderson, Rami Al-Rfou, Brian Strope, Yun-hsuan Sung,
Laszlo Lukacs, Ruiqi Guo, Sanjiv Kumar, Balint Miklos, Ray Kurzweil
(2) Question-Answer Selection in User to User Marketplace Conversations
https://arxiv.org/abs/1802.01766
Authors:
Girish Kumar, Matthew Henderson, Shannon Chan, Hoang Nguyen, Lucas Ngoo