How to use no-click as a signal for improving search rankings. Joint work with Aris Gionis, Ronny Lempel, and Yoelle Maarek
1 of 18
More Related Content
When no clicks are good news
1. When no clicks are good news
Carlos Castillo, Aris Gionis, Ronny Lempel, Yoelle Maarek
Yahoo! Research Barcelona & Haifa
2. 2 SIGIR 2010 Industry Track Geneva, Switzerland
Usage mining for search
Behavioral signals are useful to measure
performance of retrieval systems
Relevant results are
clicked more often,
visited for longer time,
lead to long-term engagement,
etc.
However, predicting user satisfaction accurately
from search behavior signals is still an open
problem
3. 3 SIGIR 2010 Industry Track Geneva, Switzerland
A (not-so-)special case
If we satisfy the user
by impression, then
we observe a lower
click-through rate
4. 4 SIGIR 2010 Industry Track Geneva, Switzerland
Satisfaction by impression
Oneboxes and Direct Displays
Oneboxes1
and Direct Displays2
(DD) are
Very specific results answering (mostly) unambiguous queries
with a unique answer directly on the SERP
Displayed above regular Web results, due to their high
relevance, and in a slightly different format.
Typical example: weather <city name>
Test: guess which onebox/DD was served by which search engine:-)
1
: Google terminology
2
:Yahoo! terminology
5. 5 SIGIR 2010 Industry Track Geneva, Switzerland
Increasing number of by impression results
When searching for specific stocks, movie or train schedules,
sports results, package tracking (Fedex/UPS), etc.
To the extreme, what about spell checking, arithmetic operations
or currency conversion, addresses, things to do?
6. 6 SIGIR 2010 Industry Track Geneva, Switzerland
The problem
Click-based metrics for user satisfaction
For cases where we expect no clicks
Not only search sessions
Any browsing/interaction session
7. 7 SIGIR 2010 Industry Track Geneva, Switzerland
Our proposal
General method
Pick a class of users with a distinctive behavior
Study their response to changes
8. 8 SIGIR 2010 Industry Track Geneva, Switzerland
Our proposal
General method
Pick a class of users with a distinctive behavior
Study their response to changes
Specific method
Find users who are Tenacious
reformulate or click, do not let go
Measure their abandonment
9. 9 SIGIR 2010 Industry Track Geneva, Switzerland
How to model users?
Session representation
Actions classes: queries and clicks
XQCQX means start, query, click, query, stop
Alternative: reformulation classes
User representation
Frequency of action 3-grams = 15 features in total
Tenacity = (XQQ+XQC)/(XQQ+XQC+XQX)
10. 10 SIGIR 2010 Industry Track Geneva, Switzerland
(Preliminary) experiments
Segment sessions into logical goals
Divide goals in two groups
With direct-displays above position 5 (DD)
Without (NO-DD)
Metric
Find users with TenacityNO-DD >= 80%
Measure TenacityDD / TenacityNO-DD
Ground truth
Ask humans do you think users querying Q will be
satisfied by impression by this DD?
1=never ... 5=always
11. Change in the tenacity of tenacious users
Pitbull: editorial vs metric (type weather)
12. BAD
GOOD
Change in the tenacity of tenacious users
BAD
GOOD
Pitbull: editorial vs metric (type weather)
13. 63% of bad cases
83% precision
BAD
GOOD
Change in the tenacity of tenacious users
Pitbull: editorial vs metric (type weather)
14. Change in the tenacity of tenacious users
BAD
GOOD
Pitbull: editorial vs metric (type reference)
15. Change in the tenacity of tenacious users
BAD
GOOD
BAD
GOOD
Pitbull: editorial vs metric (type reference)
16. 71% of bad cases
84% precision
BAD
GOOD
Change in the tenacity of tenacious users
Pitbull: editorial vs metric (type reference)
17. 17 SIGIR 2010 Industry Track Geneva, Switzerland
Summary
Tenacious users can be used to identify bad DDs
General method: usage mining on classes of users
Shoppers
Smart searchers
Click-a-lots / explorers
Leaders
Poodles?
etc.
General/shared taxonomy of users?