News Article Ranking: Leveraging the Wisdom of Bloggers
Richard McCreadie, Craig Macdonald & Iadh Ounis
Introduction
Background: Bloggers react to news as it happens
Thelwall explored how bloggers reacted to the London bombings
30% of bloggers blog on news-related topics (Technorati poll 2008)
Hence, the blogosphere is valuable as a source of news-related information
König et al. & Sayyadi et al. have exploited the blogosphere for event detection
[Figure: "Obama Victory". Axes: number of blog posts vs. day (November 2008)]
References: M. Thelwall, WWW06; König et al., SIGIR09; Sayyadi et al., ICWSM09
Introduction
Editorial News:
Every day, newspaper editors select articles for placement within their newspapers.
This can be seen as a ranking problem.
Rank articles by readership interest
[Figure: Newspaper Editor; Front Page, Page 2, ...]
We investigate how such a ranking can be approximated using evidence from the blogosphere
Talk Outline
Introduction
The News Article Ranking Problem
The Votes Approach
Evaluating Votes
Temporal Promotion
News Article Representation
Conclusions
News Article Ranking
Problem Definition: Rank news articles by their inherent importance.
Given a day of interest dQ, we wish to score each news article a by its predicted importance, score(a, dQ), using evidence from the blogosphere.
[Figure: for day dQ, the News Article Ranker assigns importance scores to news articles (example scores: 29, 23, 14, 13, 4, 4)]
The Votes Approach
Idea: The more blog posts about an article, the more important the subject must be.
Score by blog post volume
Approach: Two Stages:
1) Score each news article a for all days d based on related blog post volume for day d. News articles are represented by their headlines.
2) Given a query day dQ, rank the news articles A based on the score for each news article on day dQ, i.e. score(a, dQ)
-> a voting process
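The voting process can be written down compactly. This is a sketch, not necessarily the authors' exact formulation: R(a) (the blog posts retrieved for the headline of a) and day(p) (the publication day of post p) are symbols introduced here for illustration. With one vote per post:

```latex
\mathrm{score}(a, d_Q) \;=\; \bigl|\{\, p \in R(a) \;:\; \mathrm{day}(p) = d_Q \,\}\bigr|
```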
Votes Approach: Stage 1
Stage 1: Score days for each news story
For each news article a:
1) Use its representation (headline) as a query
2) Select the top 1000 blog posts for a
3) Each post votes for a day
4) Rank days by votes received
[Figure: Terrier produces a blog post ranking for a; votes are tallied per day (e.g. votes = 2, 1, 2, 2) to give a ranking of days for a]
Voting Model: Count*
* Craig Macdonald, PhD thesis 2009
Votes Approach: Stage 2
Stage 2: Rank news articles for day dQ
[Figure: Stage 1 gives each news article a its votes per day; for the query day (Day 2), the articles are ordered by their votes on that day to give the ranking of articles]
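The two stages can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the slides use Terrier for retrieval, whereas the `make_toy_retriever` helper below is a hypothetical stand-in that ranks posts by simple term overlap, and the names `score_days` and `rank_articles` are invented here. Each retrieved post casts one vote for its publication day (the Count voting model), and articles are then ordered by their votes on the query day.

```python
from collections import Counter
from datetime import date
from typing import Callable, Dict, List, Sequence, Tuple

# Assumed interface: (headline, top_k) -> publication days of the top_k posts.
Retriever = Callable[[str, int], Sequence[date]]


def make_toy_retriever(posts: Sequence[Tuple[date, str]]) -> Retriever:
    """Hypothetical stand-in for a real search engine: ranks by term overlap."""
    def retrieve(query: str, top_k: int) -> List[date]:
        terms = set(query.lower().split())
        scored = [(len(terms & set(text.lower().split())), day)
                  for day, text in posts]
        # Keep only posts sharing at least one term, best matches first.
        matching = sorted((sd for sd in scored if sd[0] > 0),
                          key=lambda sd: -sd[0])
        return [day for _, day in matching[:top_k]]
    return retrieve


def score_days(headline: str, retrieve: Retriever, top_k: int = 1000) -> Counter:
    """Stage 1: each retrieved post votes once for its publication day."""
    return Counter(retrieve(headline, top_k))


def rank_articles(headlines: Dict[str, str], retrieve: Retriever,
                  query_day: date, top_k: int = 1000) -> List[Tuple[str, int]]:
    """Stage 2: order articles by the votes they received on query_day."""
    scores = {article_id: score_days(headline, retrieve, top_k)[query_day]
              for article_id, headline in headlines.items()}
    # Highest vote count first; ties broken by article id for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
```

Calling `rank_articles(headlines, retrieve, query_day)` with a dictionary of article ids to headlines returns `(article_id, votes)` pairs, highest first. Swapping the toy retriever for a real engine only requires a function with the same `(headline, top_k)` signature.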
Evaluating Votes
Hypothesis: The volume of relevant blog posts published on a news article is a strong indicator of that article's importance (from an editor's perspective).
Research Questions: Can the number of blog posts related to a news article published on day dQ provide a ranking comparable to that which an editor might make?
Task
TREC 2009 Blog Track: top news stories identification task
Rank news articles by predicted importance

Editor's Notes

  • #11: The more blog posts, the more important the news article. Approximates the editor's ranking.
  • #16: Displays performance. Green: best TREC systems. Blue: Votes approach.
  • #17: Summarise.