This document summarizes the BBC News Labs' involvement with Seedhack 4.0. The BBC News Labs drives and promotes innovation in news through events like Seedhack and pilots projects like JUICER. JUICER extracts concepts from news articles and matches them to a knowledge base, allowing the annotated content and entities to be exposed through an API and remixed for other uses. The BBC News Labs is providing this semantic annotation of over 500,000 news articles from multiple sources to Seedhack participants.
1 of 12
Download to read offline
More Related Content
BBC NEWS LABS - the story & the Juicer - for SeedHack 4.0
5. 10 News Orgs
6 Universities
27 Prototypes
newsHACK.co.uk
Next #newsHACK
in March 2014.
6. What are we bringing to Seedhack?
News Content
Tagged with Linked Data concepts
The Juicer
1
2
3
4
5
6
Get
Content
Extract
Concepts
Match to
DBpedia
Annotate
Content
Push to
Triplestore
Expose
via
API
8. JUICER - content for REMIXING
1. 500k articles
2. Since 2001: BBC News & Sport
3. Since August: SKY, Mirror, Independent, Huffington
Post, Guardian, et al
4. Content is tagged with Semantic annotation
5. And you can use DBPedia ontology for inferencing
9. JUICER THINGS in that content
1.
2.
3.
4.
32,705 unique People
12,220 unique Organisations
14,520 unique Places
90,250 unique things (not People, Places, Orgs)
Thats 150,000 real world things that you can use...
10. JUICER STORYLINE
1. Editorially Curated News Stories
2. 300 unique events & storylines
3. For more info see:1. http://www.youtube.com/watch?v=6rbycvC2zzs
2. http://www.slideshare.net/BBCnewslabs/storyline-fornewshack-2013-jeremy-tarling
3. Both by @jeremytarling