際際滷

際際滷Share a Scribd company logo
Linked Open Data
Presented for the Penn Humanities Forum
by Corey A Harper
2013-04-11
Tools and techniques for putting
Metadata, Context, & Narrative
On the Web
@chrpr[際際滷share]
2014-04-11 Harper - Penn Humanities Forum - LODLAM 2
The Web Becomes Semantic

Originally:

Metadata about Web things (documents)

Eventually:

Metadata about all sorts of things

And about relationships between things

TBLs original vision (Weaving the Web  1999)

Then: Focus on Machine Reasoning

Scientific American Article

Now: Focus on things & links

Reasoning & Inferencing less central
2014-04-11 Harper - Penn Humanities Forum - LODLAM 3
Semantic Web Terminology

Resource: Any thing

Class: Abstraction of a type of thing

Individual: An instance of a class

Property: An attribute of an individual
Statement/Triple:

A Resource (subject)

A Property (predicate / verb)

A Value (object) - Nodes

Graph: Visual Representation of statements

Ontology/Vocabulary: A domain specific collection
of classes and properties
2014-04-11 Harper - Penn Humanities Forum - LODLAM 4
2014-04-11 Harper - Penn Humanities Forum - LODLAM 5
Linked Open Data

Use URIs as names for things

Use HTTP URIs so that people can look up those
names.

When someone looks up a URI, provide useful
information.

Include links to other URIs. so that they can
discover more things.
http://www.w3.org/DesignIssues/LinkedData.html
2014-04-11 Harper - Penn Humanities Forum - LODLAM 6
Linked Data

Metadata as a Graph

Typed things, named by URIs

The relationships between those things,
also built on URIs

Ease of integration *across* data
sources  merging graphs
2014-04-11 Harper - Penn Humanities Forum - LODLAM 7
Publish Publish Publish!
http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
2014-04-11 Harper - Penn Humanities Forum - LODLAM 8
2014-04-11 Harper - Penn Humanities Forum - LODLAM 9
DBpedia
Structured Wikipedia Data

Partial basis in data entry conventions

InfoBoxs, and InfoBox Templates

Metadata Entry Format

Partial source of Ontology

Class Structure

Vocabulary Design
2014-04-11 Harper - Penn Humanities Forum - LODLAM 10
DBpedia

3.4 Million things described

Ontology based on infoboxes

1.5 million things classified

http://wiki.dbpedia.org/Ontology

Approx. 50,000 Properties

Approx. 1,200 defined in ontology
2014-04-11 Harper - Penn Humanities Forum - LODLAM 11
2014-04-11 Harper - Penn Humanities Forum - LODLAM 12
2014-04-11 Harper - Penn Humanities Forum - LODLAM 13
2014-04-11 Harper - Penn Humanities Forum - LODLAM 14
http://thinkbase.cs.auckland.ac.nz/start.jsp
2014-04-11 Harper - Penn Humanities Forum - LODLAM 15
Google Knowledge Graph
2014-04-11 Harper - Penn Humanities Forum - LODLAM 16
RelFinder
http://www.visualdataweb.org/relfinder.php
2014-04-11 Harper - Penn Humanities Forum - LODLAM 17
RelFinder
http://www.visualdataweb.org/relfinder.php
2014-04-11 Harper - Penn Humanities Forum - LODLAM 18
Linked Jazz
http://linkedjazz.org/network/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 19
Social Networks of Archival Context
Image From: http://inkdroid.org/journal/2010/08/12/archival-context-on-the-web/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 20
Linking Lives  Screenshots from P. Johnston
2014-04-11 Harper - Penn Humanities Forum - LODLAM 21
Linking Lives  Screenshots from P. Johnston
2014-04-11 Harper - Penn Humanities Forum - LODLAM 22
http://viewshare.org/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 23
DM2E
http://dm2e.eu/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 24
Pundit
Imagefrom:http://summit2013.lodlam.net/2013/04/03/pundit/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 25
Annotations

The Pundit (thepund.it)

Hypothesis: Peer Review for the Web (hypthos.is)

Open Annotation Collaboration

Distributed bibliographic control environment

Focus on identification over description
In short, by treating values as non-literal resources and
assigning URIs to them we give ourselves (and others) the
hooks on which to hang further descriptions. - Andy Powell
2014-04-11 Harper - Penn Humanities Forum - LODLAM 26
Context
Narrative
Story telling
The Library's story,
and the Archives story,
but also
2014-04-11 Harper - Penn Humanities Forum - LODLAM 27
Users stories
Scholars' stories
Adding context through recombinant metadata
2014-04-11 Harper - Penn Humanities Forum - LODLAM 28
Scholars & Users Stories  Tim Sherratt
(@wragge)
Also: http://discontents.com.au/a-map-and-some-pins-open-data-and-unlimited-horizons/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 29
Linked Data Based UI Design
For Boutique Collections
2014-04-11 Harper - Penn Humanities Forum - LODLAM 30
2014-04-11 Harper - Penn Humanities Forum - LODLAM 31
2014-04-11 Harper - Penn Humanities Forum - LODLAM 32
2014-04-11 Harper - Penn Humanities Forum - LODLAM 33
FuzzyWuzzy & SeatGeek!
FuzzyWuzzyAwesomeLibraryfromSeatGeek
https://github.com/seatgeek/fuzzywuzzy
http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python
2014-04-11 Harper - Penn Humanities Forum - LODLAM 34
際際滷 courtesy of Doug Oard
University of Maryland
2014-04-11 Harper - Penn Humanities Forum - LODLAM 35
Reconciliation & NER
http://freeyourmetadata.org/
Watch for their book
http://www.amazon.com/Linked-Data-Libraries-Archives-Museums/dp/1856049647/
2014-04-11 Harper - Penn Humanities Forum - LODLAM 37
2014-04-11 Harper - Penn Humanities Forum - LODLAM 38
Open Refine RDF Skeleton
2014-04-11 Harper - Penn Humanities Forum - LODLAM 39
2014-04-11 Harper - Penn Humanities Forum - LODLAM 40
And onward...
 Rethinking UI Design
 Aggregate data from more sources
 Provide more context
 Building proofs of concept
 Improving Search Engine Optimization
(JSON-LD, Schema.org, RDFa, &c.)
 Use cases!
 Experimentation!
2014-04-11 Harper - Penn Humanities Forum - LODLAM 41
Thanks!
corey.harper@nyu.edu
212.998.2479
@chrpr

More Related Content

Charper.penn.20140411

  • 1. Linked Open Data Presented for the Penn Humanities Forum by Corey A Harper 2013-04-11 Tools and techniques for putting Metadata, Context, & Narrative On the Web @chrpr[際際滷share]
  • 2. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 2 The Web Becomes Semantic Originally: Metadata about Web things (documents) Eventually: Metadata about all sorts of things And about relationships between things TBLs original vision (Weaving the Web 1999) Then: Focus on Machine Reasoning Scientific American Article Now: Focus on things & links Reasoning & Inferencing less central
  • 3. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 3 Semantic Web Terminology Resource: Any thing Class: Abstraction of a type of thing Individual: An instance of a class Property: An attribute of an individual Statement/Triple: A Resource (subject) A Property (predicate / verb) A Value (object) - Nodes Graph: Visual Representation of statements Ontology/Vocabulary: A domain specific collection of classes and properties
  • 4. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 4
  • 5. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 5 Linked Open Data Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things. http://www.w3.org/DesignIssues/LinkedData.html
  • 6. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 6 Linked Data Metadata as a Graph Typed things, named by URIs The relationships between those things, also built on URIs Ease of integration *across* data sources merging graphs
  • 7. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 7 Publish Publish Publish! http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
  • 8. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 8
  • 9. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 9 DBpedia Structured Wikipedia Data Partial basis in data entry conventions InfoBoxs, and InfoBox Templates Metadata Entry Format Partial source of Ontology Class Structure Vocabulary Design
  • 10. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 10 DBpedia 3.4 Million things described Ontology based on infoboxes 1.5 million things classified http://wiki.dbpedia.org/Ontology Approx. 50,000 Properties Approx. 1,200 defined in ontology
  • 11. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 11
  • 12. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 12
  • 13. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 13
  • 14. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 14 http://thinkbase.cs.auckland.ac.nz/start.jsp
  • 15. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 15 Google Knowledge Graph
  • 16. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 16 RelFinder http://www.visualdataweb.org/relfinder.php
  • 17. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 17 RelFinder http://www.visualdataweb.org/relfinder.php
  • 18. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 18 Linked Jazz http://linkedjazz.org/network/
  • 19. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 19 Social Networks of Archival Context Image From: http://inkdroid.org/journal/2010/08/12/archival-context-on-the-web/
  • 20. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 20 Linking Lives Screenshots from P. Johnston
  • 21. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 21 Linking Lives Screenshots from P. Johnston
  • 22. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 22 http://viewshare.org/
  • 23. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 23 DM2E http://dm2e.eu/
  • 24. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 24 Pundit Imagefrom:http://summit2013.lodlam.net/2013/04/03/pundit/
  • 25. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 25 Annotations The Pundit (thepund.it) Hypothesis: Peer Review for the Web (hypthos.is) Open Annotation Collaboration Distributed bibliographic control environment Focus on identification over description In short, by treating values as non-literal resources and assigning URIs to them we give ourselves (and others) the hooks on which to hang further descriptions. - Andy Powell
  • 26. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 26 Context Narrative Story telling The Library's story, and the Archives story, but also
  • 27. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 27 Users stories Scholars' stories Adding context through recombinant metadata
  • 28. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 28 Scholars & Users Stories Tim Sherratt (@wragge) Also: http://discontents.com.au/a-map-and-some-pins-open-data-and-unlimited-horizons/
  • 29. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 29 Linked Data Based UI Design For Boutique Collections
  • 30. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 30
  • 31. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 31
  • 32. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 32
  • 33. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 33 FuzzyWuzzy & SeatGeek! FuzzyWuzzyAwesomeLibraryfromSeatGeek https://github.com/seatgeek/fuzzywuzzy http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python
  • 34. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 34 際際滷 courtesy of Doug Oard University of Maryland
  • 35. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 35 Reconciliation & NER http://freeyourmetadata.org/ Watch for their book http://www.amazon.com/Linked-Data-Libraries-Archives-Museums/dp/1856049647/
  • 36. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 37
  • 37. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 38 Open Refine RDF Skeleton
  • 38. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 39
  • 39. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 40 And onward... Rethinking UI Design Aggregate data from more sources Provide more context Building proofs of concept Improving Search Engine Optimization (JSON-LD, Schema.org, RDFa, &c.) Use cases! Experimentation!
  • 40. 2014-04-11 Harper - Penn Humanities Forum - LODLAM 41 Thanks! corey.harper@nyu.edu 212.998.2479 @chrpr