ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Integration of oreChemwith the eCrystals repository for crystal structuresMark Borkum, Simon Coles and Jeremy Frey15 September 2010
OverviewMotivationImplementationDiscussion and Summary2
Current Practice in CrystallographyCrystallography data is highly structuredThe de facto standard adopted by the community is the CIF (Crystallographic Information File)Relatively few crystal structures are openly published3http://www.rin.ac.uk/our-work/data-management-and-curation/share-or-not-share-research-data-outputs
Open Access JournalsAdvantages:Rapid publicationHighly citedData is available to downloadDisadvantages:Electronic onlyNot all data is of primary importance to the underlying chemistryBy-products, unexpected results, tracking reactions, etc.4
Crystallography and Fraud5
The eCrystals FederationJISC project to establish a network of crystallography resources on the Internet, with metadata that is harvested by a number of aggregation servicesLed by the UK National Crystallography Service (NCS)With core partners at UKOLN, the Digital Curation Centre, and the Unilever Centre for Molecular Science Informatics6
eCrystals ¨C University of SouthamptonLocated @ http://ecrystals.chem.soton.ac.ukArchive for crystal structures that are generated by:Southampton Chemical Crystallography GroupUK National Crystallography Service (NCS)Modified version of EPrints 3.1OAI-PMH compliantExtensible platform (with plug-ins architecture)7
What is an eCrystal?¡°all the fundamental and derived data resulting from a single crystal X-ray structure determination¡±¡°the information supplied should enable any reader to check the reliability and validity¡±8http://www.ukoln.ac.uk/projects/ebank-uk/images/collage-web.gif
The Scientific Web9
The Data Deluge10In Haiku:Lots of producers;Generating more datathan ever before.40 years ago, a PhD student would determine 3 structures over the entire course of their study!The Great Wave off Kanagawa by Katsushika Hokusai
ProvenanceThe 7 W¡¯s [Goble 2002]Who, What, Where,  Why, When, Which, & (W)HowThe Why aspect is usually ignored ?Rational, intent, hypothesis, protocol, methodology, workflow, etc.11¡°Diana and Actaeon by Titian has a full provenance covering its passage through several owners and four countries since it was painted for Philip II of Spain in the 1550s.¡±Source: http://en.wikipedia.org/wiki/Diana_and_Actaeon_%28Titian%29
¡°In theory, there is no difference between theory and practice.But, in practice, there is.¡± Unknown (possibly Yogi Berra)12
Why ¡°Why¡± MattersIt is the reason for the data¡¯s existenceIt gives us the ability to interpret the data in the correct contextIt allows us to align the data with the big picture13http://www.myexperiment.org/workflows/16.html
The oreChem Core OntologyDescribes three concepts:The methodology (planned method) of a scientific experimentThe enactment of methodologiesThe provenance of realised artefacts14
Methodology (Planned Method)The ¡°plan¡± is modelled as a directed graphTwo node types:Plan Stagedescription of an activity that will be enactedPlan Objectdescription of an artefact that will be realised15
Enactment (of a Methodology)Each ¡°run¡± (of a plan) is modelled as a directed graph Two node types:Stagedescription of an activity that has been enactedObjectdescription of an artefact that has been realised16
ProvenanceProspectiveThe plan describes a scientific experiment that will be enactedRetrospectiveThe run describes a scientific experiment that hasbeen enactedEvery ¡®run thing¡¯ is linked to exactly one ¡®plan thing¡¯17
oreChem Plug-in for eCrystalsThree components:orechem:Plan (the eCrystals methodology) ¡°eCrystal?orechem:Run¡± mapping ¡°orechem:Run? provenance graph¡± pipeline18
The eCrystals Methodology19BeforeAfter
Example: eCrystal #643BeforeAfter20
SPARQL RequestPREFIX orechem:   <http://www.openarchives.org/2010/05/24-orechem-ns#>PREFIX ecrystals: <http://ecrystals.chem.soton.ac.uk/plan.rdf#>SELECT ?run ?raw ?derived ?reportedWHERE {  ?run a orechem:Run ;orechem:hasPlanecrystals:Ecrystals ;orechem:containsObject ?raw ;orechem:containsObject ?derived ;orechem:containsObject ?reported .  ?raw a orechem:File ;orechem:hasPlanObjectecrystals:HKL .  ?derived a orechem:File ;orechem:derivedFrom ?raw .  ?reported a orechem:File ;orechem:hasPlanObjectecrystals:CIF ;orechem:derivedFrom ?derived .}21
SPARQL Response (for eCrystal #643)22?run?reported?derived?raw
Summary<summary/>23
AcknowledgmentsoreChem is funded by Microsoft External ResearcheCrystals is funded by both EPSRC and JISCThe oreChem project team:Nico Adams, Mark Borkum, William Brouwer, RameswaraSashiKiranChalla, Simon Coles, Nick Day, Jim Downing, Jeremy Frey, C. Lee Giles, Carl Lagoze (PI), Na Li, PrasenjitMitra, Karl Meuller, Peter Murray-Rust, Marlon Pierce, Joe Townsend, and Theresa Velden.24
25#ahm2010#ahm#ahm10#pch2010http://pegasus.chem.soton.ac.uk#ahm2010 until 11am Wed 15 Sept 2010

More Related Content

Viewers also liked (8)

Change
ChangeChange
Change
frank tan
?
The Power Of Multiplication
The Power Of MultiplicationThe Power Of Multiplication
The Power Of Multiplication
frank tan
?
Soo presentation
Soo presentationSoo presentation
Soo presentation
frank tan
?
Presentatie webrichtlijnen
Presentatie webrichtlijnenPresentatie webrichtlijnen
Presentatie webrichtlijnen
Tjitte Folkertsma
?
FAS: Shop2market over Conversie Attributie
FAS: Shop2market over Conversie AttributieFAS: Shop2market over Conversie Attributie
FAS: Shop2market over Conversie Attributie
Tjitte Folkertsma
?
New Excited Info
New Excited InfoNew Excited Info
New Excited Info
frank tan
?

Similar to Integration of oreChem with the eCrystals repository for crystal structures (20)

The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
ManjulaPatel
?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Pieter Pauwels
?
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Paragon_Science_Inc
?
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
Sarah Jones
?
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
Andrew Sallans
?
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
Ian Foster
?
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
Herbert Van de Sompel
?
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
guest0dc425
?
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
EOSC-hub project
?
Perx and TechXtra
Perx and TechXtraPerx and TechXtra
Perx and TechXtra
Roddy MacLeod
?
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC
?
On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
Nina Jeliazkova
?
Showcasing research data tools
Showcasing research data toolsShowcasing research data tools
Showcasing research data tools
Jisc RDM
?
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
Carole Goble
?
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean Observatories
Larry Smarr
?
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair"
OpenAIRE
?
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
Cornelius Puschmann
?
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
Scott Edmunds
?
Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016
Jisc
?
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on them
Ross Mounce
?
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
ManjulaPatel
?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Datasalon6 2011 - "Rise of the robo scientists": where is data coming from?
Pieter Pauwels
?
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Paragon_Science_Inc
?
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
Sarah Jones
?
Understanding the Big Picture of e-Science
Understanding the Big Picture of e-ScienceUnderstanding the Big Picture of e-Science
Understanding the Big Picture of e-Science
Andrew Sallans
?
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
Ian Foster
?
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
Herbert Van de Sompel
?
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
guest0dc425
?
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
Gergely Sipos, Claudio Cacciari: Welcome and mapping the landscape: EOSC-hub ...
EOSC-hub project
?
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC Overview - ExPaNDS kick-off meeting - September 2019
PaNOSC
?
On chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurementsOn chemical structures, substances, nanomaterials and measurements
On chemical structures, substances, nanomaterials and measurements
Nina Jeliazkova
?
Showcasing research data tools
Showcasing research data toolsShowcasing research data tools
Showcasing research data tools
Jisc RDM
?
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
Carole Goble
?
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean Observatories
Larry Smarr
?
WEBINAR: "How to manage your data to make them open and fair"
WEBINAR:  "How to manage your data to make them open and fair"  WEBINAR:  "How to manage your data to make them open and fair"
WEBINAR: "How to manage your data to make them open and fair"
OpenAIRE
?
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
Cornelius Puschmann
?
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
Scott Edmunds
?
Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016Showcasing research data tools - Jisc Digifest 2016
Showcasing research data tools - Jisc Digifest 2016
Jisc
?
Museum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on themMuseum impact: linking-up specimens with research published on them
Museum impact: linking-up specimens with research published on them
Ross Mounce
?

Recently uploaded (20)

AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramentoAIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
Alessandro Bogliolo
?
UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1
DianaGray10
?
DealBook of Ukraine: 2025 edition | AVentures Capital
DealBook of Ukraine: 2025 edition | AVentures CapitalDealBook of Ukraine: 2025 edition | AVentures Capital
DealBook of Ukraine: 2025 edition | AVentures Capital
Yevgen Sysoyev
?
Computational Photography: How Technology is Changing Way We Capture the World
Computational Photography: How Technology is Changing Way We Capture the WorldComputational Photography: How Technology is Changing Way We Capture the World
Computational Photography: How Technology is Changing Way We Capture the World
HusseinMalikMammadli
?
FTSG TRENDS REPORT 2025 as seen at #SXSW2025
FTSG TRENDS REPORT 2025 as seen  at #SXSW2025FTSG TRENDS REPORT 2025 as seen  at #SXSW2025
FTSG TRENDS REPORT 2025 as seen at #SXSW2025
HUB INSTITUTE
?
Q4 2024 Earnings and Investor Presentation
Q4 2024 Earnings and Investor PresentationQ4 2024 Earnings and Investor Presentation
Q4 2024 Earnings and Investor Presentation
Dropbox
?
Transform Your Future with Front-End Development Training
Transform Your Future with Front-End Development TrainingTransform Your Future with Front-End Development Training
Transform Your Future with Front-End Development Training
Vtechlabs
?
DevNexus - Building 10x Development Organizations.pdf
DevNexus - Building 10x Development Organizations.pdfDevNexus - Building 10x Development Organizations.pdf
DevNexus - Building 10x Development Organizations.pdf
Justin Reock
?
Understanding Traditional AI with Custom Vision & MuleSoft.pptx
Understanding Traditional AI with Custom Vision & MuleSoft.pptxUnderstanding Traditional AI with Custom Vision & MuleSoft.pptx
Understanding Traditional AI with Custom Vision & MuleSoft.pptx
shyamraj55
?
Early Adopter's Guide to AI Moderation (Preview)
Early Adopter's Guide to AI Moderation (Preview)Early Adopter's Guide to AI Moderation (Preview)
Early Adopter's Guide to AI Moderation (Preview)
nick896721
?
The Future of Repair: Transparent and Incremental by Botond De?nes
The Future of Repair: Transparent and Incremental by Botond De?nesThe Future of Repair: Transparent and Incremental by Botond De?nes
The Future of Repair: Transparent and Incremental by Botond De?nes
ScyllaDB
?
Integrated Operating Window - A Gateway to PM
Integrated Operating Window - A Gateway to PMIntegrated Operating Window - A Gateway to PM
Integrated Operating Window - A Gateway to PM
Farhan Tariq
?
BoxLang JVM Language : The Future is Dynamic
BoxLang JVM Language : The Future is DynamicBoxLang JVM Language : The Future is Dynamic
BoxLang JVM Language : The Future is Dynamic
Ortus Solutions, Corp
?
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
?
Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Stronger Together: Combining Data Quality and Governance for Confident AI & A...Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Precisely
?
Technology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptxTechnology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptx
kaylagaze
?
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
Safe Software
?
Technology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptxTechnology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptx
kaylagaze
?
Brave Browser Crack 1.45.133 Activated 2025
Brave Browser Crack 1.45.133 Activated 2025Brave Browser Crack 1.45.133 Activated 2025
Brave Browser Crack 1.45.133 Activated 2025
kherorpacca00126
?
Unlock AI Creativity: Image Generation with DALL¡¤E
Unlock AI Creativity: Image Generation with DALL¡¤EUnlock AI Creativity: Image Generation with DALL¡¤E
Unlock AI Creativity: Image Generation with DALL¡¤E
Expeed Software
?
AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramentoAIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
AIXMOOC 2.3 - Modelli di reti neurali con esperimenti di addestramento
Alessandro Bogliolo
?
UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1UiPath Automation Developer Associate Training Series 2025 - Session 1
UiPath Automation Developer Associate Training Series 2025 - Session 1
DianaGray10
?
DealBook of Ukraine: 2025 edition | AVentures Capital
DealBook of Ukraine: 2025 edition | AVentures CapitalDealBook of Ukraine: 2025 edition | AVentures Capital
DealBook of Ukraine: 2025 edition | AVentures Capital
Yevgen Sysoyev
?
Computational Photography: How Technology is Changing Way We Capture the World
Computational Photography: How Technology is Changing Way We Capture the WorldComputational Photography: How Technology is Changing Way We Capture the World
Computational Photography: How Technology is Changing Way We Capture the World
HusseinMalikMammadli
?
FTSG TRENDS REPORT 2025 as seen at #SXSW2025
FTSG TRENDS REPORT 2025 as seen  at #SXSW2025FTSG TRENDS REPORT 2025 as seen  at #SXSW2025
FTSG TRENDS REPORT 2025 as seen at #SXSW2025
HUB INSTITUTE
?
Q4 2024 Earnings and Investor Presentation
Q4 2024 Earnings and Investor PresentationQ4 2024 Earnings and Investor Presentation
Q4 2024 Earnings and Investor Presentation
Dropbox
?
Transform Your Future with Front-End Development Training
Transform Your Future with Front-End Development TrainingTransform Your Future with Front-End Development Training
Transform Your Future with Front-End Development Training
Vtechlabs
?
DevNexus - Building 10x Development Organizations.pdf
DevNexus - Building 10x Development Organizations.pdfDevNexus - Building 10x Development Organizations.pdf
DevNexus - Building 10x Development Organizations.pdf
Justin Reock
?
Understanding Traditional AI with Custom Vision & MuleSoft.pptx
Understanding Traditional AI with Custom Vision & MuleSoft.pptxUnderstanding Traditional AI with Custom Vision & MuleSoft.pptx
Understanding Traditional AI with Custom Vision & MuleSoft.pptx
shyamraj55
?
Early Adopter's Guide to AI Moderation (Preview)
Early Adopter's Guide to AI Moderation (Preview)Early Adopter's Guide to AI Moderation (Preview)
Early Adopter's Guide to AI Moderation (Preview)
nick896721
?
The Future of Repair: Transparent and Incremental by Botond De?nes
The Future of Repair: Transparent and Incremental by Botond De?nesThe Future of Repair: Transparent and Incremental by Botond De?nes
The Future of Repair: Transparent and Incremental by Botond De?nes
ScyllaDB
?
Integrated Operating Window - A Gateway to PM
Integrated Operating Window - A Gateway to PMIntegrated Operating Window - A Gateway to PM
Integrated Operating Window - A Gateway to PM
Farhan Tariq
?
BoxLang JVM Language : The Future is Dynamic
BoxLang JVM Language : The Future is DynamicBoxLang JVM Language : The Future is Dynamic
BoxLang JVM Language : The Future is Dynamic
Ortus Solutions, Corp
?
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
?
Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Stronger Together: Combining Data Quality and Governance for Confident AI & A...Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Stronger Together: Combining Data Quality and Governance for Confident AI & A...
Precisely
?
Technology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptxTechnology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptx
kaylagaze
?
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
[Webinar] Scaling Made Simple: Getting Started with No-Code Web Apps
Safe Software
?
Technology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptxTechnology use over time and its impact on consumers and businesses.pptx
Technology use over time and its impact on consumers and businesses.pptx
kaylagaze
?
Brave Browser Crack 1.45.133 Activated 2025
Brave Browser Crack 1.45.133 Activated 2025Brave Browser Crack 1.45.133 Activated 2025
Brave Browser Crack 1.45.133 Activated 2025
kherorpacca00126
?
Unlock AI Creativity: Image Generation with DALL¡¤E
Unlock AI Creativity: Image Generation with DALL¡¤EUnlock AI Creativity: Image Generation with DALL¡¤E
Unlock AI Creativity: Image Generation with DALL¡¤E
Expeed Software
?

Integration of oreChem with the eCrystals repository for crystal structures

  • 1. Integration of oreChemwith the eCrystals repository for crystal structuresMark Borkum, Simon Coles and Jeremy Frey15 September 2010
  • 3. Current Practice in CrystallographyCrystallography data is highly structuredThe de facto standard adopted by the community is the CIF (Crystallographic Information File)Relatively few crystal structures are openly published3http://www.rin.ac.uk/our-work/data-management-and-curation/share-or-not-share-research-data-outputs
  • 4. Open Access JournalsAdvantages:Rapid publicationHighly citedData is available to downloadDisadvantages:Electronic onlyNot all data is of primary importance to the underlying chemistryBy-products, unexpected results, tracking reactions, etc.4
  • 6. The eCrystals FederationJISC project to establish a network of crystallography resources on the Internet, with metadata that is harvested by a number of aggregation servicesLed by the UK National Crystallography Service (NCS)With core partners at UKOLN, the Digital Curation Centre, and the Unilever Centre for Molecular Science Informatics6
  • 7. eCrystals ¨C University of SouthamptonLocated @ http://ecrystals.chem.soton.ac.ukArchive for crystal structures that are generated by:Southampton Chemical Crystallography GroupUK National Crystallography Service (NCS)Modified version of EPrints 3.1OAI-PMH compliantExtensible platform (with plug-ins architecture)7
  • 8. What is an eCrystal?¡°all the fundamental and derived data resulting from a single crystal X-ray structure determination¡±¡°the information supplied should enable any reader to check the reliability and validity¡±8http://www.ukoln.ac.uk/projects/ebank-uk/images/collage-web.gif
  • 10. The Data Deluge10In Haiku:Lots of producers;Generating more datathan ever before.40 years ago, a PhD student would determine 3 structures over the entire course of their study!The Great Wave off Kanagawa by Katsushika Hokusai
  • 11. ProvenanceThe 7 W¡¯s [Goble 2002]Who, What, Where, Why, When, Which, & (W)HowThe Why aspect is usually ignored ?Rational, intent, hypothesis, protocol, methodology, workflow, etc.11¡°Diana and Actaeon by Titian has a full provenance covering its passage through several owners and four countries since it was painted for Philip II of Spain in the 1550s.¡±Source: http://en.wikipedia.org/wiki/Diana_and_Actaeon_%28Titian%29
  • 12. ¡°In theory, there is no difference between theory and practice.But, in practice, there is.¡± Unknown (possibly Yogi Berra)12
  • 13. Why ¡°Why¡± MattersIt is the reason for the data¡¯s existenceIt gives us the ability to interpret the data in the correct contextIt allows us to align the data with the big picture13http://www.myexperiment.org/workflows/16.html
  • 14. The oreChem Core OntologyDescribes three concepts:The methodology (planned method) of a scientific experimentThe enactment of methodologiesThe provenance of realised artefacts14
  • 15. Methodology (Planned Method)The ¡°plan¡± is modelled as a directed graphTwo node types:Plan Stagedescription of an activity that will be enactedPlan Objectdescription of an artefact that will be realised15
  • 16. Enactment (of a Methodology)Each ¡°run¡± (of a plan) is modelled as a directed graph Two node types:Stagedescription of an activity that has been enactedObjectdescription of an artefact that has been realised16
  • 17. ProvenanceProspectiveThe plan describes a scientific experiment that will be enactedRetrospectiveThe run describes a scientific experiment that hasbeen enactedEvery ¡®run thing¡¯ is linked to exactly one ¡®plan thing¡¯17
  • 18. oreChem Plug-in for eCrystalsThree components:orechem:Plan (the eCrystals methodology) ¡°eCrystal?orechem:Run¡± mapping ¡°orechem:Run? provenance graph¡± pipeline18
  • 21. SPARQL RequestPREFIX orechem: <http://www.openarchives.org/2010/05/24-orechem-ns#>PREFIX ecrystals: <http://ecrystals.chem.soton.ac.uk/plan.rdf#>SELECT ?run ?raw ?derived ?reportedWHERE { ?run a orechem:Run ;orechem:hasPlanecrystals:Ecrystals ;orechem:containsObject ?raw ;orechem:containsObject ?derived ;orechem:containsObject ?reported . ?raw a orechem:File ;orechem:hasPlanObjectecrystals:HKL . ?derived a orechem:File ;orechem:derivedFrom ?raw . ?reported a orechem:File ;orechem:hasPlanObjectecrystals:CIF ;orechem:derivedFrom ?derived .}21
  • 22. SPARQL Response (for eCrystal #643)22?run?reported?derived?raw
  • 24. AcknowledgmentsoreChem is funded by Microsoft External ResearcheCrystals is funded by both EPSRC and JISCThe oreChem project team:Nico Adams, Mark Borkum, William Brouwer, RameswaraSashiKiranChalla, Simon Coles, Nick Day, Jim Downing, Jeremy Frey, C. Lee Giles, Carl Lagoze (PI), Na Li, PrasenjitMitra, Karl Meuller, Peter Murray-Rust, Marlon Pierce, Joe Townsend, and Theresa Velden.24