際際滷

際際滷Share a Scribd company logo
A semantic query interface for the OGO platform Jos辿 Antonio Mi単arro-Gim辿nez (jose.minyarro@um.es)  Mikel Ega単a Aranguren, Ph.D. (mikel.egana.aranguren@gmail.com) Francisco Garc鱈a-S叩nchez, Ph.D. (frgarcia@um.es) Jesualdo Tom叩s Fern叩ndez-Breis, Ph.D. (jfernand@um.es) Faculty of Computer Science  University of Murcia  Spain ITBAM (DEXA) Bilbo 2010 http://tinyurl.com/35amhn6
Overview Orthologs Information about orthologs and diseases OGO system A semantic query interface for the OGO system Sample query
Ortholog sequences http://www.stanford.edu/group/pandegroup/folding/education/h.htm
Ortholog sequences Trait Trait
Orthologs and genetic diseases Homologene KOG Inparanoid OrthoMCL Online Mendelian Inheritance in Man (OMIM)  Gene 1  Disease Gene 2  Orthologs
OGO system
OGO ontology
OGO ontology: imported ontologies Gene Ontology (OBOF): molecular function, biological process and cellular component of gene products Evidence Codes Ontology (Candidate OBOF): GO annotations evidence codes OBO Relationship Types (Candidate OBOF):  Gene product  participates in  some (molecular function or biological process) Gene product  located in  some cellular component NCBI taxonomy: organisms classification
Imported ontologies: OWL punning
OGO ontology: mappings to OMIM
Implementation of the OGO system
Interfaces of the OGO system Keyword based querying Semantic querying
Semantic interface
Semantic interface
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ? @prefix ncbi: <http://um.es/ncbi.owl>. @prefix ogo: <http://miuras.inf.um.es/ontologies/OGO.owl>. SELECT ?Gene_0 ?Genetic_disease_1 WHERE  { ?Gene_0 ogo:fromSpecies ncbi:NCBI_10116 ?Genetic_disease_1 ogo:Name ?literal_4 . FILTER  (regex(?literal_4,&quot;Prostate cancer, susceptibility to&quot;)) . ?Genetic_disease_1 ogo:causedBy ?Gene_2 . ?Cluster_of_Orthologous_genes_3 ogo:hasOrthologous ?Gene_2 . ?Cluster_of_Orthologous_genes_3 ogo:hasOrthologous ?Gene_0 . }
Sample query Ortholog genes of the gene that causes prostate cancer on  Rattus norvegicus ?
Query grammar Query::= &quot;SELECT&quot; ListVar (WhereClause)? ListVar::=Var (Var)* WhereClause::=&quot;WHERE {&quot; ConditionClause (ConditionClause)* &quot;}&quot; ConditionClause::=[VarCondition | LiteralCondition] &quot;.&quot; VarCondition::=[Var | Individual] Property [Var | Individual] LiteralCondition::=[Var | Individual] Property [Var | Individual] &quot;.&quot; &quot;FILTER (regex (&quot; Var &quot;,&quot; Literal &quot;))&quot; Var -> This term represents a variable in the query which can be matched to any concept or individual in the ontology. Individual -> This term represents a concept or individual identied by an URI in the ontology. Property -> This term represents a relationship or property identied by an URI in the ontology. Literal -> This term represents any data value dened by the user.
Future plans OWL reasoning for querying (OWL 2 QL?) Pellet Integrity Constraint Validator (Pellet ICV): OWL as schema language for RDF (CWA)  Check the gathered information More bio-ontologies Clinical archetypes for querying (ISO 13606): exchange of ortholog/disease information in a standard biomedical research setting
Conclusions Orthologs and diseases: new hypotheses OGO provides a resource for exploiting such combined information Semantic query interface: Complex queries easily (No SPARQL syntax) http://miuras.inf.um.es/~ogo/
Acknowledgements Spanish Ministry for Science and Education (grant TSI2007-66575-C02-02)  Comunidad Aut坦noma de la Regi坦n de Murcia (grant BIO-TEC 06/01-0005) Fundaci坦n S辿neca, Servicio de Empleo y Formaci坦n (grant 07836/BPS/07)

More Related Content

What's hot (20)

Genentech icgc 2015
Genentech icgc 2015Genentech icgc 2015
Genentech icgc 2015
Neuro, McGill University
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
Jeremy Yang
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
Jeremy Yang
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
Chris Mungall
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
adcobb
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
mhaendel
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
Elena S端gis
Use of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosisUse of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosis
mhaendel
Crash. Burn. Roast the Marshmallows.
Crash. Burn. Roast the Marshmallows.Crash. Burn. Roast the Marshmallows.
Crash. Burn. Roast the Marshmallows.
Yaniv Erlich
EMBL
EMBLEMBL
EMBL
Vidya Kalaivani Rajkumar
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
EBI
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
open_phacts
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
ExternalEvents
cBioPortal Webinar 際際滷s (2/3)
cBioPortal Webinar 際際滷s (2/3)cBioPortal Webinar 際際滷s (2/3)
cBioPortal Webinar 際際滷s (2/3)
Pistoia Alliance
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
KAUSHAL SAHU
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
Sreekanth Gali
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Nils Gehlenborg
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databases
cschlos2
Rishi
RishiRishi
Rishi
Narayan Awasthi
Molecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discoveryMolecular scaffolds are special and useful guides to discovery
Molecular scaffolds are special and useful guides to discovery
Jeremy Yang
Molecular scaffolds poster
Molecular scaffolds posterMolecular scaffolds poster
Molecular scaffolds poster
Jeremy Yang
Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015Computing on Phenotypes AMP 2015
Computing on Phenotypes AMP 2015
Chris Mungall
Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!Introduction to Gene Mining Part A: BLASTn-off!
Introduction to Gene Mining Part A: BLASTn-off!
adcobb
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...The Monarch Initiative: An integrated genotype-phenotype platform for disease...
The Monarch Initiative: An integrated genotype-phenotype platform for disease...
mhaendel
Introduction to Bioinformatics.
 Introduction to Bioinformatics. Introduction to Bioinformatics.
Introduction to Bioinformatics.
Elena S端gis
Use of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosisUse of semantic phenotyping to aid disease diagnosis
Use of semantic phenotyping to aid disease diagnosis
mhaendel
Crash. Burn. Roast the Marshmallows.
Crash. Burn. Roast the Marshmallows.Crash. Burn. Roast the Marshmallows.
Crash. Burn. Roast the Marshmallows.
Yaniv Erlich
UniProt-GOA
UniProt-GOAUniProt-GOA
UniProt-GOA
EBI
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
open_phacts
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
The National Center for Biotechnology Information (NCBI) Pathogen Analysis Pi...
ExternalEvents
cBioPortal Webinar 際際滷s (2/3)
cBioPortal Webinar 際際滷s (2/3)cBioPortal Webinar 際際滷s (2/3)
cBioPortal Webinar 際際滷s (2/3)
Pistoia Alliance
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
KAUSHAL SAHU
Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02Biodatabases 101220022654-phpapp02
Biodatabases 101220022654-phpapp02
Sreekanth Gali
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All TogetherVisualization Approaches for Biomedical Omics Data: Putting It All Together
Visualization Approaches for Biomedical Omics Data: Putting It All Together
Nils Gehlenborg
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
Bioinformatics Databases
Bioinformatics DatabasesBioinformatics Databases
Bioinformatics Databases
cschlos2

Viewers also liked (9)

Medioambiente Linked DataMedioambiente Linked Data
Medioambiente Linked Data
Mikel Ega単a Aranguren, Ph.D.
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPLOPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
Mikel Ega単a Aranguren, Ph.D.
Applying sw mikel_eganaApplying sw mikel_egana
Applying sw mikel_egana
Mikel Ega単a Aranguren, Ph.D.
Linked data functional genomics
Linked data functional genomicsLinked data functional genomics
Linked data functional genomics
Mikel Ega単a Aranguren, Ph.D.
Life Sciences Linked DataLife Sciences Linked Data
Life Sciences Linked Data
Mikel Ega単a Aranguren, Ph.D.
M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...
M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...
Mikel Ega単a Aranguren, Ph.D.
Populous swat4ls slides_slideshare
Populous swat4ls slides_slidesharePopulous swat4ls slides_slideshare
Populous swat4ls slides_slideshare
Mikel Ega単a Aranguren, Ph.D.
Aplicaci坦n de la Web Sem叩ntica en Bioinform叩ticaAplicaci坦n de la Web Sem叩ntica en Bioinform叩tica
Aplicaci坦n de la Web Sem叩ntica en Bioinform叩tica
Mikel Ega単a Aranguren, Ph.D.
Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...
Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...
Edgar Fernando Salcedo Ramirez
Medioambiente Linked DataMedioambiente Linked Data
Medioambiente Linked Data
Mikel Ega単a Aranguren, Ph.D.
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPLOPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
OPPL-Galaxy: Enhancing ontology exploitation in Galaxy with OPPL
Mikel Ega単a Aranguren, Ph.D.
Applying sw mikel_eganaApplying sw mikel_egana
Applying sw mikel_egana
Mikel Ega単a Aranguren, Ph.D.
Life Sciences Linked DataLife Sciences Linked Data
Life Sciences Linked Data
Mikel Ega単a Aranguren, Ph.D.
M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...
M辿todos y Resultados Actuales en Bioinform叩tica: know-how y know-what de las ...
Mikel Ega単a Aranguren, Ph.D.
Aplicaci坦n de la Web Sem叩ntica en Bioinform叩ticaAplicaci坦n de la Web Sem叩ntica en Bioinform叩tica
Aplicaci坦n de la Web Sem叩ntica en Bioinform叩tica
Mikel Ega単a Aranguren, Ph.D.
Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...
Clase 04 CIENCIAS MICAS: BIOINFORMTICA, GENMICA, PROTEMICA, TRANCRIPTMIC...
Edgar Fernando Salcedo Ramirez

Similar to Mikel egana itbam_2010_ogo_system (20)

Fundamentals of Analysis of Exomes
Fundamentals of Analysis of ExomesFundamentals of Analysis of Exomes
Fundamentals of Analysis of Exomes
daforerog
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
Benjamin Good
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
mikaelhuss
GIGA2 Structuring Phenotype Data
GIGA2 Structuring Phenotype DataGIGA2 Structuring Phenotype Data
GIGA2 Structuring Phenotype Data
Chris Mungall
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
Genomic Epidemiology:  How High Throughput Sequencing changed our view on bac...Genomic Epidemiology:  How High Throughput Sequencing changed our view on bac...
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
Jo達o Andr辿 Carri巽o
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
Rothamsted Research, UK
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
mhaendel
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
mhaendel
Common languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithmsCommon languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithms
Jo達o Andr辿 Carri巽o
Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Human Reference Genome Browser Presentation at BIO-ITWorld 2008Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Saul Kravitz
TAIR -Using biological ontologies to accelerate progress in plant biology res...
TAIR -Using biological ontologies to accelerate progress in plant biology res...TAIR -Using biological ontologies to accelerate progress in plant biology res...
TAIR -Using biological ontologies to accelerate progress in plant biology res...
Phoenix Bioinformatics
SMBE 2015: Expression STRs
SMBE 2015: Expression STRsSMBE 2015: Expression STRs
SMBE 2015: Expression STRs
Yaniv Erlich
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical Communications
Tim Clark
Investigating Term Reuse and Overlap in Biomedical Ontologies
Investigating Term Reuse and Overlap in Biomedical OntologiesInvestigating Term Reuse and Overlap in Biomedical Ontologies
Investigating Term Reuse and Overlap in Biomedical Ontologies
Maulik Kamdar
Major biological nucleotide databases
Major biological nucleotide databasesMajor biological nucleotide databases
Major biological nucleotide databases
Vidya Kalaivani Rajkumar
The uni prot knowledgebase
The uni prot knowledgebaseThe uni prot knowledgebase
The uni prot knowledgebase
Kew Sama
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
myGrid team
ICAR2016 TAIR talk
ICAR2016 TAIR talkICAR2016 TAIR talk
ICAR2016 TAIR talk
Donghui Li
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experiments
Helena Deus
2012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les12012 03 01_bioinformatics_ii_les1
2012 03 01_bioinformatics_ii_les1
Prof. Wim Van Criekinge
Fundamentals of Analysis of Exomes
Fundamentals of Analysis of ExomesFundamentals of Analysis of Exomes
Fundamentals of Analysis of Exomes
daforerog
Computing on the shoulders of giants
Computing on the shoulders of giantsComputing on the shoulders of giants
Computing on the shoulders of giants
Benjamin Good
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
mikaelhuss
GIGA2 Structuring Phenotype Data
GIGA2 Structuring Phenotype DataGIGA2 Structuring Phenotype Data
GIGA2 Structuring Phenotype Data
Chris Mungall
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
Genomic Epidemiology:  How High Throughput Sequencing changed our view on bac...Genomic Epidemiology:  How High Throughput Sequencing changed our view on bac...
Genomic Epidemiology: How High Throughput Sequencing changed our view on bac...
Jo達o Andr辿 Carri巽o
FAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use CaseFAIR Agronomy, where are we? The KnetMiner Use Case
FAIR Agronomy, where are we? The KnetMiner Use Case
Rothamsted Research, UK
Why the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be oneWhy the world needs phenopacketeers, and how to be one
Why the world needs phenopacketeers, and how to be one
mhaendel
The Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision MedicineThe Monarch Initiative: From Model Organism to Precision Medicine
The Monarch Initiative: From Model Organism to Precision Medicine
mhaendel
Common languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithmsCommon languages in genomic epidemiology: from ontologies to algorithms
Common languages in genomic epidemiology: from ontologies to algorithms
Jo達o Andr辿 Carri巽o
Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Human Reference Genome Browser Presentation at BIO-ITWorld 2008Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Human Reference Genome Browser Presentation at BIO-ITWorld 2008
Saul Kravitz
TAIR -Using biological ontologies to accelerate progress in plant biology res...
TAIR -Using biological ontologies to accelerate progress in plant biology res...TAIR -Using biological ontologies to accelerate progress in plant biology res...
TAIR -Using biological ontologies to accelerate progress in plant biology res...
Phoenix Bioinformatics
SMBE 2015: Expression STRs
SMBE 2015: Expression STRsSMBE 2015: Expression STRs
SMBE 2015: Expression STRs
Yaniv Erlich
Dynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical CommunicationsDynamic Semantic Metadata in Biomedical Communications
Dynamic Semantic Metadata in Biomedical Communications
Tim Clark
Investigating Term Reuse and Overlap in Biomedical Ontologies
Investigating Term Reuse and Overlap in Biomedical OntologiesInvestigating Term Reuse and Overlap in Biomedical Ontologies
Investigating Term Reuse and Overlap in Biomedical Ontologies
Maulik Kamdar
The uni prot knowledgebase
The uni prot knowledgebaseThe uni prot knowledgebase
The uni prot knowledgebase
Kew Sama
The Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, FutureThe Taverna Workflow Management Software Suite - Past, Present, Future
The Taverna Workflow Management Software Suite - Past, Present, Future
myGrid team
ICAR2016 TAIR talk
ICAR2016 TAIR talkICAR2016 TAIR talk
ICAR2016 TAIR talk
Donghui Li
provenance of microarray experiments
provenance of microarray experimentsprovenance of microarray experiments
provenance of microarray experiments
Helena Deus

Mikel egana itbam_2010_ogo_system

  • 1. A semantic query interface for the OGO platform Jos辿 Antonio Mi単arro-Gim辿nez (jose.minyarro@um.es) Mikel Ega単a Aranguren, Ph.D. (mikel.egana.aranguren@gmail.com) Francisco Garc鱈a-S叩nchez, Ph.D. (frgarcia@um.es) Jesualdo Tom叩s Fern叩ndez-Breis, Ph.D. (jfernand@um.es) Faculty of Computer Science University of Murcia Spain ITBAM (DEXA) Bilbo 2010 http://tinyurl.com/35amhn6
  • 2. Overview Orthologs Information about orthologs and diseases OGO system A semantic query interface for the OGO system Sample query
  • 5. Orthologs and genetic diseases Homologene KOG Inparanoid OrthoMCL Online Mendelian Inheritance in Man (OMIM) Gene 1 Disease Gene 2 Orthologs
  • 8. OGO ontology: imported ontologies Gene Ontology (OBOF): molecular function, biological process and cellular component of gene products Evidence Codes Ontology (Candidate OBOF): GO annotations evidence codes OBO Relationship Types (Candidate OBOF): Gene product participates in some (molecular function or biological process) Gene product located in some cellular component NCBI taxonomy: organisms classification
  • 11. Implementation of the OGO system
  • 12. Interfaces of the OGO system Keyword based querying Semantic querying
  • 15. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 16. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 17. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 18. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 19. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 20. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ? @prefix ncbi: <http://um.es/ncbi.owl>. @prefix ogo: <http://miuras.inf.um.es/ontologies/OGO.owl>. SELECT ?Gene_0 ?Genetic_disease_1 WHERE { ?Gene_0 ogo:fromSpecies ncbi:NCBI_10116 ?Genetic_disease_1 ogo:Name ?literal_4 . FILTER (regex(?literal_4,&quot;Prostate cancer, susceptibility to&quot;)) . ?Genetic_disease_1 ogo:causedBy ?Gene_2 . ?Cluster_of_Orthologous_genes_3 ogo:hasOrthologous ?Gene_2 . ?Cluster_of_Orthologous_genes_3 ogo:hasOrthologous ?Gene_0 . }
  • 21. Sample query Ortholog genes of the gene that causes prostate cancer on Rattus norvegicus ?
  • 22. Query grammar Query::= &quot;SELECT&quot; ListVar (WhereClause)? ListVar::=Var (Var)* WhereClause::=&quot;WHERE {&quot; ConditionClause (ConditionClause)* &quot;}&quot; ConditionClause::=[VarCondition | LiteralCondition] &quot;.&quot; VarCondition::=[Var | Individual] Property [Var | Individual] LiteralCondition::=[Var | Individual] Property [Var | Individual] &quot;.&quot; &quot;FILTER (regex (&quot; Var &quot;,&quot; Literal &quot;))&quot; Var -> This term represents a variable in the query which can be matched to any concept or individual in the ontology. Individual -> This term represents a concept or individual identied by an URI in the ontology. Property -> This term represents a relationship or property identied by an URI in the ontology. Literal -> This term represents any data value dened by the user.
  • 23. Future plans OWL reasoning for querying (OWL 2 QL?) Pellet Integrity Constraint Validator (Pellet ICV): OWL as schema language for RDF (CWA) Check the gathered information More bio-ontologies Clinical archetypes for querying (ISO 13606): exchange of ortholog/disease information in a standard biomedical research setting
  • 24. Conclusions Orthologs and diseases: new hypotheses OGO provides a resource for exploiting such combined information Semantic query interface: Complex queries easily (No SPARQL syntax) http://miuras.inf.um.es/~ogo/
  • 25. Acknowledgements Spanish Ministry for Science and Education (grant TSI2007-66575-C02-02) Comunidad Aut坦noma de la Regi坦n de Murcia (grant BIO-TEC 06/01-0005) Fundaci坦n S辿neca, Servicio de Empleo y Formaci坦n (grant 07836/BPS/07)

Editor's Notes

  • #2: The OGO system ... Presentation URL Creative commons attribution non commercial share alike
  • #4: Orthologs are homolog sequences (they share a common ancestor) that diverged by an speciation event
  • #5: Orthologs can be used to generate hypotheses. For example, if frog alpha and chicken alpha are ortholog genes, and it is known that frog alpha is involved in a certain trait (e.g. a disease), then it is likely that chicken alpha is also involved in or related to such trait, in chicken Therefore, the information about orthologs is very important in biomedical research, since they show new research paths for human diseases with a genetic cause
  • #6: Unfortunately, information about orthologs and diseases is scattered and it is difficult to combine
  • #7: The OGO system provides a resource for accesing the ortholog/diseases combined information in a precise way. The OGO system is an OWL KB, in which the OGO ontology provides the schema and the information regarding orthologs and diseases is stored in instances, with relationships between them The OGO ontology is also used as a guide for the user to build queries The system is accessed with keywords or SPARQL The pipeline is executed periodically (Mappings, information checking)
  • #8: OGO ontology (KB schema and querying)
  • #9: Imported ontologies (GO, ECO, RO) reuse existing semantics for querying, as we will see when I describe the queries OBOF: Wealth of quality reusable semantics of the biodomain GO: Member ECO, RO: Candidates
  • #10: Not detailed Classes as values (OBO format) Future DL
  • #11: Pipeline
  • #12: JENA allows to store OWL in a MySQL database, and to access it with SPARQL
  • #13: The OGO system has two interfaces: Keyword based interface (by disease/by orthologs): not very expressive but fast Semantic interface (next)
  • #14: The semantic interface is more expressive than the keyword based interface. However, as SPARQL is difficult to use by biologists, the semantic interface provides a graphical interface for creating queries, that, later, are translated into SPARQL It should be noted that this does not allow to use the whole expressivity of SPARQL, but a considerable part of it (see grammar)
  • #15: In order to define the query, we can select concepts from the OGO ontology, and add any requirements, also using the OGO ontology We can exploit the imported ontologies for querying: GO, ECO, NCBI The defined query is translated into SPARQL and executed against the KB
  • #16: Whole process First we select the variables that we are interested in from the OGO ontology. In this case, Gene and Genetic disease (i.e, we want to retrieve Genes and Genetic diseases) The imported ontologies can be exploited (GO, ECO, NCBI) for querying
  • #17: Then we add requirements, also using the OGO ontology (And Imported ontologies ). We can use the selected variables or new ones. We can delete/edit requirements
  • #18: We edit a requirement by using the OGO ontology (to add new variables and values) or by using the already defined variables NCBI (imported, like GO, and ECO) for providing values for the requirement
  • #19: We add the finished requirement to the the query
  • #20: We can add as many requirements as we want
  • #21: Finally, the query is translated into SPARQL and executed against the KB
  • #22: Results
  • #23: The expressivity of the query is limited by the grammar
  • #26: YOGY already does this, however, redundant results by resource, instead of gene centric, i.e.same gene in different resources OGO ontology is used to check the consistency of the info Less expressivity in SPARQL: no OPTIONAL