際際滷

際際滷Share a Scribd company logo
Efficient Validation of Large Models using
the Mogwa誰 Tool
Gwendal Daniel
Introduction
 Large models are not a research chimera!
 Civil engineering
 Biology
 Reverse engineering
 Need to provide solutions to
 Store large models
 Efficiently query, transform and validate them
Model Persistence
 Default serialization mechanism: XMI
 Scalable model persistence frameworks
 Use databases to store models
 Relational: CDO
 NoSQL: Morsa, NeoEMF
 Advanced features (versioning, collaborative editing, transactions)
 Low memory footprint
Model Queries and Transformations
API Call1

API Calln
OCL
Query
OCL
Interpreter
EMF API Database
Package.allInstances().name
get(p1)
get(p1,name)

get(pn)
get(pn,name)
Low-level modeling API
 Not aligned with the
database capabilities
Fragmented queries
 Not efficient
 Remote database
Intermediate objects
 Memory consumption
 Execution time overhead
Model Queries and Transformations
 Not efficient to compute model queries and transformations
 Why cant we write database queries manually?
 Model persistence frameworks typically rely on NoSQL databases
 Multiple query languages, data representation, etc
 Low-level queries are hard to understand and maintain
 Modeling expertise vs. Database expertise
 Solution: generate them!
Mogwa誰
 Translation engine
 Support for OCL and ATL
 Generates generic Gremlin traversals
 Backend-agnostic queries
 Model datastore
 Abstraction layer on top of the backend
 Defines query and transformation primitives
 Implementations for NeoEMF, Neo4j3, and experimental Jdbc support
Mogwa誰
Mogwa誰
Translation Engine
Model
Database
Model Query
Gremlin
Script
Model Datastore
Impl.
uses
e1: El
Model Datastore
Definition
uses
implements
Database
results
e1: El
e1: ElModel
Elements
v1
v3
v2Reification
Model Validation
Mogwa誰 efficiently supports model queries and transformations
What about model validation?
 Two-step process
 Find the elements to check (i.e. involved in at least one constraint)
 Typically costly in scalable model persistence frameworks
 Check the constraints for each elements
 Efficiently computed by Mogwa誰 query engine
 Can we optimize this two step process to benefit from Mogwa誰?
Model Validation
Mogwa誰
Translation Engine
Model
Database
Model Constraints
Gremlin
Script
Model Datastore
Impl.
uses
e1: El
Model Datastore
Definition
uses
implements
Database
results
e1: El
e1: ElModel
Elements
v1
v3
v2Reification
Constraint
Pre-processing
Constraint pre-processing
context Client inv validClient:
self.hasPaid implies self.orders.products
->forAll(p | p.price > 0)
Constraint Rewriting
 Based on existing work
1- Find all the Clients
2- Evaluate validClient for each
Client instance
Client.allInstances()->select(c | !(c.hasPaid implies
c.orders.products->forAll(p.price > 0)))
Constraint Rewriting
 Systematic mapping
 Translate context-specific constraints into global queries
 Local optimizations
 Mogwa誰 benefits
 Efficient allInstances() computation
 Execute a single query
 Return all the instances violating the constraint
Evaluation
0
1000
2000
3000
4000
5000
6000
7000
8000
9000
10000
MDT OCL Standard AllInstances + Mogwa誰 Pre-processing + Mogwa誰
Model Validation Execution Time (ms)
NotEmptyClassName ValidJavadocTags
Benchmarks
Java Model
~ 80 000 elements
3 configurations / 2 constraints
Evaluation
0
20
40
60
80
100
120
140
MDT OCL Standard AllInstances + Mogwa誰 Pre-processing + Mogwa誰
Model Validation Memory Consumption (MB)
NotEmptyClassName ValidJavadocTags
Benchmarks
Java Model
~ 80 000 elements
3 configurations / 2 constraints
Conclusion
 Mogwai
 Translation-based approach
 Standard language support
 No query fragmentation
 Support for various datastores
 Model validation with Mogwa誰
 Constraint pre-processing
 Single query execution
 Positive results
Future Work
 Advanced rewriting techniques
 Adapted to the underlying database schema
 Heuristics (number of operations, nested iterators, etc)
 Constraint composition
 Static analysis
 Reuse common parts in constraint navigation paths
 Rewrite a set of constraints
Websites & Software Repositories
NeoEMF: neoemf.com
Mogwa誰: github.com/atlanmod/mogwai
gdaniel.github.io

More Related Content

Similar to Efficient Validation of Large Models using the Mogwa誰 Tool (20)

Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving System
Databricks
Thesis Defense (Gwendal DANIEL) - Nov 2017
Thesis Defense (Gwendal DANIEL) - Nov 2017Thesis Defense (Gwendal DANIEL) - Nov 2017
Thesis Defense (Gwendal DANIEL) - Nov 2017
Gwendal Daniel
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017
Dan Crankshaw
EMF-IncQuery presentation at TOOLS 2012
EMF-IncQuery presentation at TOOLS 2012EMF-IncQuery presentation at TOOLS 2012
EMF-IncQuery presentation at TOOLS 2012
Istvan Rath
Object- Relational Persistence in Smalltalk
Object- Relational Persistence in SmalltalkObject- Relational Persistence in Smalltalk
Object- Relational Persistence in Smalltalk
ESUG
Real-world Entity Framework
Real-world Entity FrameworkReal-world Entity Framework
Real-world Entity Framework
Lynn Langit
Pragmatic Architecture in .NET
Pragmatic Architecture in .NETPragmatic Architecture in .NET
Pragmatic Architecture in .NET
housecor
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Fwdays
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Spark Summit
Agile Development in .NET
Agile Development in .NETAgile Development in .NET
Agile Development in .NET
danhermes
Tensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with HummingbirdTensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with Hummingbird
Databricks
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNetFrom Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
Eric Haibin Lin
NoSQL and ACID
NoSQL and ACIDNoSQL and ACID
NoSQL and ACID
FoundationDB
IncQuery-D: Incremental Queries in the Cloud
IncQuery-D: Incremental Queries in the CloudIncQuery-D: Incremental Queries in the Cloud
IncQuery-D: Incremental Queries in the Cloud
G叩bor Sz叩rnyas
Apache Drill (ver. 0.2)
Apache Drill (ver. 0.2)Apache Drill (ver. 0.2)
Apache Drill (ver. 0.2)
Camuel Gilyadov
Towards a Macrobenchmark Framework for Performance Analysis of Java Applications
Towards a Macrobenchmark Framework for Performance Analysis of Java ApplicationsTowards a Macrobenchmark Framework for Performance Analysis of Java Applications
Towards a Macrobenchmark Framework for Performance Analysis of Java Applications
G叩bor Sz叩rnyas
How to Create Your Own Product-Modeling Environment
How to Create Your Own Product-Modeling EnvironmentHow to Create Your Own Product-Modeling Environment
How to Create Your Own Product-Modeling Environment
Tim Geisler
Using Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High ScalabilityUsing Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High Scalability
mabuhr
Camunda BPM 7.2: Performance and Scalability (English)
Camunda BPM 7.2: Performance and Scalability (English)Camunda BPM 7.2: Performance and Scalability (English)
Camunda BPM 7.2: Performance and Scalability (English)
camunda services GmbH
NoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, ImplementationsNoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, Implementations
Firat Atagun
Clipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving SystemClipper: A Low-Latency Online Prediction Serving System
Clipper: A Low-Latency Online Prediction Serving System
Databricks
Thesis Defense (Gwendal DANIEL) - Nov 2017
Thesis Defense (Gwendal DANIEL) - Nov 2017Thesis Defense (Gwendal DANIEL) - Nov 2017
Thesis Defense (Gwendal DANIEL) - Nov 2017
Gwendal Daniel
Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017Clipper at UC Berkeley RISECamp 2017
Clipper at UC Berkeley RISECamp 2017
Dan Crankshaw
EMF-IncQuery presentation at TOOLS 2012
EMF-IncQuery presentation at TOOLS 2012EMF-IncQuery presentation at TOOLS 2012
EMF-IncQuery presentation at TOOLS 2012
Istvan Rath
Object- Relational Persistence in Smalltalk
Object- Relational Persistence in SmalltalkObject- Relational Persistence in Smalltalk
Object- Relational Persistence in Smalltalk
ESUG
Real-world Entity Framework
Real-world Entity FrameworkReal-world Entity Framework
Real-world Entity Framework
Lynn Langit
Pragmatic Architecture in .NET
Pragmatic Architecture in .NETPragmatic Architecture in .NET
Pragmatic Architecture in .NET
housecor
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Oleksii Moskalenko "Continuous Delivery of ML Pipelines to Production"
Fwdays
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Clipper: A Low-Latency Online Prediction Serving System: Spark Summit East ta...
Spark Summit
Agile Development in .NET
Agile Development in .NETAgile Development in .NET
Agile Development in .NET
danhermes
Tensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with HummingbirdTensors Are All You Need: Faster Inference with Hummingbird
Tensors Are All You Need: Faster Inference with Hummingbird
Databricks
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNetFrom Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
From Hours to Minutes: The Journey of Optimizing Mask-RCNN and BERT Using MXNet
Eric Haibin Lin
IncQuery-D: Incremental Queries in the Cloud
IncQuery-D: Incremental Queries in the CloudIncQuery-D: Incremental Queries in the Cloud
IncQuery-D: Incremental Queries in the Cloud
G叩bor Sz叩rnyas
Apache Drill (ver. 0.2)
Apache Drill (ver. 0.2)Apache Drill (ver. 0.2)
Apache Drill (ver. 0.2)
Camuel Gilyadov
Towards a Macrobenchmark Framework for Performance Analysis of Java Applications
Towards a Macrobenchmark Framework for Performance Analysis of Java ApplicationsTowards a Macrobenchmark Framework for Performance Analysis of Java Applications
Towards a Macrobenchmark Framework for Performance Analysis of Java Applications
G叩bor Sz叩rnyas
How to Create Your Own Product-Modeling Environment
How to Create Your Own Product-Modeling EnvironmentHow to Create Your Own Product-Modeling Environment
How to Create Your Own Product-Modeling Environment
Tim Geisler
Using Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High ScalabilityUsing Grid Technologies in the Cloud for High Scalability
Using Grid Technologies in the Cloud for High Scalability
mabuhr
Camunda BPM 7.2: Performance and Scalability (English)
Camunda BPM 7.2: Performance and Scalability (English)Camunda BPM 7.2: Performance and Scalability (English)
Camunda BPM 7.2: Performance and Scalability (English)
camunda services GmbH
NoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, ImplementationsNoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, Implementations
Firat Atagun

Recently uploaded (20)

COMPLEXATION AND PROTEIN BINDING. B.pharm
COMPLEXATION  AND PROTEIN BINDING. B.pharmCOMPLEXATION  AND PROTEIN BINDING. B.pharm
COMPLEXATION AND PROTEIN BINDING. B.pharm
kalyanshanam6
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills MN
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptxMechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
kadampriyanka549
Muscle Physiology A level Biology Powerpoint Edexcel A
Muscle Physiology A level Biology Powerpoint Edexcel AMuscle Physiology A level Biology Powerpoint Edexcel A
Muscle Physiology A level Biology Powerpoint Edexcel A
ridhimahindroo
Polymer Composites Classification, Reinforcements, Matrices,.pptx
Polymer Composites Classification, Reinforcements, Matrices,.pptxPolymer Composites Classification, Reinforcements, Matrices,.pptx
Polymer Composites Classification, Reinforcements, Matrices,.pptx
JinnJinnkiJaddu
African biomass burning is a substantial source of phosphorus deposition to ...
African biomass burning is a substantial source of phosphorus deposition to ...African biomass burning is a substantial source of phosphorus deposition to ...
African biomass burning is a substantial source of phosphorus deposition to ...
Anne Barkley
Introduction of Cutaneous Nerves by M.Sundari Singh
Introduction of Cutaneous Nerves  by M.Sundari SinghIntroduction of Cutaneous Nerves  by M.Sundari Singh
Introduction of Cutaneous Nerves by M.Sundari Singh
meharsundarisingh
Sciences of Europe No 161 (2025)
Sciences of Europe No 161 (2025)Sciences of Europe No 161 (2025)
Sciences of Europe No 161 (2025)
Sciences of Europe
Importance and Essentials and Necessities of Cell and Molecular Biology
Importance and Essentials and Necessities of Cell and Molecular BiologyImportance and Essentials and Necessities of Cell and Molecular Biology
Importance and Essentials and Necessities of Cell and Molecular Biology
johnfreeguydoe
Use of plasma proteins in diagnostics.pptx
Use of plasma proteins in diagnostics.pptxUse of plasma proteins in diagnostics.pptx
Use of plasma proteins in diagnostics.pptx
amanee3
basic tissuse oral epithelium the classifications and subunits
basic tissuse oral epithelium the classifications and subunitsbasic tissuse oral epithelium the classifications and subunits
basic tissuse oral epithelium the classifications and subunits
jemimahrachel1299
The Sheena Bora Murder Case - Forensic Science.pptx
The Sheena Bora Murder Case - Forensic Science.pptxThe Sheena Bora Murder Case - Forensic Science.pptx
The Sheena Bora Murder Case - Forensic Science.pptx
SahilShah140721
GNU Linux - Introduction and Administration.
GNU Linux - Introduction and Administration.GNU Linux - Introduction and Administration.
GNU Linux - Introduction and Administration.
Xavier de Pedro
The circulation of blood general anatomy by M.Sundari Singh
The circulation of blood general anatomy by M.Sundari SinghThe circulation of blood general anatomy by M.Sundari Singh
The circulation of blood general anatomy by M.Sundari Singh
meharsundarisingh
Types of skeletal muscles fibres by G.Manojna
Types of skeletal muscles fibres by G.ManojnaTypes of skeletal muscles fibres by G.Manojna
Types of skeletal muscles fibres by G.Manojna
grandhimanojna06
Transforming routine healthcare delivery and crisis preparedness: The role o...
Transforming routine healthcare delivery and crisis preparedness:  The role o...Transforming routine healthcare delivery and crisis preparedness:  The role o...
Transforming routine healthcare delivery and crisis preparedness: The role o...
Eleni Kaldoudi
Global change and effects on infectious disease
Global change and effects on infectious diseaseGlobal change and effects on infectious disease
Global change and effects on infectious disease
MichaelMahonPhD
Different Strategies in Scientific Publishing
Different Strategies in Scientific PublishingDifferent Strategies in Scientific Publishing
Different Strategies in Scientific Publishing
Carlos Baquero
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Anne Barkley
Mixing small molecules and macromolecules in the world of informatics
Mixing small molecules and macromolecules in the world of informaticsMixing small molecules and macromolecules in the world of informatics
Mixing small molecules and macromolecules in the world of informatics
Alex Clark
COMPLEXATION AND PROTEIN BINDING. B.pharm
COMPLEXATION  AND PROTEIN BINDING. B.pharmCOMPLEXATION  AND PROTEIN BINDING. B.pharm
COMPLEXATION AND PROTEIN BINDING. B.pharm
kalyanshanam6
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills of Minnesota Explores Thermal Energy Concepts Breakthroughs in ...
Travis Hills MN
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptxMechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
Mechanism of action of lysozyme and chymotrypsin Priyanka ppt.pptx
kadampriyanka549
Muscle Physiology A level Biology Powerpoint Edexcel A
Muscle Physiology A level Biology Powerpoint Edexcel AMuscle Physiology A level Biology Powerpoint Edexcel A
Muscle Physiology A level Biology Powerpoint Edexcel A
ridhimahindroo
Polymer Composites Classification, Reinforcements, Matrices,.pptx
Polymer Composites Classification, Reinforcements, Matrices,.pptxPolymer Composites Classification, Reinforcements, Matrices,.pptx
Polymer Composites Classification, Reinforcements, Matrices,.pptx
JinnJinnkiJaddu
African biomass burning is a substantial source of phosphorus deposition to ...
African biomass burning is a substantial source of phosphorus deposition to ...African biomass burning is a substantial source of phosphorus deposition to ...
African biomass burning is a substantial source of phosphorus deposition to ...
Anne Barkley
Introduction of Cutaneous Nerves by M.Sundari Singh
Introduction of Cutaneous Nerves  by M.Sundari SinghIntroduction of Cutaneous Nerves  by M.Sundari Singh
Introduction of Cutaneous Nerves by M.Sundari Singh
meharsundarisingh
Sciences of Europe No 161 (2025)
Sciences of Europe No 161 (2025)Sciences of Europe No 161 (2025)
Sciences of Europe No 161 (2025)
Sciences of Europe
Importance and Essentials and Necessities of Cell and Molecular Biology
Importance and Essentials and Necessities of Cell and Molecular BiologyImportance and Essentials and Necessities of Cell and Molecular Biology
Importance and Essentials and Necessities of Cell and Molecular Biology
johnfreeguydoe
Use of plasma proteins in diagnostics.pptx
Use of plasma proteins in diagnostics.pptxUse of plasma proteins in diagnostics.pptx
Use of plasma proteins in diagnostics.pptx
amanee3
basic tissuse oral epithelium the classifications and subunits
basic tissuse oral epithelium the classifications and subunitsbasic tissuse oral epithelium the classifications and subunits
basic tissuse oral epithelium the classifications and subunits
jemimahrachel1299
The Sheena Bora Murder Case - Forensic Science.pptx
The Sheena Bora Murder Case - Forensic Science.pptxThe Sheena Bora Murder Case - Forensic Science.pptx
The Sheena Bora Murder Case - Forensic Science.pptx
SahilShah140721
GNU Linux - Introduction and Administration.
GNU Linux - Introduction and Administration.GNU Linux - Introduction and Administration.
GNU Linux - Introduction and Administration.
Xavier de Pedro
The circulation of blood general anatomy by M.Sundari Singh
The circulation of blood general anatomy by M.Sundari SinghThe circulation of blood general anatomy by M.Sundari Singh
The circulation of blood general anatomy by M.Sundari Singh
meharsundarisingh
Types of skeletal muscles fibres by G.Manojna
Types of skeletal muscles fibres by G.ManojnaTypes of skeletal muscles fibres by G.Manojna
Types of skeletal muscles fibres by G.Manojna
grandhimanojna06
Transforming routine healthcare delivery and crisis preparedness: The role o...
Transforming routine healthcare delivery and crisis preparedness:  The role o...Transforming routine healthcare delivery and crisis preparedness:  The role o...
Transforming routine healthcare delivery and crisis preparedness: The role o...
Eleni Kaldoudi
Global change and effects on infectious disease
Global change and effects on infectious diseaseGlobal change and effects on infectious disease
Global change and effects on infectious disease
MichaelMahonPhD
Different Strategies in Scientific Publishing
Different Strategies in Scientific PublishingDifferent Strategies in Scientific Publishing
Different Strategies in Scientific Publishing
Carlos Baquero
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Igniting Progress: Outcomes from the FLARE workshop and three challenges for ...
Anne Barkley
Mixing small molecules and macromolecules in the world of informatics
Mixing small molecules and macromolecules in the world of informaticsMixing small molecules and macromolecules in the world of informatics
Mixing small molecules and macromolecules in the world of informatics
Alex Clark

Efficient Validation of Large Models using the Mogwa誰 Tool

  • 1. Efficient Validation of Large Models using the Mogwa誰 Tool Gwendal Daniel
  • 2. Introduction Large models are not a research chimera! Civil engineering Biology Reverse engineering Need to provide solutions to Store large models Efficiently query, transform and validate them
  • 3. Model Persistence Default serialization mechanism: XMI Scalable model persistence frameworks Use databases to store models Relational: CDO NoSQL: Morsa, NeoEMF Advanced features (versioning, collaborative editing, transactions) Low memory footprint
  • 4. Model Queries and Transformations API Call1 API Calln OCL Query OCL Interpreter EMF API Database Package.allInstances().name get(p1) get(p1,name) get(pn) get(pn,name) Low-level modeling API Not aligned with the database capabilities Fragmented queries Not efficient Remote database Intermediate objects Memory consumption Execution time overhead
  • 5. Model Queries and Transformations Not efficient to compute model queries and transformations Why cant we write database queries manually? Model persistence frameworks typically rely on NoSQL databases Multiple query languages, data representation, etc Low-level queries are hard to understand and maintain Modeling expertise vs. Database expertise Solution: generate them!
  • 6. Mogwa誰 Translation engine Support for OCL and ATL Generates generic Gremlin traversals Backend-agnostic queries Model datastore Abstraction layer on top of the backend Defines query and transformation primitives Implementations for NeoEMF, Neo4j3, and experimental Jdbc support
  • 7. Mogwa誰 Mogwa誰 Translation Engine Model Database Model Query Gremlin Script Model Datastore Impl. uses e1: El Model Datastore Definition uses implements Database results e1: El e1: ElModel Elements v1 v3 v2Reification
  • 8. Model Validation Mogwa誰 efficiently supports model queries and transformations What about model validation? Two-step process Find the elements to check (i.e. involved in at least one constraint) Typically costly in scalable model persistence frameworks Check the constraints for each elements Efficiently computed by Mogwa誰 query engine Can we optimize this two step process to benefit from Mogwa誰?
  • 9. Model Validation Mogwa誰 Translation Engine Model Database Model Constraints Gremlin Script Model Datastore Impl. uses e1: El Model Datastore Definition uses implements Database results e1: El e1: ElModel Elements v1 v3 v2Reification Constraint Pre-processing
  • 10. Constraint pre-processing context Client inv validClient: self.hasPaid implies self.orders.products ->forAll(p | p.price > 0) Constraint Rewriting Based on existing work 1- Find all the Clients 2- Evaluate validClient for each Client instance Client.allInstances()->select(c | !(c.hasPaid implies c.orders.products->forAll(p.price > 0)))
  • 11. Constraint Rewriting Systematic mapping Translate context-specific constraints into global queries Local optimizations Mogwa誰 benefits Efficient allInstances() computation Execute a single query Return all the instances violating the constraint
  • 12. Evaluation 0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 MDT OCL Standard AllInstances + Mogwa誰 Pre-processing + Mogwa誰 Model Validation Execution Time (ms) NotEmptyClassName ValidJavadocTags Benchmarks Java Model ~ 80 000 elements 3 configurations / 2 constraints
  • 13. Evaluation 0 20 40 60 80 100 120 140 MDT OCL Standard AllInstances + Mogwa誰 Pre-processing + Mogwa誰 Model Validation Memory Consumption (MB) NotEmptyClassName ValidJavadocTags Benchmarks Java Model ~ 80 000 elements 3 configurations / 2 constraints
  • 14. Conclusion Mogwai Translation-based approach Standard language support No query fragmentation Support for various datastores Model validation with Mogwa誰 Constraint pre-processing Single query execution Positive results
  • 15. Future Work Advanced rewriting techniques Adapted to the underlying database schema Heuristics (number of operations, nested iterators, etc) Constraint composition Static analysis Reuse common parts in constraint navigation paths Rewrite a set of constraints
  • 16. Websites & Software Repositories NeoEMF: neoemf.com Mogwa誰: github.com/atlanmod/mogwai gdaniel.github.io