狠狠撸

狠狠撸Share a Scribd company logo
Index Activiti data 

on Elasticsearch
Activiti User Day
Paris 2015
Silvio dos Passos Neto
CTO at iColabora
@silvioneto
“Don’t	
 ?bridge	
 ?the	
 ?
business-?‐IT	
 ?divide.	
 ?
Obliterate	
 ?it!”	
 ?(2003)
Smith & Fingar
Index Activiti Data on Elasticsearch
Index Activiti Data on Elasticsearch
?
?
Index Activiti Data on Elasticsearch
Index Activiti Data on Elasticsearch
Index Activiti Data on Elasticsearch
@mike_dias
The big table
problem
ID_ NAME_ VALUE_* …
ACT_HI_VARINST
User form
Process

Instance
ID_ NAME_ VALUE_* …
1 client_name Jonh …
2 client_tel 123456 …
3 due_date 01/06/2015 …
4 demand_desc I have a problem… …
ACT_HI_VARINST
User form
Process

Instance
Process

Instance
User form
ID_ NAME_ VALUE_* …
1 client_name Jonh …
2 client_tel 123456 …
3 due_date 01/06/2015 …
4 demand_desc I have a problem… …
5 client_name Bob …
6 client_tel 654321 …
7 due_date 10/06/2015 …
8 demand_desc My internet conn… …
ACT_HI_VARINST
85 ?elds
x
~1000 Process per day
=
~85.000
variables per day
~15 million
variables
in 9 months
Index Activiti Data on Elasticsearch
The Tool
Index Activiti Data on Elasticsearch
Built on top of
Analytics
Distributed
Indexing
the data
Historic
Data
P
P P
P
P
P
P
P
P
P
P P
P
P
P
P
P
P
P
Process Lake
P
P P
P
P
P
P
P
P
P
P P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
P
CPU 1 CPU 2 CPU 3 CPU 4
Process Lake
P
P
P
CPU 1
V V
V V
V V
V V
Variables
T
T
T
Tasks
P
Process
{ }P V V V
V V V V
V T T T
JSON
REST API
Real-Time
Data
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
Engine Events
E
E
E
E
E
E
E
E
E
E
E
E
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
E E E E
Engine Events
E
E
E
E
E
E
E
E
E
E
E
E
E EE
E EE
{ }
JSON
REST API
Listeners
Playing with
the data
Search
{

"query":{

"path":"variables",

"nested":{

"query":{

"match":{
"text":"Jo?o Silva"
}

}

}

}

}
Search results
Compare
SELECT * FROM ACT_HI_VARINST

WHERE NAME_ = 'passport'

AND TEXT_ = '1234'
{

"?lter":{

"nested":{

"path":"variables",

"?lter":{

"bool":{

"must":[

{ “term": { "name":"passport" }},
{ “term": { "text":"1234" }}
]

}

}

}

}

}
Response Time
0 secs
45 secs
90 secs
135 secs
180 secs
MySQL Elasticsearch
0,08 secs
161 secs
Response Time
0 secs
45 secs
90 secs
135 secs
180 secs
MySQL Elasticsearch
0,08 secs
161 secs
CENSURED
Lessons learned
Full text search is
a helpful feature
Reduce MySQL
workload
ES is great for
analytics
Next steps
Apache Spark
Lightning-Fast Cluster Computing
Java EE dependency
Open source
Thank you!
@mike_dias @silvioneto
Questions?
@mike_dias @silvioneto

More Related Content

Similar to Index Activiti Data on Elasticsearch (20)

PDF
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
gogo6
?
PPTX
Lean Startup for AaltoES Summer of Startups
Marko Taipale
?
PDF
Adversary Driven Defense in the Real World
James Wickett
?
PDF
Using Customer Development to get Traction in a Crowded Space
Outlyer
?
PDF
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis
?
PPTX
Digital Transformation How Digital Disruption is redefining the industries an...
technology_forum
?
PDF
Future ready
Ben Turner
?
PDF
Big Data LDN 2018: THE PATH TO ENTERPRISE AI: TALES FROM THE FIELD
Matt Stubbs
?
PPTX
Leveraging Mainframe Machine and Log Data in Splunk Analytics
Precisely
?
PPTX
TCO: An Achilles Heel of Hand-Built Data Warehouses
Kalido
?
PDF
A Connected Data Landscape: Virtualization and the Internet of Things
Inside Analysis
?
PDF
Living in a data driven world by V Laxmikanth Broadridge
Zinnov
?
PDF
Moving To MicroServices
David Walker
?
PPTX
Data Virtualization: An Introduction
Denodo
?
PDF
Cisco Connect Toronto 2018 DNA assurance
Cisco Canada
?
PDF
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
CA Technologies
?
PDF
Smoothing the continuous delivery path a tale of two architectures - expert...
Equal Experts
?
PDF
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Caserta
?
PDF
The value of a connected factory
Croonwolter&dros
?
PDF
Time Difference: How Tomorrow's Companies Will Outpace Today's
Inside Analysis
?
IoT Meets Big Data: The Opportunities and Challenges by Syed Hoda of ParStream
gogo6
?
Lean Startup for AaltoES Summer of Startups
Marko Taipale
?
Adversary Driven Defense in the Real World
James Wickett
?
Using Customer Development to get Traction in a Crowded Space
Outlyer
?
Smarter Analytics: Supporting the Enterprise with Automation
Inside Analysis
?
Digital Transformation How Digital Disruption is redefining the industries an...
technology_forum
?
Future ready
Ben Turner
?
Big Data LDN 2018: THE PATH TO ENTERPRISE AI: TALES FROM THE FIELD
Matt Stubbs
?
Leveraging Mainframe Machine and Log Data in Splunk Analytics
Precisely
?
TCO: An Achilles Heel of Hand-Built Data Warehouses
Kalido
?
A Connected Data Landscape: Virtualization and the Internet of Things
Inside Analysis
?
Living in a data driven world by V Laxmikanth Broadridge
Zinnov
?
Moving To MicroServices
David Walker
?
Data Virtualization: An Introduction
Denodo
?
Cisco Connect Toronto 2018 DNA assurance
Cisco Canada
?
When Downtime Isn’t an Option: Performance Optimization Analytics in the Era ...
CA Technologies
?
Smoothing the continuous delivery path a tale of two architectures - expert...
Equal Experts
?
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Caserta
?
The value of a connected factory
Croonwolter&dros
?
Time Difference: How Tomorrow's Companies Will Outpace Today's
Inside Analysis
?

Recently uploaded (20)

DOCX
COT Feb 19, 2025 DLLgvbbnnjjjjjj_Digestive System and its Functions_PISA_CBA....
kayemorales1105
?
PPTX
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
?
DOCX
The Influence off Flexible Work Policies
sales480687
?
PDF
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
PDF
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
?
DOCX
brigada_PROGRAM_25.docx the boys white house
RonelNebrao
?
PDF
A Web Repository System for Data Mining in Drug Discovery
IJDKP
?
PPTX
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
PDF
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
?
PPTX
一比一原版(罢鲍颁毕业证书)开姆尼茨工业大学毕业证如何办理
taqyed
?
PPTX
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
?
PDF
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
?
PDF
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
?
PPTX
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
?
PPTX
727325165-Unit-1-Data-Analytics-PPT-1.pptx
revathi148366
?
PPSX
PPT1_CB_VII_CS_Ch3_FunctionsandChartsinCalc.ppsx
animaroy81
?
PDF
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints: A...
Mahmoud Shoush
?
PDF
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
PPTX
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
?
DOCX
Starbucks in the Indian market through its joint venture.
sales480687
?
COT Feb 19, 2025 DLLgvbbnnjjjjjj_Digestive System and its Functions_PISA_CBA....
kayemorales1105
?
RESEARCH-FINAL-GROUP-3, about the final .pptx
gwapokoha1
?
The Influence off Flexible Work Policies
sales480687
?
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
TCU EVALUATION FACULTY TCU Taguig City 1st Semester 2017-2018
MELJUN CORTES
?
brigada_PROGRAM_25.docx the boys white house
RonelNebrao
?
A Web Repository System for Data Mining in Drug Discovery
IJDKP
?
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
Blood pressure (3).pdfbdbsbsbhshshshhdhdhshshs
hernandezemma379
?
一比一原版(罢鲍颁毕业证书)开姆尼茨工业大学毕业证如何办理
taqyed
?
english9quizw1-240228142338-e9bcf6fd.pptx
rossanthonytan130
?
Orchestrating Data Workloads With Airflow.pdf
ssuserae5511
?
Business Automation Solution with Excel 1.1.pdf
Vivek Kedia
?
Model Evaluation & Visualisation part of a series of intro modules for data ...
brandonlee626749
?
727325165-Unit-1-Data-Analytics-PPT-1.pptx
revathi148366
?
PPT1_CB_VII_CS_Ch3_FunctionsandChartsinCalc.ppsx
animaroy81
?
Prescriptive Process Monitoring Under Uncertainty and Resource Constraints: A...
Mahmoud Shoush
?
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
Monitoring Improvement ( Pomalaa Branch).pptx
fajarkunee
?
Starbucks in the Indian market through its joint venture.
sales480687
?
Ad

Index Activiti Data on Elasticsearch