際際滷

際際滷Share a Scribd company logo
Staging Staging Data
Warehouse
Extract Transform Load
Pros:
1. Development Time Designing from the output backwards ensures that only data relevant to the solution is extracted and
processed, potentially reducing development, extract, and processing overhead; and therefore time.
2. Targeted data Due to the targeted nature of the load process, the warehouse contains only data relevant to the presentation.
Administration Overhead Reduced warehouse content simplifies the security regime implemented and hence the
administration overhead.
3. Tools Availability The prolific number of tools available that implement ETL provides flexibility of approach and the
opportunity to identify a most appropriate tool. The proliferation of tools has lead to a competitive functionality war, which
often results in loss of maintainability.
Cons:
1. Flexibility Targeting only relevant data for output means that any future requirements, that may need data that was not
included in the original design, will need to be added to the ETL routines. Due to nature of tight dependency between the
routines developed, this often leads to a need for fundamental re-design and development.
2. Hardware Most third party tools utilize their own engine to implement the ETL process. Regardless of the size of
the solution this can necessitate the investment in additional hardware to implement the tools ETL engine.
3. Learning Curve Implementing a third party tool that uses foreign processes and languages results in the learning
curve that is implicit in all technologies new to an organization and can often lead to following blind alleys in their use due
to lack of experience.
Staging Data
Warehouse
Extract TransformLoad
Data
Warehouse
Pros:
1. Project Management Being able to split the warehouse process into specific and isolated tasks, enables a project to be
designed on a smaller task basis, therefore the project can be broken down into manageable chunks.
2. Flexible & Future Proof In general, in an ELT implementation all data from the sources are loaded into the warehouse
as part of the extract and load process. This, combined with the isolation of the transformation process, means that future
requirements can easily be incorporated into the warehouse structure.
3. Risk minimization Removing the close interdependencies between each stage of the warehouse build process enables
the development process to be isolated, and the individual process design can thus also be isolated. This provides an
excellent platform for change, maintenance and management.
4. Utilize Existing Hardware In implementing ELT as a warehouse build process, the inherent tools provided with the database
engine can be used. Alternatively, the vast majority of the third party ELT tools available employ the use of the
database engines capability and hence the ELT process is run on the same hardware as the database engine underpinning
the data warehouse, using the existing hardware deployed.
5. Utilize Existing Skill sets By using the functionality provided by the database engine, the existing investments in
database skills are re-used to develop the warehouse. No new skills need be learned and the full weight of the experience in
developing the engines technology is utilized, further reducing the cost and risk in the development process.
Cons:
1. Tools: Availability of mature tools as it is an emergent technology
Ad

Recommended

PDF
Why shift from ETL to ELT?
HEXANIKA
PPTX
Etl - Extract Transform Load
ABDUL KHALIQ
PPTX
What is ETL?
Ismail El Gayar
PDF
PCA (Principal component analysis)
Learnbay Datascience
PDF
Is it sensible to use Data Vault at all? Conclusions from a project.
Capgemini
PDF
Introduction to ETL and Data Integration
CloverDX (formerly known as CloverETL)
PPTX
Counting Elements in Streams
Jamie Grier
PPTX
Lect7 Association analysis to correlation analysis
hktripathy
PPT
Query processing-and-optimization
WBUTTUTORIALS
PDF
Data Structure and its Fundamentals
Hitesh Mohapatra
PPTX
Exploratory data analysis with Python
Davis David
PPT
data modeling and models
sabah N
PPTX
Data Lake Overview
James Serra
PPTX
ETL Process
Rashmi Bhat
PDF
Data engineering
Suman Debnath
PPTX
Object database standards, languages and design
Dabbal Singh Mahara
PPTX
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Anastasija Nikiforova
PPTX
Oracle business intelligence overview
nvvrajesh
PDF
Speeding Time to Insight with a Modern ELT Approach
Databricks
PDF
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j
PDF
NOSQLEU - Graph Databases and Neo4j
Tobias Lindaaker
PDF
Dimensionality Reduction
Saad Elbeleidy
PDF
Data warehousing testing strategies cognos
Sandeep Mehta
PPT
Dbms Lec Uog 02
smelltulip
PPT
Dbms relational model
Chirag vasava
PDF
Machine Learning for dummies!
ZOLLHOF - Tech Incubator
PPTX
Delta lake and the delta architecture
Adam Doyle
PDF
[EWTT2022] Strategi Implementasi Database dalam Microservice Architecture.pdf
Equnix Business Solutions
PDF
Vyu転ijte svou Oracle datab叩zi naplno
MarketingArrowECS_CZ

More Related Content

What's hot (20)

PPT
Query processing-and-optimization
WBUTTUTORIALS
PDF
Data Structure and its Fundamentals
Hitesh Mohapatra
PPTX
Exploratory data analysis with Python
Davis David
PPT
data modeling and models
sabah N
PPTX
Data Lake Overview
James Serra
PPTX
ETL Process
Rashmi Bhat
PDF
Data engineering
Suman Debnath
PPTX
Object database standards, languages and design
Dabbal Singh Mahara
PPTX
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Anastasija Nikiforova
PPTX
Oracle business intelligence overview
nvvrajesh
PDF
Speeding Time to Insight with a Modern ELT Approach
Databricks
PDF
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j
PDF
NOSQLEU - Graph Databases and Neo4j
Tobias Lindaaker
PDF
Dimensionality Reduction
Saad Elbeleidy
PDF
Data warehousing testing strategies cognos
Sandeep Mehta
PPT
Dbms Lec Uog 02
smelltulip
PPT
Dbms relational model
Chirag vasava
PDF
Machine Learning for dummies!
ZOLLHOF - Tech Incubator
PPTX
Delta lake and the delta architecture
Adam Doyle
Query processing-and-optimization
WBUTTUTORIALS
Data Structure and its Fundamentals
Hitesh Mohapatra
Exploratory data analysis with Python
Davis David
data modeling and models
sabah N
Data Lake Overview
James Serra
ETL Process
Rashmi Bhat
Data engineering
Suman Debnath
Object database standards, languages and design
Dabbal Singh Mahara
Data Lake or Data Warehouse? Data Cleaning or Data Wrangling? How to Ensure t...
Anastasija Nikiforova
Oracle business intelligence overview
nvvrajesh
Speeding Time to Insight with a Modern ELT Approach
Databricks
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j
NOSQLEU - Graph Databases and Neo4j
Tobias Lindaaker
Dimensionality Reduction
Saad Elbeleidy
Data warehousing testing strategies cognos
Sandeep Mehta
Dbms Lec Uog 02
smelltulip
Dbms relational model
Chirag vasava
Machine Learning for dummies!
ZOLLHOF - Tech Incubator
Delta lake and the delta architecture
Adam Doyle

Similar to Etl elt simplified (20)

PDF
[EWTT2022] Strategi Implementasi Database dalam Microservice Architecture.pdf
Equnix Business Solutions
PDF
Vyu転ijte svou Oracle datab叩zi naplno
MarketingArrowECS_CZ
PPTX
Warehouse Planning and Implementation
SHIKHA GAUTAM
DOC
Hardware enhanced association rule mining
StudsPlanet.com
PPT
Building High Performance MySQL Query Systems and Analytic Applications
Calpont
PPT
Building High Performance MySql Query Systems And Analytic Applications
guest40cda0b
DOCX
Data Gaurd Final Thesis for University in Progress (2).docx
MohdKashif82
DOC
127801976 mobile-shop-management-system-documentation
Nitesh Kumar
PDF
Periodic Auditing of Data in Cloud Using Random Bits
IJTET Journal
PDF
Cloudera federal summit
Matt Carroll
PDF
(Lecture 2)Data Warehouse Architecture.pdf
MobeenMasoudi
PDF
Analysis of a high availability and data integration solution of an electroni...
IvanZennaro
PDF
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET Journal
PDF
Conspectus data warehousing appliances fad or future
David Walker
PDF
MODERN DATA PIPELINE
IRJET Journal
PDF
Graduate Project Summary
JustAnotherAbstraction
PDF
Datasheet foldermanagementpluginforrd
MidVision
DOCX
Research Paper 油Find a peer reviewed article in the following d.docx
eleanorg1
PDF
THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S...
IRJET Journal
PDF
Fluid Data Storage:Driving Flexibility in the Data Center
Kingfin Enterprises Limited
[EWTT2022] Strategi Implementasi Database dalam Microservice Architecture.pdf
Equnix Business Solutions
Vyu転ijte svou Oracle datab叩zi naplno
MarketingArrowECS_CZ
Warehouse Planning and Implementation
SHIKHA GAUTAM
Hardware enhanced association rule mining
StudsPlanet.com
Building High Performance MySQL Query Systems and Analytic Applications
Calpont
Building High Performance MySql Query Systems And Analytic Applications
guest40cda0b
Data Gaurd Final Thesis for University in Progress (2).docx
MohdKashif82
127801976 mobile-shop-management-system-documentation
Nitesh Kumar
Periodic Auditing of Data in Cloud Using Random Bits
IJTET Journal
Cloudera federal summit
Matt Carroll
(Lecture 2)Data Warehouse Architecture.pdf
MobeenMasoudi
Analysis of a high availability and data integration solution of an electroni...
IvanZennaro
IRJET - The 3-Level Database Architectural Design for OLAP and OLTP Ops
IRJET Journal
Conspectus data warehousing appliances fad or future
David Walker
MODERN DATA PIPELINE
IRJET Journal
Graduate Project Summary
JustAnotherAbstraction
Datasheet foldermanagementpluginforrd
MidVision
Research Paper 油Find a peer reviewed article in the following d.docx
eleanorg1
THE SURVEY ON REFERENCE MODEL FOR OPEN STORAGE SYSTEMS INTERCONNECTION MASS S...
IRJET Journal
Fluid Data Storage:Driving Flexibility in the Data Center
Kingfin Enterprises Limited
Ad

More from Ramchandra Koty (7)

PPTX
How software is eating the world
Ramchandra Koty
PPTX
Staff aspiration template
Ramchandra Koty
PPTX
Enabling Continuous Delivery
Ramchandra Koty
PPTX
Reactive programming
Ramchandra Koty
PPTX
Nodejs
Ramchandra Koty
PPTX
Microservice
Ramchandra Koty
PPTX
Docker
Ramchandra Koty
How software is eating the world
Ramchandra Koty
Staff aspiration template
Ramchandra Koty
Enabling Continuous Delivery
Ramchandra Koty
Reactive programming
Ramchandra Koty
Microservice
Ramchandra Koty
Ad

Etl elt simplified

  • 1. Staging Staging Data Warehouse Extract Transform Load Pros: 1. Development Time Designing from the output backwards ensures that only data relevant to the solution is extracted and processed, potentially reducing development, extract, and processing overhead; and therefore time. 2. Targeted data Due to the targeted nature of the load process, the warehouse contains only data relevant to the presentation. Administration Overhead Reduced warehouse content simplifies the security regime implemented and hence the administration overhead. 3. Tools Availability The prolific number of tools available that implement ETL provides flexibility of approach and the opportunity to identify a most appropriate tool. The proliferation of tools has lead to a competitive functionality war, which often results in loss of maintainability. Cons: 1. Flexibility Targeting only relevant data for output means that any future requirements, that may need data that was not included in the original design, will need to be added to the ETL routines. Due to nature of tight dependency between the routines developed, this often leads to a need for fundamental re-design and development. 2. Hardware Most third party tools utilize their own engine to implement the ETL process. Regardless of the size of the solution this can necessitate the investment in additional hardware to implement the tools ETL engine. 3. Learning Curve Implementing a third party tool that uses foreign processes and languages results in the learning curve that is implicit in all technologies new to an organization and can often lead to following blind alleys in their use due to lack of experience.
  • 2. Staging Data Warehouse Extract TransformLoad Data Warehouse Pros: 1. Project Management Being able to split the warehouse process into specific and isolated tasks, enables a project to be designed on a smaller task basis, therefore the project can be broken down into manageable chunks. 2. Flexible & Future Proof In general, in an ELT implementation all data from the sources are loaded into the warehouse as part of the extract and load process. This, combined with the isolation of the transformation process, means that future requirements can easily be incorporated into the warehouse structure. 3. Risk minimization Removing the close interdependencies between each stage of the warehouse build process enables the development process to be isolated, and the individual process design can thus also be isolated. This provides an excellent platform for change, maintenance and management. 4. Utilize Existing Hardware In implementing ELT as a warehouse build process, the inherent tools provided with the database engine can be used. Alternatively, the vast majority of the third party ELT tools available employ the use of the database engines capability and hence the ELT process is run on the same hardware as the database engine underpinning the data warehouse, using the existing hardware deployed. 5. Utilize Existing Skill sets By using the functionality provided by the database engine, the existing investments in database skills are re-used to develop the warehouse. No new skills need be learned and the full weight of the experience in developing the engines technology is utilized, further reducing the cost and risk in the development process. Cons: 1. Tools: Availability of mature tools as it is an emergent technology