The document discusses a data brewery meetup in Dallas for increasing business data awareness and literacy. It provides an overview of the data pipeline process including data sources, extraction, transformation, modeling, and decisioning. The meetup agenda is outlined which includes aligning understanding, sharing stories and use cases, discussion, and networking. The overall goals are to get feedback, answers, make connections, and share knowledge.
10. Decisioning
Governance
Presentation, Exploration
and Publishing
Analytical ModelingCleansing, Transformation and IntegrationExtractionDiscovery and
Acquisition
Data Sources
Mapping
Audit
Crowd Sourcing
Crawling
Manual
Digitization
web pages
text documents
structured
documents
databases
scienti鍖c data
Bulk Digitization
Scraping
Parsing
Loading to Data
Store
Automation
Data Pipes
ETL Process
Management
Data Quality
Management
Auditability and
Provenance
Reference Data
Management
Metadata
Visualization
Method Selection
Visualization and
Plotting
Report
Development
Publishing Online
Map Geo-TaggingStory Telling
Natural Language
Processing
Merging, Joining Handling Manual
Corrections
Using Reference
Data
Normalization Entity Uniqueness
Treating
Duplicates
Indexing and
Optimization
Data Formats and
Standards
Changing
Dimensions
Analytical Model
Development
Graph/Network
Metrics
Online Analytical
Processing
Business Rules
Regression Outliers
Segmentation and
Clustering
Simulation
Shopping Basket
Analysis
Customer Value
Computation
Campaign
Management
Automated
Decisioning
Data Granularity
Behavior and
Impact
pipeline