際際滷

際際滷Share a Scribd company logo
Big Data with Hortonworks
Steve Howard, EXPRESS
Eric Thorsen, Hortonworks
 Who are we?
 Womens fashion brand started by Limited
Brands in 1980
 Added a mens line in 1986
 Mens spun off into its own brand,
Structure, in 1989
 Reintegrated Structure back into EXPRESS
in 2000 as a dual gender brand
 Limited Brands sold a majority stake in
2007 to GoldenGate Capital
 The company issued an IPO of its stock in
May, 2010
 Target market is 20  30 year olds
 Where have we been?
 Upgrading application and technology portfolio
 Upgrading internal processing systems such as GL and
HR
 Brought ecommerce in-house from hosted platform
 Installed allocation and size profile systems
 Brought order management system in-house
 Currently installing new merchandising and planning
systems
 Allows us to have:
 More flexible merchandise and organization
hierarchy
 Robust reporting
 Supported technology stack to allow for
organizational and associate engagement and
growth
 Business Goals
 Collecting stats on Millennial shopping
behavior
 Increase purchases through Loyalty program
 Improve conversion to loyalty program
 Seeking greater personalization of marketing
messages
 Visibility of customer across all channels
 Single cart
 360属 view of customer
 Better positioned to offer:
 what they want
 when they want it
 where they want it
 What are we doing?
 Need a processing platform that allows us to better understand our
customers at scale, in as close to real time as possible
 Use Hive for large scale analytics
 Use same Tableau reporting solution as existing datastores
 Use HBase for storing customer profiles with real time access
 Custom spring based API
 Hadoop/Hortonworks
 Initially hosted our customer data warehouse with an external partner
 Large capital and expense outlay
 Inflexibility with reporting changes
 2015
 Began buildout of HDP cluster that would allow us to:
 Bring data warehouse in house
 Provide data directly to external partners for deep data analytics
 Run operational reports on customer activity
 Load website request logs to Hadoop for omnichannel analysis
 2016
 Provide data to campaign management partner
 Develop API layer to provide real time access to customer profile records
 How did we get started?
 Install software
 Secure it
 Quick POCs/quick wins
 Weather compared to online checkouts
 Story  Are customers more likely to shop online when bad weather hits a
location close to them
 Plan
 Identify data sources
 Load data
 Construct query
 Results  No correlation whatsoeverthats OK! Sometimes the answer no is
as good as the answer yes. Actually, that is always true.
 Semantic analysis of the twitter feed
 Story - Could we build a language scoring tool to determine if a given tweet
was positive or negative given a search filter?
 Plan  How to obtain data?
 Twitter firehose
 Twitter gardenhose
 Results
 Good  We were able to prove we could load the data realtime using Apache Storm
 Bad  We found that semantic analysis is very hard to get right
 For example, yeah, Im going to shop there again real soon! Is that very positive, or
incredibly sarcastic?
 Using web logs to tie shopper behavior across channels
 Story  Customers purchase in store, browse online, how to track
 Plan
 Customers dont always log in, so how to track?
 Omniture cookies?
 Added our own cookie
 Ensure we protect data at all times
 Results
 We were able to identify shoppers online vs in store
 Looking at operationalization of data flow
 Lessons learned
 Identify all data sources
 Security early
 Allow people time to experiment with the technology
 Semantic analysis of twitter feed - used python natural language toolkit to score words in
tweets with search
 Communicate early and often
 Identify all data sources (web logs, transaction systems, social feeds, etc.)
 Come up with a list of POCs for important business questions
 Failure is OK, just communicate it and document it
 Utilize Hortonworks support for problems
 https://www.linkedin.com/in/steve-howard
 Questions?
Ad

Recommended

Web intelligence
Web intelligence
Koray Kocabas
How to migrate data from Shopify to Shopify by LitExtension
How to migrate data from Shopify to Shopify by LitExtension
LitExtension
Introduction to Cin7 Webinar Presentation
Introduction to Cin7 Webinar Presentation
Marc Zabern
Sales deck (generic) (5)
Sales deck (generic) (5)
Marc Zabern
Measurement Roadmap
Measurement Roadmap
Ani Lopez
Bangalore Executive Seminar 2015: MongoDB - Your database of choice for real ...
Bangalore Executive Seminar 2015: MongoDB - Your database of choice for real ...
MongoDB
Advanced Analytics Implementations at EA scale
Advanced Analytics Implementations at EA scale
Ani Lopez
Data import and widening in Google Analytics
Data import and widening in Google Analytics
Zorin Radovancevic
Hortonworks laurie maclachlan
Hortonworks laurie maclachlan
BigDataExpo
Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer.
Mats Johansson
Big data analytics
Big data analytics
Amr Kamel Deklel
Agile data science
Agile data science
Joel Horwitz
Paris HUG - Agile Analytics Applications on Hadoop
Paris HUG - Agile Analytics Applications on Hadoop
Hortonworks
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Hortonworks
UK - Agile Data Applications on Hadoop
UK - Agile Data Applications on Hadoop
Hortonworks
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
Hortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics Applications
russell_jurney
Agile analytics applications on hadoop
Agile analytics applications on hadoop
Hortonworks
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Apigee | Google Cloud
Big Data and User Segmentation in Mobile Context
Big Data and User Segmentation in Mobile Context
InMobi Technology
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
Hortonworks
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Matt Stubbs
LA HUG - Agile Analytics Applications on HDP
LA HUG - Agile Analytics Applications on HDP
Hortonworks
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Hortonworks
Hadoop Data Reservoir Webinar
Hadoop Data Reservoir Webinar
Platfora
Analyzing Multi-Structured Data
Analyzing Multi-Structured Data
DataWorks Summit
Orange County HUG - Agile Data on HDP
Orange County HUG - Agile Data on HDP
Hortonworks
Taming data lake - scalable metrics model
Taming data lake - scalable metrics model
BharathiRaja Chandrasekaran

More Related Content

Similar to HDP_EXPRESS_LATEST (20)

Hortonworks laurie maclachlan
Hortonworks laurie maclachlan
BigDataExpo
Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer.
Mats Johansson
Big data analytics
Big data analytics
Amr Kamel Deklel
Agile data science
Agile data science
Joel Horwitz
Paris HUG - Agile Analytics Applications on Hadoop
Paris HUG - Agile Analytics Applications on Hadoop
Hortonworks
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Hortonworks
UK - Agile Data Applications on Hadoop
UK - Agile Data Applications on Hadoop
Hortonworks
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
Hortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics Applications
russell_jurney
Agile analytics applications on hadoop
Agile analytics applications on hadoop
Hortonworks
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Apigee | Google Cloud
Big Data and User Segmentation in Mobile Context
Big Data and User Segmentation in Mobile Context
InMobi Technology
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
Hortonworks
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Matt Stubbs
LA HUG - Agile Analytics Applications on HDP
LA HUG - Agile Analytics Applications on HDP
Hortonworks
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Hortonworks
Hadoop Data Reservoir Webinar
Hadoop Data Reservoir Webinar
Platfora
Analyzing Multi-Structured Data
Analyzing Multi-Structured Data
DataWorks Summit
Orange County HUG - Agile Data on HDP
Orange County HUG - Agile Data on HDP
Hortonworks
Taming data lake - scalable metrics model
Taming data lake - scalable metrics model
BharathiRaja Chandrasekaran
Hortonworks laurie maclachlan
Hortonworks laurie maclachlan
BigDataExpo
Hortonworks - How Hadoop makes the successful Retailer.
Hortonworks - How Hadoop makes the successful Retailer.
Mats Johansson
Agile data science
Agile data science
Joel Horwitz
Paris HUG - Agile Analytics Applications on Hadoop
Paris HUG - Agile Analytics Applications on Hadoop
Hortonworks
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Utrecht NL-HUG/Data Science-NL - Agile Data 際際滷s
Hortonworks
UK - Agile Data Applications on Hadoop
UK - Agile Data Applications on Hadoop
Hortonworks
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
SoftServe
Hortonworks: Agile Analytics Applications
Hortonworks: Agile Analytics Applications
russell_jurney
Agile analytics applications on hadoop
Agile analytics applications on hadoop
Hortonworks
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Deep-Dive: Predicting Customer Behavior with Apigee Insights
Apigee | Google Cloud
Big Data and User Segmentation in Mobile Context
Big Data and User Segmentation in Mobile Context
InMobi Technology
Hortonworks and HP Vertica Webinar
Hortonworks and HP Vertica Webinar
Hortonworks
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Big Data LDN 2016: Case Studies of Business Transformation through Big Data
Matt Stubbs
LA HUG - Agile Analytics Applications on HDP
LA HUG - Agile Analytics Applications on HDP
Hortonworks
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Leverage Big Data to Enhance Customer Experience in Telecommunications with...
Hortonworks
Hadoop Data Reservoir Webinar
Hadoop Data Reservoir Webinar
Platfora
Analyzing Multi-Structured Data
Analyzing Multi-Structured Data
DataWorks Summit
Orange County HUG - Agile Data on HDP
Orange County HUG - Agile Data on HDP
Hortonworks

HDP_EXPRESS_LATEST

  • 1. Big Data with Hortonworks Steve Howard, EXPRESS Eric Thorsen, Hortonworks
  • 2. Who are we? Womens fashion brand started by Limited Brands in 1980 Added a mens line in 1986 Mens spun off into its own brand, Structure, in 1989 Reintegrated Structure back into EXPRESS in 2000 as a dual gender brand Limited Brands sold a majority stake in 2007 to GoldenGate Capital The company issued an IPO of its stock in May, 2010 Target market is 20 30 year olds
  • 3. Where have we been? Upgrading application and technology portfolio Upgrading internal processing systems such as GL and HR Brought ecommerce in-house from hosted platform Installed allocation and size profile systems Brought order management system in-house Currently installing new merchandising and planning systems Allows us to have: More flexible merchandise and organization hierarchy Robust reporting Supported technology stack to allow for organizational and associate engagement and growth
  • 4. Business Goals Collecting stats on Millennial shopping behavior Increase purchases through Loyalty program Improve conversion to loyalty program Seeking greater personalization of marketing messages Visibility of customer across all channels Single cart 360属 view of customer Better positioned to offer: what they want when they want it where they want it
  • 5. What are we doing? Need a processing platform that allows us to better understand our customers at scale, in as close to real time as possible Use Hive for large scale analytics Use same Tableau reporting solution as existing datastores Use HBase for storing customer profiles with real time access Custom spring based API
  • 6. Hadoop/Hortonworks Initially hosted our customer data warehouse with an external partner Large capital and expense outlay Inflexibility with reporting changes 2015 Began buildout of HDP cluster that would allow us to: Bring data warehouse in house Provide data directly to external partners for deep data analytics Run operational reports on customer activity Load website request logs to Hadoop for omnichannel analysis 2016 Provide data to campaign management partner Develop API layer to provide real time access to customer profile records
  • 7. How did we get started? Install software Secure it Quick POCs/quick wins
  • 8. Weather compared to online checkouts Story Are customers more likely to shop online when bad weather hits a location close to them Plan Identify data sources Load data Construct query Results No correlation whatsoeverthats OK! Sometimes the answer no is as good as the answer yes. Actually, that is always true.
  • 9. Semantic analysis of the twitter feed Story - Could we build a language scoring tool to determine if a given tweet was positive or negative given a search filter? Plan How to obtain data? Twitter firehose Twitter gardenhose Results Good We were able to prove we could load the data realtime using Apache Storm Bad We found that semantic analysis is very hard to get right For example, yeah, Im going to shop there again real soon! Is that very positive, or incredibly sarcastic?
  • 10. Using web logs to tie shopper behavior across channels Story Customers purchase in store, browse online, how to track Plan Customers dont always log in, so how to track? Omniture cookies? Added our own cookie Ensure we protect data at all times Results We were able to identify shoppers online vs in store Looking at operationalization of data flow
  • 11. Lessons learned Identify all data sources Security early Allow people time to experiment with the technology Semantic analysis of twitter feed - used python natural language toolkit to score words in tweets with search Communicate early and often Identify all data sources (web logs, transaction systems, social feeds, etc.) Come up with a list of POCs for important business questions Failure is OK, just communicate it and document it Utilize Hortonworks support for problems