狠狠撸

狠狠撸Share a Scribd company logo
CLUSTERING
Stores Dataset
N RAMACHANDRAN
Data Exploration
? The percentage sales is the highest for the category 1 ,Fresh Foods
section(38%)
? The percentage sales is the lowest for the category 3 , Health and
Beauty section at 13%
? Average Sale is around 210 Rs/sqft.
Clustering Methodology used
? K – Means technique used to cluster the different stores.
? Worked to understand the techniques with different number of
clusters : 4 , 5 and 6 at the initial clustering level.
? The 4 initial cluster works the best with the different clusters having a
z score with a good difference from that of the population. The
cluster strengths are greater too .So this looks to be the optimal
number of cluster.
? One of the clusters is then again processed to get 3 more clusters .
Cluster Profiling
? Cluster 1
? No of stores in the cluster : 91
? Cluster Strength : 4.71
? The significant difference between the cluster and the population is the avg
sale amount.It is around 291 Rs/sqft compared to 210Rs/sqft for the
population.
? Frozen food higher than in the population.(28% > 25%)
? Tobacco sales are less compared to the population. (20% < 23%)
Cluster Profiling
? Cluster 2
? No of stores in the cluster : 151 , Cluster Strength : 3.10
? The biggest differentiator here is the Tobacco and Alcohol sections.(27.5% >
23% in the population)
? Average sale is significantly lower than that of the population(163 Rs/sqft <<
210 Rs/sqft)
? The fresh food sections sales are also very low compared to the overall
population.(32% << 38%)
? Health and Beauty sales is slightly better than the population(16.2% > 13.7%)
Cluster Profiling
? Cluster 3
? No of stores in the cluster : 3 , Cluster Strength : 8.33
? Very high difference between the average sales in these stores(471Rs/sqft >
210Rs/sqft)
? Tobacco and Alcohol forms considerably low percentage if compared to the
population.(18% < 23%)
? Fresh Foods among these stores has a higher share of the sales compared to
that in the overall population.
? Frozen food share also is a little greater than in the population.(26.4% >
24.9%)
? Looks to be an outlier group with very less no of stores and significantly
different characteristics from the population.
Cluster Profiling
? Cluster 4
? No of stores in the cluster : 270 .A much higher than expected no of
stores(100-140) in the cluster .
? Cluster Strength : 3.12
? Not very significant changes from the characteristics of the whole population.
? The highest differentiator being the Fresh Food Section ( 41.4% > 38.4%)
? The Tobacco and Alcohol section sales accounts a little less than the average
population(21.4% < 22.8%)
? Another iteration on the stores of this clusters is done to get more details.
Cluster Profiling
? Cluster 4_1
? No of stores in the cluster : 101 , Cluster Strength : 3.26
? Significant difference in the Fresh Food sales in this cluster compared to the
population.(45% > 38%)
? Average sales is around 184 Rs/sqft compared to the 210 Rs/sqft for the
population.
? Tobacco & Alcohol , Health & Beauty sections sales are a tad lower to the
overall population.
Cluster Profiling
? Cluster 4_2
? No of stores in the cluster : 85 , Cluster Strength : 3.42
? It has a very similar characteristics for all the store sections and average sales
when compared to the overall population.
Cluster Profiling
? Cluster 4_3
? No of stores in the cluster : 84 , Cluster Strength : 3.55
? Frozen Food section sales are a little lower than the overall population(20% <
22%)
? Sales/sqft a little higher than the population for the stores in this cluster (233
Rs/sqft > 210 Rs/sqft)
Recommendations
? We find that the cluster 3 with the 3 stores is an outlier .On further
check(as mentioned in the class videos) , its found to be the open
area outside the store to sell the fresh foods that is not accounted for.
? Hence , the stores can maybe try to sell the fresh food items in the
extra open area outside the stores.
? We would need more data to analyze why certain stores and sections
have greater sales in different clusters.
Ad

Recommended

Boots Hair Care Case Study
Boots Hair Care Case Study
Anchit Ahuja
?
ashraf letest cv.
ashraf letest cv.
Ashraf Jahan
?
R&R 2015 Regnskabsafl?ggelse i interessentskaber
R&R 2015 Regnskabsafl?ggelse i interessentskaber
mettelaursen76
?
Final case study powerpoint
Final case study powerpoint
Ramachandran N
?
FSA strategic plan 2015-20
FSA strategic plan 2015-20
Richard McLean
?
Dove andare a vivere a Sydney?
Dove andare a vivere a Sydney?
Cangurizzato
?
10 причин
10 причин
Сапегина Ольга
?
Untitled Presentation
Untitled Presentation
aouatef17
?
Dominick’s retail analysis
Dominick’s retail analysis
Eeshan Srivastava
?
Cluster analysis - Wholesale customers data set
Cluster analysis - Wholesale customers data set
Divya Ganjoo, PMP? CSM?
?
T20 cluster analysis
T20 cluster analysis
kompellark
?
Database Marketing - Dominick's stores in Chicago distric
Database Marketing - Dominick's stores in Chicago distric
Demin Wang
?
Retail Management- Consumer Buying Behavior and Planogram
Retail Management- Consumer Buying Behavior and Planogram
Sumit Chakraborty
?
Clustering
Clustering
Shikha Ujjwal Dalmia
?
Predictive Analytics for Scott Toilet Tissue
Predictive Analytics for Scott Toilet Tissue
Vibhore Agarwal
?
Retail analytics (SAS programming,big data analytics)
Retail analytics (SAS programming,big data analytics)
data-analytics
?
Standard display procedure of vishal mega mart
Standard display procedure of vishal mega mart
noT yeT wOrKinG!! i M sTilL sTuDInG!!
?
Customer Profiling
Customer Profiling
manish gupta
?
GIRISH FOOD STORE ( CASE PRESENTATION
GIRISH FOOD STORE ( CASE PRESENTATION
Jyoti Punj Prakash
?
Retail sales forecasting - predictive data analytics
Retail sales forecasting - predictive data analytics
Ristanti Ramadanti
?
Uday synopsis
Uday synopsis
Uday Dakre
?
A DETAILED ANALYSIS OF A RETAIL BUSINESS DATASET
A DETAILED ANALYSIS OF A RETAIL BUSINESS DATASET
Kehinde (Kenny) Ogunjemilusi
?
Final Project Report - Walmart Sales
Final Project Report - Walmart Sales
Deepti Bahel
?
“Customer perception towards Reliance mart” At Bangalore
“Customer perception towards Reliance mart” At Bangalore
udgamdwd
?
Store location
Store location
Kittu Rajpal
?
Food Shopper Insights: Grocery Shopping Trends in the U.S.
Food Shopper Insights: Grocery Shopping Trends in the U.S.
MarketResearch.com
?
7p's of Retailing Industry
7p's of Retailing Industry
Jalpa Patel
?
Dhara Mehta-Reliance Fresh SIP
Dhara Mehta-Reliance Fresh SIP
MdharaP
?
ppt somu_Jarvis_AI_Assistant_presen.pptx
ppt somu_Jarvis_AI_Assistant_presen.pptx
MohammedumarFarhan
?
Residential Zone 4 for industrial village
Residential Zone 4 for industrial village
MdYasinArafat13
?

More Related Content

Similar to Cluster Analysis (20)

Dominick’s retail analysis
Dominick’s retail analysis
Eeshan Srivastava
?
Cluster analysis - Wholesale customers data set
Cluster analysis - Wholesale customers data set
Divya Ganjoo, PMP? CSM?
?
T20 cluster analysis
T20 cluster analysis
kompellark
?
Database Marketing - Dominick's stores in Chicago distric
Database Marketing - Dominick's stores in Chicago distric
Demin Wang
?
Retail Management- Consumer Buying Behavior and Planogram
Retail Management- Consumer Buying Behavior and Planogram
Sumit Chakraborty
?
Clustering
Clustering
Shikha Ujjwal Dalmia
?
Predictive Analytics for Scott Toilet Tissue
Predictive Analytics for Scott Toilet Tissue
Vibhore Agarwal
?
Retail analytics (SAS programming,big data analytics)
Retail analytics (SAS programming,big data analytics)
data-analytics
?
Standard display procedure of vishal mega mart
Standard display procedure of vishal mega mart
noT yeT wOrKinG!! i M sTilL sTuDInG!!
?
Customer Profiling
Customer Profiling
manish gupta
?
GIRISH FOOD STORE ( CASE PRESENTATION
GIRISH FOOD STORE ( CASE PRESENTATION
Jyoti Punj Prakash
?
Retail sales forecasting - predictive data analytics
Retail sales forecasting - predictive data analytics
Ristanti Ramadanti
?
Uday synopsis
Uday synopsis
Uday Dakre
?
A DETAILED ANALYSIS OF A RETAIL BUSINESS DATASET
A DETAILED ANALYSIS OF A RETAIL BUSINESS DATASET
Kehinde (Kenny) Ogunjemilusi
?
Final Project Report - Walmart Sales
Final Project Report - Walmart Sales
Deepti Bahel
?
“Customer perception towards Reliance mart” At Bangalore
“Customer perception towards Reliance mart” At Bangalore
udgamdwd
?
Store location
Store location
Kittu Rajpal
?
Food Shopper Insights: Grocery Shopping Trends in the U.S.
Food Shopper Insights: Grocery Shopping Trends in the U.S.
MarketResearch.com
?
7p's of Retailing Industry
7p's of Retailing Industry
Jalpa Patel
?
Dhara Mehta-Reliance Fresh SIP
Dhara Mehta-Reliance Fresh SIP
MdharaP
?
Cluster analysis - Wholesale customers data set
Cluster analysis - Wholesale customers data set
Divya Ganjoo, PMP? CSM?
?
T20 cluster analysis
T20 cluster analysis
kompellark
?
Database Marketing - Dominick's stores in Chicago distric
Database Marketing - Dominick's stores in Chicago distric
Demin Wang
?
Retail Management- Consumer Buying Behavior and Planogram
Retail Management- Consumer Buying Behavior and Planogram
Sumit Chakraborty
?
Predictive Analytics for Scott Toilet Tissue
Predictive Analytics for Scott Toilet Tissue
Vibhore Agarwal
?
Retail analytics (SAS programming,big data analytics)
Retail analytics (SAS programming,big data analytics)
data-analytics
?
GIRISH FOOD STORE ( CASE PRESENTATION
GIRISH FOOD STORE ( CASE PRESENTATION
Jyoti Punj Prakash
?
Retail sales forecasting - predictive data analytics
Retail sales forecasting - predictive data analytics
Ristanti Ramadanti
?
Final Project Report - Walmart Sales
Final Project Report - Walmart Sales
Deepti Bahel
?
“Customer perception towards Reliance mart” At Bangalore
“Customer perception towards Reliance mart” At Bangalore
udgamdwd
?
Food Shopper Insights: Grocery Shopping Trends in the U.S.
Food Shopper Insights: Grocery Shopping Trends in the U.S.
MarketResearch.com
?
7p's of Retailing Industry
7p's of Retailing Industry
Jalpa Patel
?
Dhara Mehta-Reliance Fresh SIP
Dhara Mehta-Reliance Fresh SIP
MdharaP
?

Recently uploaded (20)

ppt somu_Jarvis_AI_Assistant_presen.pptx
ppt somu_Jarvis_AI_Assistant_presen.pptx
MohammedumarFarhan
?
Residential Zone 4 for industrial village
Residential Zone 4 for industrial village
MdYasinArafat13
?
最新版美国加利福尼亚大学旧金山法学院毕业证(鲍颁尝补飞厂贵毕业证书)定制
最新版美国加利福尼亚大学旧金山法学院毕业证(鲍颁尝补飞厂贵毕业证书)定制
taqyea
?
Communication_Skills_Class10_Visual.pptx
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
?
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
@Reset-Password.pptx presentakh;kenvtion
@Reset-Password.pptx presentakh;kenvtion
MarkLariosa1
?
最新版美国约翰霍普金斯大学毕业证(闯贬鲍毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(闯贬鲍毕业证书)原版定制
Taqyea
?
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
?
The Influence off Flexible Work Policies
The Influence off Flexible Work Policies
sales480687
?
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
?
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
?
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
?
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
?
Boost Business Efficiency with Professional Data Entry Services
Boost Business Efficiency with Professional Data Entry Services
eloiacs eloiacs
?
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
?
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
?
PPT2 W1L2.pptx.........................................
PPT2 W1L2.pptx.........................................
palicteronalyn26
?
NASA ESE Study Results v4 05.29.2020.pptx
NASA ESE Study Results v4 05.29.2020.pptx
CiroAlejandroCamacho
?
ppt somu_Jarvis_AI_Assistant_presen.pptx
ppt somu_Jarvis_AI_Assistant_presen.pptx
MohammedumarFarhan
?
Residential Zone 4 for industrial village
Residential Zone 4 for industrial village
MdYasinArafat13
?
最新版美国加利福尼亚大学旧金山法学院毕业证(鲍颁尝补飞厂贵毕业证书)定制
最新版美国加利福尼亚大学旧金山法学院毕业证(鲍颁尝补飞厂贵毕业证书)定制
taqyea
?
Communication_Skills_Class10_Visual.pptx
Communication_Skills_Class10_Visual.pptx
namanrastogi70555
?
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
@Reset-Password.pptx presentakh;kenvtion
@Reset-Password.pptx presentakh;kenvtion
MarkLariosa1
?
最新版美国约翰霍普金斯大学毕业证(闯贬鲍毕业证书)原版定制
最新版美国约翰霍普金斯大学毕业证(闯贬鲍毕业证书)原版定制
Taqyea
?
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
?
The Influence off Flexible Work Policies
The Influence off Flexible Work Policies
sales480687
?
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
11_L2_Defects_and_Trouble_Shooting_2014[1].pdf
gun3awan88
?
Measurecamp Copenhagen - Consent Context
Measurecamp Copenhagen - Consent Context
Human37
?
Artigo - Playing to Win.planejamento docx
Artigo - Playing to Win.planejamento docx
KellyXavier15
?
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
?
Boost Business Efficiency with Professional Data Entry Services
Boost Business Efficiency with Professional Data Entry Services
eloiacs eloiacs
?
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
?
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
?
PPT2 W1L2.pptx.........................................
PPT2 W1L2.pptx.........................................
palicteronalyn26
?
NASA ESE Study Results v4 05.29.2020.pptx
NASA ESE Study Results v4 05.29.2020.pptx
CiroAlejandroCamacho
?
Ad

Cluster Analysis

  • 2. Data Exploration ? The percentage sales is the highest for the category 1 ,Fresh Foods section(38%) ? The percentage sales is the lowest for the category 3 , Health and Beauty section at 13% ? Average Sale is around 210 Rs/sqft.
  • 3. Clustering Methodology used ? K – Means technique used to cluster the different stores. ? Worked to understand the techniques with different number of clusters : 4 , 5 and 6 at the initial clustering level. ? The 4 initial cluster works the best with the different clusters having a z score with a good difference from that of the population. The cluster strengths are greater too .So this looks to be the optimal number of cluster. ? One of the clusters is then again processed to get 3 more clusters .
  • 4. Cluster Profiling ? Cluster 1 ? No of stores in the cluster : 91 ? Cluster Strength : 4.71 ? The significant difference between the cluster and the population is the avg sale amount.It is around 291 Rs/sqft compared to 210Rs/sqft for the population. ? Frozen food higher than in the population.(28% > 25%) ? Tobacco sales are less compared to the population. (20% < 23%)
  • 5. Cluster Profiling ? Cluster 2 ? No of stores in the cluster : 151 , Cluster Strength : 3.10 ? The biggest differentiator here is the Tobacco and Alcohol sections.(27.5% > 23% in the population) ? Average sale is significantly lower than that of the population(163 Rs/sqft << 210 Rs/sqft) ? The fresh food sections sales are also very low compared to the overall population.(32% << 38%) ? Health and Beauty sales is slightly better than the population(16.2% > 13.7%)
  • 6. Cluster Profiling ? Cluster 3 ? No of stores in the cluster : 3 , Cluster Strength : 8.33 ? Very high difference between the average sales in these stores(471Rs/sqft > 210Rs/sqft) ? Tobacco and Alcohol forms considerably low percentage if compared to the population.(18% < 23%) ? Fresh Foods among these stores has a higher share of the sales compared to that in the overall population. ? Frozen food share also is a little greater than in the population.(26.4% > 24.9%) ? Looks to be an outlier group with very less no of stores and significantly different characteristics from the population.
  • 7. Cluster Profiling ? Cluster 4 ? No of stores in the cluster : 270 .A much higher than expected no of stores(100-140) in the cluster . ? Cluster Strength : 3.12 ? Not very significant changes from the characteristics of the whole population. ? The highest differentiator being the Fresh Food Section ( 41.4% > 38.4%) ? The Tobacco and Alcohol section sales accounts a little less than the average population(21.4% < 22.8%) ? Another iteration on the stores of this clusters is done to get more details.
  • 8. Cluster Profiling ? Cluster 4_1 ? No of stores in the cluster : 101 , Cluster Strength : 3.26 ? Significant difference in the Fresh Food sales in this cluster compared to the population.(45% > 38%) ? Average sales is around 184 Rs/sqft compared to the 210 Rs/sqft for the population. ? Tobacco & Alcohol , Health & Beauty sections sales are a tad lower to the overall population.
  • 9. Cluster Profiling ? Cluster 4_2 ? No of stores in the cluster : 85 , Cluster Strength : 3.42 ? It has a very similar characteristics for all the store sections and average sales when compared to the overall population.
  • 10. Cluster Profiling ? Cluster 4_3 ? No of stores in the cluster : 84 , Cluster Strength : 3.55 ? Frozen Food section sales are a little lower than the overall population(20% < 22%) ? Sales/sqft a little higher than the population for the stores in this cluster (233 Rs/sqft > 210 Rs/sqft)
  • 11. Recommendations ? We find that the cluster 3 with the 3 stores is an outlier .On further check(as mentioned in the class videos) , its found to be the open area outside the store to sell the fresh foods that is not accounted for. ? Hence , the stores can maybe try to sell the fresh food items in the extra open area outside the stores. ? We would need more data to analyze why certain stores and sections have greater sales in different clusters.