The document provides an agenda and overview for a Champion Training module on basic analytics. It includes definitions of descriptive and inferential statistics, how to monitor descriptive statistics over time, examples of numeric display terms like mean, median, and standard deviation, definitions of defects and defective units, and a table for calculating sigma levels and associated defects per million. The training aims to provide champions with knowledge and skills to lead process improvement projects.
1 of 22
Downloaded 10 times
More Related Content
Basic Analytics Module for Sponsors
1. Action plan and SOP for
Special Cause Variation
Determine new Goals
(UCL, LCL)
2. Module 8: Basic Analytics
Welcome to Champion Training
Facilitated by Kaplans Process Improvement Team
3. Agenda of Champion Training Modules
# Module # Pages
1 Introduction 19
2 Project Selection and Engaging Process Improvement 22
3 Champion Role through Project Lifecycle 26
4 Calculating Financial Benefit/ the Cost of Poor Quality 13
5 Define Overview & Tools 26
6 Measure Overview & Tools 27
7 Analyze Overview & Tools 18
8 Basic Analytics 21
9 Improve Overview & Tools 33
10 Control Overview & Tools 28
11 How to Effectuate Change Using Change Management 31
12 Kaplans Work Out 15
4. Purpose of This Training
Provide Kaplan champions with the knowledge and skills to be
effective leaders and coaches to their people engaged in Process
Improvement/Six Sigma projects
Everything should be made as simple as
possible, but not too simple.
Albert Einstein
5. Types of Statistics
Descriptive Statistics are used to describe the basic features of the data
in a study. They provide simple summaries about the sample and the
measures. Together with simple graphics analysis, they form the basis of
virtually every quantitative analysis of data. With descriptive statistics you
are simply describing what is, what the data shows.
Inferential Statistics investigate questions, models and hypotheses. In
many cases, the conclusions from inferential statistics extend beyond the
immediate data alone. For instance, we use inferential statistics to try to
infer from the sample data what the population thinks. Or, we use inferential
statistics to make judgments of the probability that an observed difference
between groups is a dependable one or one that might have happened by
chance in this study. Thus, we use inferential statistics to make inferences
from our data to more general conditions; we use descriptive statistics
simply to describe what's going on in our data.
6. Monitor Descriptive Statistics
Monitor
performance of
the Xs and Ys
over time
Verify that the
improvement
actions on the
Xs have made
the desired
improvement in
the Y
Mean, Median, Mode
Standard Deviation
7. Numeric Display Terms
The number of data points with non-missing values in the data
set.N
The Average
Mean (Arithmetic Mean)
The middle data point in the data set.Median (50th Percentile)
The Value that occurs the most frequently in a data set.Mode
The average distance from the mean.StDev (Standard
Deviation)
The highest value form the lowest 25% of the ranked data.
Q1 (First Quartile or 25th
Percentile)
The lowest value from the highest 25% of the ranked data.
Q3 (Third Quartile or 75th
Percentile)
8. Defects
A DEFECT is failure to conform to customer
requirements
DEFECTIVE is when an entire unit fails to
meet acceptance criteria, regardless of the
number of defects within the unit.
Defective
Defect
Defective
10. Shift The Mean And Reduce Variation
Calculate new process capability after implementing the improvement or design
Determine if the new process capability (process sigma) meets stated goals
See if you achieved the desired shift, variance reduction, or DPMO reduction
11. Sigma and Normal Distribution
As you can see, the curve is divided into a series of equal increments, each
representing one standard deviation from the mean.
13. Causes for Greenbelts Not Completing Project
55
25
8
6 3 3
Not enough
time
Sponsor does
not understand
value
Did not pass
exam
Unclear of
what needed
to be done on
template
Lost template Office closed for
a month
55%
80%
88%
94%
100%97%
Causes
CumulativePercent
14. Central Limit Theorem
http://www.intuitor.com/statistics/CentralLim.html
If a random sample is drawn from any population, the sampling distribution of the sample
mean is approximately normal for a sufficiently large sample size. The larger the sample
size, the more closely the sampling distribution of the sample mean will resemble a normal
distribution
1 3 15 30
15. Yields
Rolled
Throughput
Yield
Receive request for Financial Aid
45,000 DPMO wasted
Step 1 in Financial Aid
28,650 DPMO wasted
Step 2 in Awarding Financial Aid
51,876 DPMO wasted
Financial Aid Awarded
Right
First
Time
125,526 DPMO
wasted opportunities
95.5% Yield (YTP)
97% Yield (YTP)
94.4% Yield (YTP)
Yields can be multiplied with many
process steps. Assumes independent
sources of defects.
YRT = .955*.97*.944 = 87.5%
16. Correlations are not Necessarily Causal
City of Oldenburg, Germany
1930- 1936
X-axis: stork population
Y-axis: human population
What your mother told you about
babies when you were three is still
not right, despite the strong
correlation evidence.
Causal means that one variable results in the other thing occurring. In general, it is
extremely difficult to establish causality between two correlated events or observances.
There are many statistical tools to establish a statistical significant correlation.
Source: Box, Hunter, hunter Statistics For Experiments 1978
17. Regression
Regression can be used for prediction, inference, hypothesis testing, and
modeling of causal relationships
The procedure calculates estimates of the relationship between the
independent variables (advertising, price, etc.) and the dependent variable
(sales).
Simple Linear
Regression Analysis:
Y = b0 + b1X
18. The Control Chart or Shewhart Chart
Observation
IndividualValue
28252219161310741
60
50
40
30
20
10
0
_
X=29.06
UCL=55.24
LCL=2.87
1
Control Chart of Recycle
Process
Center
(usually the Mean)
Special Cause
Variation
Detected
Control
Limits
Common
Cause
Variation
19. Common Distributions
Sample size - Normal Distributions
As the number of samples measured increases, to 30, the distribution
becomes more representative of the population.
Population
sample
NORMAL DISTRIBUTIONS IMPORTANCE
Most variables are approximately normally
distributed. This means we can use the
normal distribution as a model to help us
better understand these variables.
NORMALITY TESTS
Normality tests are used to determine if any
group of data fits a standard normal
distribution