This document discusses measures of dispersion and the normal distribution. It defines measures of dispersion as ways to quantify the variability in a data set beyond measures of central tendency like mean, median, and mode. The key measures discussed are range, quartile deviation, mean deviation, and standard deviation. It provides formulas and examples for calculating each measure. The document then explains the normal distribution as a theoretical probability distribution important in statistics. It outlines the characteristics of the normal curve and provides examples of using the normal distribution and calculating z-scores.
Describing quantitative data with numbersUlster BOCES
油
1. Quantitative data can be summarized using measures of center (mean, median), spread (range, IQR, standard deviation), and position (quartiles, percentiles, z-scores).
2. The mean is more affected by outliers than the median. The median is more resistant to outliers and a better measure of center for skewed data.
3. Additional summaries like the five-number summary and boxplots provide a graphical view of the distribution and identify potential outliers.
These is info only ill be attaching the questions work CJ 301 .docxmeagantobias
油
This document discusses measures of variability and dispersion in descriptive statistics. It defines variability as how scores differ from each other or from the mean. Four measures of dispersion are discussed: range, mean deviation, variance, and standard deviation. Standard deviation is described as the average distance from the mean and the most commonly used measure. Examples are provided to demonstrate how to calculate standard deviation step-by-step. The standard deviation is then used to estimate what percentage of values fall within certain ranges from the mean based on the normal distribution curve.
This document provides an introduction to inferential statistics and statistical significance. It discusses key concepts like standard error of the mean, confidence intervals, and comparing means from two samples using a t-test. The document explains how inferential statistics allow researchers to make inferences about populations based on samples and determine if observed differences are likely due to chance or a real effect.
This document discusses various statistical measures used to summarize and describe data, including measures of central tendency (mean, median, mode) and measures of dispersion (range, variance, standard deviation). It provides definitions and examples of calculating each measure. Standardized scores like z-scores and t-scores are also introduced as ways to compare performance across different tests or distributions. Exercises are included for readers to practice calculating and interpreting these common descriptive statistics.
CJ 301 Measures of DispersionVariability Think back to the .docxmonicafrancis71118
油
CJ 301 Measures of Dispersion/Variability
Think back to the description of measures of central tendency that describes these statistics as measures of how the data in a distribution are clustered, around what summary measure are most of the data points clustered.
But when comes to descriptive statistics and describing the characteristics of a distribution, averages are only half story. The other half is measures of variability.
In the most simple of terms, variability reflects how scores differ from one another. For example, the following set of scores shows some variability:
7, 6, 3, 3, 1
The following set of scores has the same mean (4) and has less variability than the previous set:
3, 4, 4, 5, 4
The next set has no variability at all the scores do not differ from one another but it also has the same mean as the other two sets we just showed you.
4, 4, 4, 4, 4
Variability (also called spread or dispersion) can be thought of as a measure of how different scores are from one another. It is even more accurate (and maybe even easier) to think of variability as how different scores are from one particular score. And what score do you think that might be? Well, instead of comparing each score to every other score in a distribution, the one score that could be used as a comparison is that is right- the mean. So, variability becomes a measure of how much each score in a group of scores differs from the mean.
Remember what you already know about computing averages that an average (whether it is the mean, the median or the mode) is a representative score in a set of scores. Now, add your new knowledge about variability- that it reflects how different scores are from one another. Each is important descriptive statistic. Together, these two (average and variability) can be used to describe the characteristics of a distribution and show how distribution differ from one another.
Measures of dispersion/variability describe how the data in a distribution are scattered or dispersed around, or from, the central point represented by the measure of central tendency.
We will discuss four different measures of dispersion, the range, the mean deviation, the variance, and the standard deviation.
RANGE
The range is a very simple measure of dispersion to calculate and interpret. The range is simply the difference between the highest score and the lowest score in a distribution.
Consider the following distribution that measures the Age of a random sample of eight police officers in a small rural jurisdiction.
Officer X = Age_
1 41
2 20
3 35
4 25
5 23
6 30
7 21
8 32
First, lets calculate the mean as our measure of central tendency by adding the individual ages of each officer and dividing by the number of officers. The calculation is 227/8 = 28.375 years.
In general, the formula for the range is:
R=h-l
Where:
揃 r is the range
揃 h is the highest score in the .
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
This document provides an overview of basic statistics concepts including descriptive statistics, measures of central tendency, variability, sampling, and distributions. It defines key terms like mean, median, mode, range, standard deviation, variance, and quantiles. Examples are provided to demonstrate how to calculate and interpret these common statistical measures.
The document provides information about measures of central tendency (mean, median, mode) and measures of dispersion (range, quartiles, variance, standard deviation) using examples of data distributions. It defines key terms like mean, median, mode, range, quartiles, variance and standard deviation. It also shows how to calculate and interpret these measures of central tendency and dispersion using sample data sets.
Measures of dispersion describe how similar or spread out scores in a data set are. The three main measures are range, semi-interquartile range (SIR), and variance/standard deviation. Range is the difference between the highest and lowest scores. SIR is less affected by outliers than range. Variance/standard deviation quantify the average squared distance of scores from the mean, with larger numbers indicating more spread out data. Skew and kurtosis further describe the shape of distributions.
This document provides an outline and overview of descriptive statistics. It discusses the key concepts including:
- Visualizing and understanding data through graphs and charts
- Measures of central tendency like mean, median, and mode
- Measures of spread like range, standard deviation, and interquartile range
- Different types of distributions like symmetrical, skewed, and their properties
- Levels of measurement for variables and appropriate statistics for each level
The document serves as an introduction to descriptive statistics, the goals of which are to summarize key characteristics of data through numerical and visual methods.
This document discusses measures of dispersion, which indicate how spread out or variable a set of data is. There are three main measures: the range, which is the difference between the highest and lowest values; the semi-interquartile range (SIR), which is the difference between the first and third quartiles divided by two; and variance/standard deviation. Variance is the average of the squared deviations from the mean, while standard deviation is the square root of the variance. These measures provide summaries of how concentrated or dispersed the observed values are from the average or expected value.
ch-4-measures-of-variability-11 2.ppt for nursingwindri3
油
This document discusses measures of variability used in statistics. It defines variability as the spread or dispersion of scores. The key measures of variability discussed are the range, variance, and standard deviation. The range is the difference between the highest and lowest scores. The variance is the average of the squared deviations from the mean and represents how far the scores deviate from the mean. The standard deviation is the square root of the variance and represents how much scores typically deviate from the mean. Larger standard deviations indicate greater variability in the scores.
This document discusses measures of variability used in statistics. It defines variability as the spread or dispersion of scores. The key measures of variability discussed are the range, variance, and standard deviation. The range is the difference between the highest and lowest scores. The variance is the average of the squared deviations from the mean and represents how far the scores deviate from the mean. The standard deviation is the square root of the variance and represents how much scores typically deviate from the mean. Larger standard deviations indicate greater variability in the scores.
This document discusses various statistical measures used to summarize and analyze data. It covers measures of central tendency (mean, median, mode), measures of variability (range, variance, standard deviation), measures of the shape of a distribution (skewness and kurtosis), and methods for comparing multiple groups (pooled mean and variance). It also discusses concepts like outliers, box plots, z-scores, correlations, and Pearson's correlation coefficient. The document provides definitions, formulas, and examples to explain each statistical measure.
Production involves transformation of inputs such as capital, equipment, labo...Krushna Ktk
油
The document discusses measures of dispersion in statistics. It begins by introducing the concept of measures of dispersion and how they are needed in addition to measures of central tendency to fully understand the variability in a data set. It defines four main measures of dispersion - range, quartile deviation, mean deviation, and standard deviation. It then focuses on explaining range and mean deviation in detail. Mean deviation is the average of the absolute deviations from a central value like the mean or median. It provides an example to illustrate the step-by-step calculation of mean deviation for both ungrouped and grouped data.
This document provides an overview of key concepts in descriptive statistics including measures of central tendency (mode, median, mean), measures of dispersion (range, variance, standard deviation), the normal distribution, z-scores, hypothesis testing, and the t-distribution. It defines each concept and provides examples of calculating and interpreting common statistics.
Basic Statistical Descriptions of Data.pptxAnusuya123
油
This document provides an overview of 7 basic statistical concepts for data science: 1) descriptive statistics such as mean, mode, median, and standard deviation, 2) measures of variability like variance and range, 3) correlation, 4) probability distributions, 5) regression, 6) normal distribution, and 7) types of bias. Descriptive statistics are used to summarize data, variability measures dispersion, correlation measures relationships between variables, and probability distributions specify likelihoods of events. Regression models relationships, normal distribution is often assumed, and biases can influence analyses.
The document discusses measures of dispersion, which describe how varied or spread out a data set is around the average value. It defines several measures of dispersion, including range, interquartile range, mean deviation, and standard deviation. The standard deviation is described as the most important measure, as it takes into account all values in the data set and is not overly influenced by outliers. The document provides a detailed example of calculating the standard deviation, which involves finding the differences from the mean, squaring those values, summing them, and taking the square root.
Descriptive statistics are used to summarize and describe characteristics of a data set. It includes measures of central tendency like mean, median, and mode, measures of variability like range and standard deviation, and the distribution of data through histograms. Inferential statistics are used to generalize results from a sample to the population it represents through estimation of population parameters and hypothesis testing. Correlation and regression analysis are used to study relationships between two or more variables.
The document discusses measures of variability in statistics including range, interquartile range, standard deviation, and variance. It provides examples of calculating each measure using sample data sets. The range is the difference between the highest and lowest values, while the interquartile range is the difference between the third and first quartiles. The standard deviation represents the average amount of dispersion from the mean, and variance is the average of the squared deviations from the mean. Both standard deviation and variance increase with greater variability in the data set.
This document defines and explains various measures of dispersion used in statistics, including range, semi-interquartile range (SIR), variance, standard deviation, skew, and kurtosis. It provides formulas for calculating each measure and discusses how to interpret the results, such as how skew values indicate the direction and degree of asymmetry and how kurtosis compares the spread of a distribution to a normal distribution. The purpose of measuring dispersion is to understand the variability and shape of distributions for purposes like determining reliability and facilitating other statistical analyses.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
This document provides an overview of basic statistics concepts including descriptive statistics, measures of central tendency, variability, sampling, and distributions. It defines key terms like mean, median, mode, range, standard deviation, variance, and quantiles. Examples are provided to demonstrate how to calculate and interpret these common statistical measures.
The document provides information about measures of central tendency (mean, median, mode) and measures of dispersion (range, quartiles, variance, standard deviation) using examples of data distributions. It defines key terms like mean, median, mode, range, quartiles, variance and standard deviation. It also shows how to calculate and interpret these measures of central tendency and dispersion using sample data sets.
Measures of dispersion describe how similar or spread out scores in a data set are. The three main measures are range, semi-interquartile range (SIR), and variance/standard deviation. Range is the difference between the highest and lowest scores. SIR is less affected by outliers than range. Variance/standard deviation quantify the average squared distance of scores from the mean, with larger numbers indicating more spread out data. Skew and kurtosis further describe the shape of distributions.
This document provides an outline and overview of descriptive statistics. It discusses the key concepts including:
- Visualizing and understanding data through graphs and charts
- Measures of central tendency like mean, median, and mode
- Measures of spread like range, standard deviation, and interquartile range
- Different types of distributions like symmetrical, skewed, and their properties
- Levels of measurement for variables and appropriate statistics for each level
The document serves as an introduction to descriptive statistics, the goals of which are to summarize key characteristics of data through numerical and visual methods.
This document discusses measures of dispersion, which indicate how spread out or variable a set of data is. There are three main measures: the range, which is the difference between the highest and lowest values; the semi-interquartile range (SIR), which is the difference between the first and third quartiles divided by two; and variance/standard deviation. Variance is the average of the squared deviations from the mean, while standard deviation is the square root of the variance. These measures provide summaries of how concentrated or dispersed the observed values are from the average or expected value.
ch-4-measures-of-variability-11 2.ppt for nursingwindri3
油
This document discusses measures of variability used in statistics. It defines variability as the spread or dispersion of scores. The key measures of variability discussed are the range, variance, and standard deviation. The range is the difference between the highest and lowest scores. The variance is the average of the squared deviations from the mean and represents how far the scores deviate from the mean. The standard deviation is the square root of the variance and represents how much scores typically deviate from the mean. Larger standard deviations indicate greater variability in the scores.
This document discusses measures of variability used in statistics. It defines variability as the spread or dispersion of scores. The key measures of variability discussed are the range, variance, and standard deviation. The range is the difference between the highest and lowest scores. The variance is the average of the squared deviations from the mean and represents how far the scores deviate from the mean. The standard deviation is the square root of the variance and represents how much scores typically deviate from the mean. Larger standard deviations indicate greater variability in the scores.
This document discusses various statistical measures used to summarize and analyze data. It covers measures of central tendency (mean, median, mode), measures of variability (range, variance, standard deviation), measures of the shape of a distribution (skewness and kurtosis), and methods for comparing multiple groups (pooled mean and variance). It also discusses concepts like outliers, box plots, z-scores, correlations, and Pearson's correlation coefficient. The document provides definitions, formulas, and examples to explain each statistical measure.
Production involves transformation of inputs such as capital, equipment, labo...Krushna Ktk
油
The document discusses measures of dispersion in statistics. It begins by introducing the concept of measures of dispersion and how they are needed in addition to measures of central tendency to fully understand the variability in a data set. It defines four main measures of dispersion - range, quartile deviation, mean deviation, and standard deviation. It then focuses on explaining range and mean deviation in detail. Mean deviation is the average of the absolute deviations from a central value like the mean or median. It provides an example to illustrate the step-by-step calculation of mean deviation for both ungrouped and grouped data.
This document provides an overview of key concepts in descriptive statistics including measures of central tendency (mode, median, mean), measures of dispersion (range, variance, standard deviation), the normal distribution, z-scores, hypothesis testing, and the t-distribution. It defines each concept and provides examples of calculating and interpreting common statistics.
Basic Statistical Descriptions of Data.pptxAnusuya123
油
This document provides an overview of 7 basic statistical concepts for data science: 1) descriptive statistics such as mean, mode, median, and standard deviation, 2) measures of variability like variance and range, 3) correlation, 4) probability distributions, 5) regression, 6) normal distribution, and 7) types of bias. Descriptive statistics are used to summarize data, variability measures dispersion, correlation measures relationships between variables, and probability distributions specify likelihoods of events. Regression models relationships, normal distribution is often assumed, and biases can influence analyses.
The document discusses measures of dispersion, which describe how varied or spread out a data set is around the average value. It defines several measures of dispersion, including range, interquartile range, mean deviation, and standard deviation. The standard deviation is described as the most important measure, as it takes into account all values in the data set and is not overly influenced by outliers. The document provides a detailed example of calculating the standard deviation, which involves finding the differences from the mean, squaring those values, summing them, and taking the square root.
Descriptive statistics are used to summarize and describe characteristics of a data set. It includes measures of central tendency like mean, median, and mode, measures of variability like range and standard deviation, and the distribution of data through histograms. Inferential statistics are used to generalize results from a sample to the population it represents through estimation of population parameters and hypothesis testing. Correlation and regression analysis are used to study relationships between two or more variables.
The document discusses measures of variability in statistics including range, interquartile range, standard deviation, and variance. It provides examples of calculating each measure using sample data sets. The range is the difference between the highest and lowest values, while the interquartile range is the difference between the third and first quartiles. The standard deviation represents the average amount of dispersion from the mean, and variance is the average of the squared deviations from the mean. Both standard deviation and variance increase with greater variability in the data set.
This document defines and explains various measures of dispersion used in statistics, including range, semi-interquartile range (SIR), variance, standard deviation, skew, and kurtosis. It provides formulas for calculating each measure and discusses how to interpret the results, such as how skew values indicate the direction and degree of asymmetry and how kurtosis compares the spread of a distribution to a normal distribution. The purpose of measuring dispersion is to understand the variability and shape of distributions for purposes like determining reliability and facilitating other statistical analyses.
Teacher Sarah arranges the seating of her three tutees - Ana, Beauty, and Carl - differently each Saturday to see if their learning is affected by the seating arrangement. There are 6 possible seating arrangements that can be found using systematic listing, a tree diagram, or a table. The document then provides examples of determining the number of permutations in different situations using the fundamental counting principle and factorial notation.
11-permutations-and-combinations for grade 10.pptKirbyRaeDiaz2
油
This document discusses permutations and combinations. Permutations refer to ordered arrangements, while combinations refer to unordered arrangements where order does not matter. The key formulas are presented: P(n,r) refers to the number of r-permutations of a set of n elements, while C(n,r) refers to the number of r-combinations of a set of n elements. Several examples are worked out, such as the number of poker hands and blackjack hands. Corollaries to the combination formula are also discussed.
Section 5 - Permutations and CombinationsKirbyRaeDiaz2
油
1. The document discusses counting principles such as the fundamental counting principle, permutations, and combinations.
2. The fundamental counting principle states that if one event can occur in m ways and a second independent event can occur in n ways, the number of possible outcomes is mn.
3. Permutations refer to arrangements of objects in a particular order, while combinations refer to subsets of objects where order is unimportant.
Fundamental Counting Principle Q3 Week 1KirbyRaeDiaz2
油
The document discusses the fundamental counting principle for determining the number of possible outcomes in a given situation. It provides examples of applying the principle to situations with two or more groups of items that can be chosen from. These include determining the number of possible outfits given various clothing items, the number of ways to order a two-course meal from menu options, and the number of possible schedules or multiple-choice test answers involving multiple choice groups. The principle states that if one event can occur in m ways and another in n ways, the number of ways both can occur is mn, and if there are more than two groups, the counts are multiplied across all groups.
Grade 10 Math Quarter 2 Equation of the CircleKirbyRaeDiaz2
油
The document defines a circle as a set of points equidistant from a center point. It provides the standard equation of a circle in general form and for when the center is at the origin. Examples are given of writing the equation of a circle given its center and radius, or center and a point on the circle. The document also shows how to determine if a given point lies inside, outside or on the circle using the standard equation.
This document discusses the axiomatic structure of geometry. It defines key terms in geometry like point, line, segment, angle, and perpendicular. An axiomatic structure has three main properties - consistency, independence, and completeness. Consistency means no contradictions, independence means axioms are not derived from each other, and completeness means every statement can be proven true or false. The document provides definitions for important geometric concepts to precisely define their meanings.
1. Euclid defined axioms as common notions used throughout mathematics, while postulates were specific assumptions used in geometry.
2. The document lists Euclid's common axioms like "things equal to the same thing are equal" and postulates like "a straight line can be drawn between any two points."
3. It provides examples of theorems like two lines cannot intersect in more than one point, and exercises analyzing statements and defining terms related to geometry.
How to Configure Flexible Working Schedule in Odoo 18 EmployeeCeline George
油
In this slide, well discuss on how to configure flexible working schedule in Odoo 18 Employee module. In Odoo 18, the Employee module offers powerful tools to configure and manage flexible working schedules tailored to your organization's needs.
Finals of Kaun TALHA : a Travel, Architecture, Lifestyle, Heritage and Activism quiz, organized by Conquiztadors, the Quiz society of Sri Venkateswara College under their annual quizzing fest El Dorado 2025.
How to use Init Hooks in Odoo 18 - Odoo 際際滷sCeline George
油
In this slide, well discuss on how to use Init Hooks in Odoo 18. In Odoo, Init Hooks are essential functions specified as strings in the __init__ file of a module.
Prelims of Rass MELAI : a Music, Entertainment, Literature, Arts and Internet Culture Quiz organized by Conquiztadors, the Quiz society of Sri Venkateswara College under their annual quizzing fest El Dorado 2025.
Database population in Odoo 18 - Odoo slidesCeline George
油
In this slide, well discuss the database population in Odoo 18. In Odoo, performance analysis of the source code is more important. Database population is one of the methods used to analyze the performance of our code.
How to Configure Restaurants in Odoo 17 Point of SaleCeline George
油
Odoo, a versatile and integrated business management software, excels with its robust Point of Sale (POS) module. This guide delves into the intricacies of configuring restaurants in Odoo 17 POS, unlocking numerous possibilities for streamlined operations and enhanced customer experiences.
Research & Research Methods: Basic Concepts and Types.pptxDr. Sarita Anand
油
This ppt has been made for the students pursuing PG in social science and humanities like M.Ed., M.A. (Education), Ph.D. Scholars. It will be also beneficial for the teachers and other faculty members interested in research and teaching research concepts.
Useful environment methods in Odoo 18 - Odoo 際際滷sCeline George
油
In this slide well discuss on the useful environment methods in Odoo 18. In Odoo 18, environment methods play a crucial role in simplifying model interactions and enhancing data processing within the ORM framework.
APM People Interest Network Conference 2025
-Autonomy, Teams and Tension: Projects under stress
-Tim Lyons
-The neurological levels of
team-working: Harmony and tensions
With a background in projects spanning more than 40 years, Tim Lyons specialised in the delivery of large, complex, multi-disciplinary programmes for clients including Crossrail, Network Rail, ExxonMobil, Siemens and in patent development. His first career was in broadcasting, where he designed and built commercial radio station studios in Manchester, Cardiff and Bristol, also working as a presenter and programme producer. Tim now writes and presents extensively on matters relating to the human and neurological aspects of projects, including communication, ethics and coaching. He holds a Masters degree in NLP, is an NLP Master Practitioner and International Coach. He is the Deputy Lead for APMs People Interest Network.
Session | The Neurological Levels of Team-working: Harmony and Tensions
Understanding how teams really work at conscious and unconscious levels is critical to a harmonious workplace. This session uncovers what those levels are, how to use them to detect and avoid tensions and how to smooth the management of change by checking you have considered all of them.
Digital Tools with AI for e-Content Development.pptxDr. Sarita Anand
油
This ppt is useful for not only for B.Ed., M.Ed., M.A. (Education) or any other PG level students or Ph.D. scholars but also for the school, college and university teachers who are interested to prepare an e-content with AI for their students and others.
Digital Tools with AI for e-Content Development.pptxDr. Sarita Anand
油
Measure of Dispersion - Grade 8 Statistics.ppt
1. Measures of dispersion are descriptive statistics
that show how similar or varied the data are for
a particular variable (or data item).
Measures of spread include the range, quartiles
and the interquartile range, variance, standard
deviation and coefficient of variation.
3. The mode, median, and mean summarise the
data into a single value that is typical or
representative of all the values in the dataset.
But this is only part of the 'picture' that
summarises a dataset.
Measures of spread summarise the data in a way
that shows how scattered the values are and how
much they differ from the mean value.
Batsman A has four innings and scores 25, 25, 25, 25
Batsman B has four innings and scores 0, 0, 0, 100
They both average 25 but they are very different scores.
4. Measures of dispersion are sometimes referred
to as variation or spread.
The main measures of dispersion are:
Range
Quartile deviation
Mean deviation
Standard deviation
Variance
Coefficient of variation
5. Measures the difference between the highest and
the lowest item of the data.
Range = highest observation lowest observation
While easy to calculate and understand, the range
can easily be distorted by extreme values.
7. The quartiles divide the set of measurements into four equal parts.
Twenty-five per cent of the measurements are less than the lower quartile
Fifty per cent of the measurements are less than the median
Seventy-five per cent of the measurements are less than the upper quartile.
So, fifty per cent of the measurements are between the lower quartile and the
upper quartile.
The lower quartile, median and upper quartile are often denoted by Q1, Q2 and
Q3 respectively.
The median is also denoted by m.
.
8. A quartile is found by dividing by dividing
the arrayed data into four quarters.
There will be three quartiles (not four!).
11. Let n = the number of observations
Where n/4 is not a whole number -
let m= the next whole number larger than n/4
the lower quartile is the mth observation of the
sorted data counting from the lower end.
the upper quartile is the mth observation of the
sorted data counting from the upper end.
12. Where n/4 is a whole number - let m= n/4
the lower quartile is halfway between the mth
observation and the (m + 1)th observation of the
sorted data counting from the lower end.
the upper quartile is similarly defined counting
from the upper end
14. The median of an even data set is calculated as
the average of n/2 and [(n/2) +1]
16. By measuring the middle 50% of values only, the
interquartile range overcomes the problem of
outlying observations.
It may be calculated from grouped frequency
distributions that contain open-ended class
intervals
17. Deviation is the difference between each item
of data and the mean.
The mean deviation measures the average
distance of each observation away from the
mean of the data.
Mean deviation gives an equal weight to each
observation and is generally more sensitive
than either the range or interquartile range,
since a change in any value will affect it.
18. 1. Calculate the mean of the data
2. Subtract the mean from each observation
and record the difference
3. Write down the absolute value of each of
the differences (i.e. ignore positive and
negative signs)
4. Calculate the mean of the absolute values
19. The four steps for mean deviation are written as
1. Find x
2. For each x, find x x
3. Now find Ix - x
I for each x
4. Find 裡Ix - x
I and divide by n
21. The batting score of two cricketers, Joe and John were recorded
over their 10 completed innings to date. Their scores were
Joe 32 27 38 25 20 32 34 28
40 29
John 3 80 64 5 11 87 0 2
53 0
1. For each cricketer calculate the batting average (mean
score) and the mean deviation
2. There is only one batting position left on the team for
the next match.
Would you pick Joe or John? Why?
24. Mean Deviation calculations for Joe
Score
( x )
Deviation from mean
( x - x
)
Absolute value of
deviation
I x - x
I
32 +1.5 1.5
27 -3.5 3.5
38 +7.5 7.5
25 -5.5 5.5
20 -10.5 10.5
32 +1.5 1.5
34 +3.5 3.5
28 -2.5 2.5
40 +9.5 9.5
29 -1.5 1.5
裡( x - x
) = 0 裡I x - x
I = 47.0
Mean = 30.5
26. Mean Deviation calculations for John
Score
( x )
Deviation from mean
( x - x
)
Absolute value of
deviation
I x - x
I
3 -27.5 27.5
80 +49.5 49.5
64 +33.5 33.5
5 -25.5 25.5
11 -19.5 19.5
87 +56.5 56.5
0 -30.5 30.5
2 -28.5 28.5
53 +22.5 22.5
0 -30.5 30.5
裡( x - x
) = 0 裡I x - x
I = 324.0
Mean = 30.5
28. It depends on your priorities!
If you are looking for a consistent batter, the
choice will be Joe, since he has a much smaller
mean deviation.
While he probably would not make a large score,
his past record indicates he can be relied on to
make a score fairly close to his average (the mean
deviation of his score is less than 5).
29. If you are looking for a batter who could
possibly obtain a large score (and in
doing so considerably help to win a
match) then John will be the choice.
However there also seems a high risk that
he would get a very low score.
30. The standard deviation measures the average
distance each item of data is from the mean.
It differs from the mean deviation in that it
squares each deviation and then finds the square
root of this rather than taking the absolute value.
Standard deviation is the most commonly used
measure of dispersion for statisticians.
34. In practice, it is rare to calculate the value of mu
since populations are usually very large. Instead,
it is far more likely that the sample standard
deviation (denoted by S) will be required.
The formula for calculating S is not the same as
simply substituting S for and n for N. There
are good theoretical reasons for not doing so.
35. If we did this, and used the value of S to estimate
the value of , the result would be too small.
To correct this error, instead of dividing by n we
divide by (n-1). This results in the following
formula for S:
37. A market researcher, Gavin, was interested in the
discrepancy in the prices charged by supermarkets
for a leading brand of pet food. To check this he selected
a random sample of 12 stores and recorded the
price displayed for the same 400 gram can.
The prices in cents were
89 72 77 78 82 94
80 88 85 73 78 76
Find
a) the mean
b) the range of prices
c) the mean deviation of prices
d) the standard deviation of prices
40. Now use the Financial Calculator to Find the Mean and Standard
Deviation check the question to see if it a sample or a population.
41. The standard deviation can not be negative
The more scattered the data, the greater the
standard deviation
The standard deviation of a set of data is zero if, and
only if, the observations are of equal value
A rough guide to whether a calculated answer is
reasonable is for the standard deviation to be
approximately 30% of the range
42. Note for this data set is the standard deviation around 30% of the range?
Range is 94 72 = 22
Standard Deviation is 6.7 22 x .3 = 6.6 . It wont always be this close
43. The standard deviation can never exceed the range of
data
Due to the squaring operation involved in its calculation,
the standard deviation is more influenced by extreme
values than is the mean deviation and is usually slightly
larger than the mean deviation
The square of the standard deviation is called variance
44. Variance measures the spread (in total) of
the data.
Variance is equal to the square of the
standard deviation so
Variance = (Standard Deviation) 2
45. Batsman A has four innings & scores 25, 25, 25, 25
Batsman B scores 0, 0, 0, 100
What are their averages ?
What are their Standard Deviations?
Example using standard deviation
46. Using the calculator Stat Mode 1,1 then
25, xy, 0, ENT,
25,xy, 0, ENT,
25, xy, 0, ENT,
25, xy, 100, ENT
RCL 4 and RCL 7 will give the calculation for
the mean score for each batsman.
48. What is the difference between the
Population and a Sample?
How can I remember that on my calculator?
Sample smaller than the population 5<6 and
8<9? OR S for sample
49. Back to our batsmen .
Batsman A has four innings and scores 25, 25, 25, 25
Batsman B scores 0, 0, 0, 100
What are their Standard Deviations? If we took a sample of
their batting scores perhaps there were 20 innings and we
sampled 4 innings or the population that is they had only
batted 4 times these were the complete scores
Batsman A has a standard deviation of 0 whether it is a
sample or not (RCL 5, RCL 6) and Batsman B has a Standard
Deviation of 50 if it was a sample (RCL 8) and 43.3 if it was
the population (total data) (RCL 9)
Long Hand calculation : -
50. Long Hand calculation :
Sample for A (0^2 + 0^2 + 0^2 + 0^2) / 3 = 0
Population for A (0^2 + 0^2 + 0^2 + 0^2) / 4 = 0
Dev Dev
Scores B From mean Squared
1 0 -25 625
2 0 -25 625
3 0 -25 625
4 100 75 5625
Total 7500
Sum of deviations divided by 3 2500
Now find the square root 50
Sum of deviations divided by 4 1875
Now find the square root 43.30127
51. This is a measure of relative variability.
It is used to measure the changes that have taken
place in a population over time, or to compare
the variability of two populations that are
expressed in different units of measurement.
It is expressed as a percentage rather than in
terms of the units of the particular data.
52. The formula for the coefficient of variation,
denoted by V is:
V = 100 multiplied by S and divided by x
Where x
= the mean of the sample
S = the standard deviation of the sample
V = 100 . S. %
x
56. This is the Standard Deviation divided by the mean that is the ratio of
the standard deviation to the mean the higher the figure the greater the
deviation
Back to Batsman B we would have a Coefficient of variation of 50 / 25 =
2 quite a significant variation
57. Using the calculator for the Standard Deviation Mode 1,0 , then 10, ENT, 15,
ENT. Then RCL 5 since the question said it was a sample ( not RCL 6)
Answer is 4.1231
59. Using Calc Mode, 1,0 (2nd
f , Alpha,0,0 to clear just in case
36, xy, 3, ENT, 37, xy, 3, ENT . Then RCL 4 for the mean and RCL 5 for
sample deviation = 1.70
60. Note we will get the calculator to calculate the standard deviation just
demo long hand calculation here also shouldnt be asked for the Mean
Deviation in a class test.
61. Suggested Questions from Textbook
Select a range of questions from the Problems in this chapter enough so that you feel
comfortable with this topic
Editor's Notes
#1: Measures of dispersion (variability) will provide more information, specifically about the level of spread of the data around the mean, which will make the data more useful for the user.
#2: Summarising the dataset can help us understand the data, especially when the dataset is large.
#14: Work out the mean as well for next segment on Mean Deviation
n/4 is a whole number so Where n/4 is a whole number - let m= n/4
the lower quartile is halfway between the mth observation and the (m + 1)th observation of the sorted data counting from the lower end.
the upper quartile is similarly defined counting from the upper end
The median of an even data set is calculated as the average of n/2 + [(n/2) +1]
#16: but still ignores 50% of the values in the distribution
#17: A measure that does take into account the actual value of each observation is the Mean Deviation.
#19: Calculate the mean of the data
Subtract the mean from each observation and record the difference
Write down the absolute value of each of the differences (i.e. ignore positive and negative signs)
Calculate the mean of the absolute values