The document provides information about the normal distribution and standard normal distribution. It discusses key properties of the normal distribution including that it is defined by its mean and standard deviation. It also describes the 68-95-99.7 rule for how much of the data falls within 1, 2, and 3 standard deviations of the mean in a normal distribution. The document then introduces the standard normal distribution and how it allows converting any normal distribution to a standard scale for looking up probabilities. It provides examples of calculating probabilities and finding values corresponding to percentiles for both raw and standard normal distributions. Finally, it discusses checking if data are approximately normally distributed.
This document discusses the normal distribution and related concepts. It begins with an introduction to the normal distribution and its properties. It then covers the probability density function and cumulative distribution function of the normal distribution. The rest of the document discusses key properties like the 68-95-99.7 rule, using the standard normal distribution, and how to determine if a data set follows a normal distribution including using a normal probability plot. Examples are provided throughout to illustrate the concepts.
Statistik 1 6 distribusi probabilitas normalSelvin Hadi
?
This document discusses the key characteristics and concepts of the normal probability distribution. It outlines six goals related to understanding the normal distribution, its properties, calculating z-values, and using the normal distribution to approximate the binomial probability distribution. The key points covered include defining the mean, standard deviation, and shape of the normal curve; transforming variables to the standard normal distribution; and determining probabilities based on the areas under the normal curve.
The standard normal curve & its application in biomedical sciencesAbhi Manu
?
1) The document discusses the normal distribution and its applications in statistical inference. It is the most important probability distribution used to model many continuous variables in biomedical fields.
2) The normal distribution is characterized by its mean and standard deviation. It is perfectly symmetrical and bell-shaped. Properties of the normal curve include that about 68%, 95%, and 99.7% of the data lies within 1, 2, and 3 standard deviations of the mean, respectively.
3) The standard normal distribution is used to convert raw scores to z-scores in order to compare variables measured on different scales. Z-scores indicate how many standard deviations a score is above or below the mean and can be used to determine probabilities, percentiles
1. The document discusses the normal distribution and z-distribution (standard normal distribution). It provides definitions, properties, and examples of both.
2. The normal distribution is a bell-shaped curve that is symmetric around the mean. It is defined by its mean and standard deviation. The z-distribution is the standard normal distribution where the mean is 0 and standard deviation is 1.
3. Examples are provided to demonstrate how to calculate probabilities and find z-scores using the normal and z-distributions. Areas under the curve are calculated to find probabilities for various values in relation to the mean.
This document provides information about the normal distribution and calculating z-scores. It includes examples of calculating z-scores based on given means, standard deviations, and individual scores. It also provides examples calculating the mean and standard deviation from raw data and frequency tables. Worked examples are provided to demonstrate how to calculate z-scores in different contexts like test scores, physical attributes, and manufacturing data.
This document defines and explains key statistical concepts including measures of central tendency (mean, median, mode), measures of dispersion (range, standard deviation), and properties of distributions (skewness, symmetry). It provides examples of calculating the mean, median, mode, and standard deviation. It also describes the empirical rule and how a certain percentage of values in a normal distribution fall within 1, 2, or 3 standard deviations of the mean.
The document contains an agenda for a statistics class that includes topics such as the normal distribution, z-scores, and using a z-table to find probabilities. The agenda covers defining population and sample means and standard deviations, analyzing exam score data, continuous probability distributions, the properties of the normal distribution, and how to use a z-table to find probabilities for various z-scores and ranges of z-scores. Examples are provided to demonstrate how to find probabilities and areas under the normal curve. Homework problems are assigned from the textbook.
This document provides an overview of the normal distribution:
- It defines key terms like population, sample, parameters, and statistics.
- The normal distribution is symmetric and bell-shaped. Most data lies near the mean, and the percentage of data on either side of the mean is consistent.
- 68%, 95%, and 99% of data falls within 1, 2, and 3 standard deviations of the mean, respectively, in a normal distribution.
- The document provides an example of calculating the probability of a value being above the mean using the standard normal distribution and z-scores.
The document provides an introduction to the normal distribution including its key characteristics and how it can be used for inference. Some of the main points covered include:
- The normal distribution is symmetric and bell-shaped.
- It is characterized by its mean and standard deviation.
- Knowing that a variable is normally distributed allows us to determine probabilities of outcomes.
- The standard normal distribution has a mean of 0 and standard deviation of 1 and can be used to find probabilities.
- Z-scores indicate how many standard deviations an observation is from the mean and can be looked up in probability tables.
C2 st lecture 13 revision for test b handoutfatima d
?
This document provides an outline for a lecture series revising key concepts for Test B, including:
- Pythagoras' theorem, trigonometry, sine and cosine rules, and calculating triangle areas.
- Probability, probability trees, and examples calculating probabilities of dice rolls.
- Descriptive statistics like mode, median, interquartile range, mean, absolute deviation, and standard deviation.
- Hypothesis testing using z-tests, t-tests, and chi-squared tests; including setting up hypotheses, finding critical values, calculating test statistics, and making conclusions.
The revision is in preparation for Standard Track Test B which will be held the week of April 21st.
This document discusses six measures of variation used to determine how values are distributed in a data set: range, quartile deviation, mean deviation, variance, standard deviation, and coefficient of variation. It provides definitions and examples of calculating each measure. The range is defined as the difference between the highest and lowest values. Quartile deviation uses the interquartile range (Q3-Q1). Mean deviation is the average of the absolute deviations from the mean. Variance and standard deviation measure how spread out values are from the mean, with variance using sums of squares and standard deviation taking the square root of variance.
8. normal distribution qt pgdm 1st semesterKaran Kukreja
?
The document discusses the normal distribution and its key properties. It explains that the normal distribution is a limiting case of the binomial distribution when the number of trials is large. It has a bell-shaped symmetrical curve centered around the mean. The normal distribution is uniquely defined by its mean and standard deviation. The document also covers how to convert between a normal distribution and the standard normal distribution and how to find probabilities using the standard normal distribution table.
The document introduces the Gaussian or normal distribution, its key properties, and how it can be used for inference. The Gaussian distribution is symmetrical and bell-shaped. It is completely defined by its mean and standard deviation. By transforming data into z-scores, the standard normal distribution can be applied to understand the probabilities of outcomes in any normal distribution. The Gaussian distribution and z-scores allow researchers to assess likelihoods and make inferences about variable values based on their known distribution.
This document provides an overview of standard deviation and z-scores. It begins by listing the key learning objectives which are to describe the importance of variation in distributions, understand how to calculate standard deviation, describe what a z-score is and how to calculate them, and learn the Greek letters for mean and standard deviation. It then provides explanations and examples of how to calculate and interpret standard deviation as a measure of variation, how to convert values to z-scores based on the mean and standard deviation, and the importance of ensuring distributions are normal before using these statistical techniques. It emphasizes understanding the concepts rather than just memorizing formulas.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
This document discusses analytical representation of data through descriptive statistics. It begins by showing raw, unorganized data on movie genre ratings. It then demonstrates organizing this data into a frequency distribution table and bar graph to better analyze and describe the data. It also calculates averages for each movie genre. The document then discusses additional descriptive statistics measures like the mean, median, mode, and percentiles to further analyze data through measures of central tendency and dispersion.
The central limit theorem states that the sampling distribution of the sample mean x will be approximately normally distributed for sample sizes n greater than 30, regardless of the shape of the population distribution. Specifically, the sampling distribution of x will have a mean equal to the population mean μ and a standard deviation of σ/√n. Similarly, for a sample proportion p, the sampling distribution of p will be approximately normal for n greater than 10 and np and nq both greater than 10, with mean equal to the population proportion p and standard deviation of √(p(1-p)/n).
?Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.3: Estimating a Population Standard Deviation or Variance
?Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.3: Estimating a Population Standard Deviation or Variance
The document discusses different types of data distributions including normal, binomial, Poisson, exponential, and Bernoulli distributions. It provides the formulas and properties of each distribution. For example, it states that the normal distribution is the most commonly used in data science and has a bell-shaped, symmetric curve. The Poisson distribution outlines the probability of events in fixed time periods and has a rate parameter λ. The exponential distribution gives the probability of time before an event and uses the rate parameter in its formula.
This document provides a summary of key concepts and examples for a statistics quiz on normal distributions, the central limit theorem, confidence intervals, and hypothesis testing. It reviews formulas and how to apply them to calculate probabilities, z-scores, confidence levels, sample sizes, and margins of error. Examples of problems cover finding areas under the normal curve, interpreting confidence intervals, and constructing confidence intervals for means, proportions, and more.
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdfRavinandan A P
?
Biostatistics, Unit-I, Measures of Dispersion, Dispersion
Range
variation of mean
standard deviation
Variance
coefficient of variation
standard error of the mean
The document contains an agenda for a statistics class that includes topics such as the normal distribution, z-scores, and using a z-table to find probabilities. The agenda covers defining population and sample means and standard deviations, analyzing exam score data, continuous probability distributions, the properties of the normal distribution, and how to use a z-table to find probabilities for various z-scores and ranges of z-scores. Examples are provided to demonstrate how to find probabilities and areas under the normal curve. Homework problems are assigned from the textbook.
This document provides an overview of the normal distribution:
- It defines key terms like population, sample, parameters, and statistics.
- The normal distribution is symmetric and bell-shaped. Most data lies near the mean, and the percentage of data on either side of the mean is consistent.
- 68%, 95%, and 99% of data falls within 1, 2, and 3 standard deviations of the mean, respectively, in a normal distribution.
- The document provides an example of calculating the probability of a value being above the mean using the standard normal distribution and z-scores.
The document provides an introduction to the normal distribution including its key characteristics and how it can be used for inference. Some of the main points covered include:
- The normal distribution is symmetric and bell-shaped.
- It is characterized by its mean and standard deviation.
- Knowing that a variable is normally distributed allows us to determine probabilities of outcomes.
- The standard normal distribution has a mean of 0 and standard deviation of 1 and can be used to find probabilities.
- Z-scores indicate how many standard deviations an observation is from the mean and can be looked up in probability tables.
C2 st lecture 13 revision for test b handoutfatima d
?
This document provides an outline for a lecture series revising key concepts for Test B, including:
- Pythagoras' theorem, trigonometry, sine and cosine rules, and calculating triangle areas.
- Probability, probability trees, and examples calculating probabilities of dice rolls.
- Descriptive statistics like mode, median, interquartile range, mean, absolute deviation, and standard deviation.
- Hypothesis testing using z-tests, t-tests, and chi-squared tests; including setting up hypotheses, finding critical values, calculating test statistics, and making conclusions.
The revision is in preparation for Standard Track Test B which will be held the week of April 21st.
This document discusses six measures of variation used to determine how values are distributed in a data set: range, quartile deviation, mean deviation, variance, standard deviation, and coefficient of variation. It provides definitions and examples of calculating each measure. The range is defined as the difference between the highest and lowest values. Quartile deviation uses the interquartile range (Q3-Q1). Mean deviation is the average of the absolute deviations from the mean. Variance and standard deviation measure how spread out values are from the mean, with variance using sums of squares and standard deviation taking the square root of variance.
8. normal distribution qt pgdm 1st semesterKaran Kukreja
?
The document discusses the normal distribution and its key properties. It explains that the normal distribution is a limiting case of the binomial distribution when the number of trials is large. It has a bell-shaped symmetrical curve centered around the mean. The normal distribution is uniquely defined by its mean and standard deviation. The document also covers how to convert between a normal distribution and the standard normal distribution and how to find probabilities using the standard normal distribution table.
The document introduces the Gaussian or normal distribution, its key properties, and how it can be used for inference. The Gaussian distribution is symmetrical and bell-shaped. It is completely defined by its mean and standard deviation. By transforming data into z-scores, the standard normal distribution can be applied to understand the probabilities of outcomes in any normal distribution. The Gaussian distribution and z-scores allow researchers to assess likelihoods and make inferences about variable values based on their known distribution.
This document provides an overview of standard deviation and z-scores. It begins by listing the key learning objectives which are to describe the importance of variation in distributions, understand how to calculate standard deviation, describe what a z-score is and how to calculate them, and learn the Greek letters for mean and standard deviation. It then provides explanations and examples of how to calculate and interpret standard deviation as a measure of variation, how to convert values to z-scores based on the mean and standard deviation, and the importance of ensuring distributions are normal before using these statistical techniques. It emphasizes understanding the concepts rather than just memorizing formulas.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
- The document discusses key concepts in descriptive statistics including types of distributions, measures of central tendency, and measures of dispersion.
- It covers normal, skewed, and other types of distributions. Measures of central tendency discussed are mean, median, and mode. Measures of dispersion covered are variance and standard deviation.
- The document uses examples and explanations to illustrate how to calculate and interpret these important statistical measures.
This document discusses analytical representation of data through descriptive statistics. It begins by showing raw, unorganized data on movie genre ratings. It then demonstrates organizing this data into a frequency distribution table and bar graph to better analyze and describe the data. It also calculates averages for each movie genre. The document then discusses additional descriptive statistics measures like the mean, median, mode, and percentiles to further analyze data through measures of central tendency and dispersion.
The central limit theorem states that the sampling distribution of the sample mean x will be approximately normally distributed for sample sizes n greater than 30, regardless of the shape of the population distribution. Specifically, the sampling distribution of x will have a mean equal to the population mean μ and a standard deviation of σ/√n. Similarly, for a sample proportion p, the sampling distribution of p will be approximately normal for n greater than 10 and np and nq both greater than 10, with mean equal to the population proportion p and standard deviation of √(p(1-p)/n).
?Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.3: Estimating a Population Standard Deviation or Variance
?Please Subscribe to this Channel for more solutions and lectures
http://www.youtube.com/onlineteaching
Chapter 7: Estimating Parameters and Determining Sample Sizes
7.3: Estimating a Population Standard Deviation or Variance
The document discusses different types of data distributions including normal, binomial, Poisson, exponential, and Bernoulli distributions. It provides the formulas and properties of each distribution. For example, it states that the normal distribution is the most commonly used in data science and has a bell-shaped, symmetric curve. The Poisson distribution outlines the probability of events in fixed time periods and has a rate parameter λ. The exponential distribution gives the probability of time before an event and uses the rate parameter in its formula.
This document provides a summary of key concepts and examples for a statistics quiz on normal distributions, the central limit theorem, confidence intervals, and hypothesis testing. It reviews formulas and how to apply them to calculate probabilities, z-scores, confidence levels, sample sizes, and margins of error. Examples of problems cover finding areas under the normal curve, interpreting confidence intervals, and constructing confidence intervals for means, proportions, and more.
Unit-I Measures of Dispersion- Biostatistics - Ravinandan A P.pdfRavinandan A P
?
Biostatistics, Unit-I, Measures of Dispersion, Dispersion
Range
variation of mean
standard deviation
Variance
coefficient of variation
standard error of the mean
A brain tumor is a growth of cells in the brain or near it. Brain tumors can happen in the brain tissue. Brain tumors also can happen near the brain tissue. Nearby locations include nerves, the pituitary gland, the pineal gland, and the membranes that cover the surface of the brain.
Brain tumors can begin in the brain. These are called primary brain tumors. Sometimes, cancer spreads to the brain from other parts of the body. These tumors are secondary brain tumors, also called metastatic brain tumors.
Many different types of primary brain tumors exist. Some brain tumors aren't cancerous. These are called noncancerous brain tumors or benign brain tumors. Noncancerous brain tumors may grow over time and press on the brain tissue. Other brain tumors are brain cancers, also called malignant brain tumors. Brain cancers may grow quickly. The cancer cells can invade and destroy the brain tissue.
Brain tumors range in size from very small to very large. Some brain tumors are found when they are very small because they cause symptoms that you notice right away. Other brain tumors grow very large before they're found. Some parts of the brain are less active than others. If a brain tumor starts in a part of the brain that's less active, it might not cause symptoms right away. The brain tumor size could become quite large before the tumor is detected.
Brain tumor treatment options depend on the type of brain tumor you have, as well as its size and location. Common treatments include surgery and radiation therapy.
Types
There are many types of brain tumors. The type of brain tumor is based on the kind of cells that make up the tumor. Special lab tests on the tumor cells can give information about the cells. Your health care team uses this information to figure out the type of brain tumor.
Some types of brain tumors usually aren't cancerous. These are called noncancerous brain tumors or benign brain tumors. Some types of brain tumors usually are cancerous. These types are called brain cancers or malignant brain tumors. Some brain tumor types can be benign or malignant.
Benign brain tumors tend to be slow-growing brain tumors. Malignant brain tumors tend to be fast-growing brain tumors.
Glioblastoma brain tumor
Glioblastoma
Enlarge image
Child with a medulloblastoma brain tumor
Medulloblastoma
Enlarge image
Acoustic neuroma, a benign tumor on the nerves leading from the inner ear to the brain
Acoustic neuroma (vestibular schwannoma)
Enlarge image
Types of brain tumors include:
Gliomas and related brain tumors. Gliomas are growths of cells that look like glial cells. The glial cells surround and support nerve cells in the brain tissue. Types of gliomas and related brain tumors include astrocytoma, glioblastoma, oligodendroglioma and ependymoma. Gliomas can be benign, but most are malignant. Glioblastoma is the most common type of malignant brain tumor.
Choroid plexus tumors. Choroid plexus tumors start in cells that make the fluid that surrounds the bra
Title of the Proposal
Abstract
Introduction and Background
Research Questions and Objectives
Methodology
Significance and Implications
Timeline
Budget
Conclusion
Rerefences
Bangor University: A Legacy of Excellence in Education and Researchstudyabroad731
?
Bangor University, also known as Prifysgol Bangor in Welsh, is a prominent institution of higher education situated in Bangor, Wales. At Study Abroad Established in 1885, it has grown into a respected center for academic excellence
I am the Director of Communications for Florida State University's College of Social Sciences and Public Policy (COSSPP).
I have a strong track record of developing and implementing strategic and creative marketing approaches that incorporate expertise in public relations, graphic design, and digital marketing strategies.
In my role as Communications Director for COSSPP, I have the pleasure of communicating with our current and prospective students, faculty, staff, alumni, friends, and the communities beyond impacted by our work. COSSPP is home to 12 departments and interdisciplinary programs and 11 centers and institutes and serves as the third largest College at FSU, helping more than 5,000 students each year along their academic journeys. Our 50,000+ alumni work around the globe to address the world’s challenges and to better their communities. To support this work, I run an internship program within our College and provide on-the-job learning opportunities to FSU students at the undergraduate and graduate levels. Our students develop skills critical for workplace readiness and leave with comprehensive portfolios showcasing their writing, design, photography, and project management skills.
Part-Time Jobs in Jaipur for Students and Working Professionals.pptxvinay salarite
?
Jaipur offers diverse part-time job opportunities for students and working professionals in retail, IT, hospitality, and education. Flexible work options make it easier to balance studies or full-time jobs while gaining valuable experience.
Explore opportunities now and start your career!
Engage is FSU College of Social Sciences and Public Policy’s annual magazine for alumni and friends.
Each edition contains highlights from the college’s many student, faculty, staff, and alumni achievements during that academic year.
I served as Editor-in-Chief and Creative Director for this project, which included all graphic design services.
How to Prepare for Avaya 67200T Certification.pdfNWEXAM
?
Start Here--- https://bit.ly/4dOjocy ---Get complete detail on 67200T exam guide to crack Avaya IP Office Platform R11 Administrator (ASAC-0013). You can collect all information on 67200T tutorial, practice test, books, study material, exam questions, and syllabus. Firm your knowledge on Avaya IP Office Platform R11 Administrator (ASAC-0013) and get ready to crack 67200T certification. Explore all information on 67200T exam with number of questions, passing percentage and time duration to complete test.
3. The Normal Distribution:
as mathematical function
(pdf)
2
)
(
2
1
2
1
)
( ?
?
?
?
?
?
?
?
x
e
x
f
Note constants:
?=3.14159
e=2.71828
This is a bell shaped
curve with different
centers and spreads
depending on ? and ?
4. The Normal PDF
1
2
1 2
)
(
2
1
?
?
?
??
?
?
?
?
dx
e
x
?
?
?
?
It’s a probability function, so no matter what the values
of ? and ?, must integrate to 1!
5. Normal distribution is defined
by its mean and standard dev.
E(X)=? =
Var(X)=?2 =
Standard Deviation(X)=?
dx
e
x
x
?
??
?
?
?
?
?
2
)
(
2
1
2
1 ?
?
?
?
2
)
(
2
1
2
)
2
1
(
2
?
?
?
?
?
?
?
?
??
?
?
?
?
dx
e
x
x
6. **The beauty of the normal curve:
No matter what ? and ? are, the area between ?-? and
?+? is about 68%; the area between ?-2? and ?+2? is
about 95%; and the area between ?-3? and ?+3? is
about 99.7%. Almost all values fall within 3 standard
deviations.
9. How good is rule for real data?
Check some example data:
The mean of the weight of the women = 127.8
The standard deviation (SD) = 15.5
10. 8 0 9 0 1 0 0 1 1 0 1 2 0 1 3 0 1 4 0 1 5 0 1 6 0
0
5
1 0
1 5
2 0
2 5
P
e
r
c
e
n
t
P O U N D S
127.8 143.3
112.3
68% of 120 = .68x120 = ~ 82 runners
In fact, 79 runners fall within 1-SD (15.5 lbs) of the mean.
11. 8 0 9 0 1 0 0 1 1 0 1 2 0 1 3 0 1 4 0 1 5 0 1 6 0
0
5
1 0
1 5
2 0
2 5
P
e
r
c
e
n
t
P O U N D S
127.8
96.8
95% of 120 = .95 x 120 = ~ 114 runners
In fact, 115 runners fall within 2-SD’s of the mean.
158.8
12. 8 0 9 0 1 0 0 1 1 0 1 2 0 1 3 0 1 4 0 1 5 0 1 6 0
0
5
1 0
1 5
2 0
2 5
P
e
r
c
e
n
t
P O U N D S
127.8
81.3
99.7% of 120 = .997 x 120 = 119.6 runners
In fact, all 120 runners fall within 3-SD’s of the mean.
174.3
13. Example
? Suppose SAT scores roughly follows a
normal distribution in the U.S. population of
college-bound students (with range
restricted to 200-800), and the average math
SAT is 500 with a standard deviation of 50,
then:
? 68% of students will have scores between 450
and 550
? 95% will be between 400 and 600
? 99.7% will be between 350 and 650
14. Example
? BUT…
? What if you wanted to know the math SAT
score corresponding to the 90th percentile
(=90% of students are lower)?
P(X≤Q) = .90 ?
90
.
2
)
50
(
1
200
)
50
500
(
2
1 2
?
?
?
?
?
Q x
dx
e
?
Solve for Q?….Yikes!
15. The Standard Normal (Z):
“Universal Currency”
The formula for the standardized normal
probability density function is
2
2
)
(
2
1
)
1
0
(
2
1
2
1
2
)
1
(
1
)
(
Z
Z
e
e
Z
p
?
?
?
?
?
?
?
?
?
16. The Standard Normal Distribution (Z)
All normal distributions can be converted into
the standard normal curve by subtracting the
mean and dividing by the standard deviation:
?
?
?
?
X
Z
Somebody calculated all the integrals for the standard
normal and put them in a table! So we never have to
integrate!
Even better, computers now do all the integration.
17. Comparing X and Z units
Z
100
2.0
0
200 X (? = 100, ? = 50)
(? = 0, ? = 1)
18. Example
? For example: What’s the probability of getting a math SAT
score of 575 or less, ?=500 and ?=50?
5
.
1
50
500
575
?
?
?
Z
?i.e., A score of 575 is 1.5 standard deviations above the mean
?
? ?
?
?
?
?
?
??
?
?
?
?
?
5
.
1
2
1
575
200
)
50
500
(
2
1 2
2
2
1
2
)
50
(
1
)
575
( dz
e
dx
e
X
P
Z
x
?
?
Yikes!
But to look up Z= 1.5 in standard normal chart (or enter
into SAS)? no problem! = .9332
19. Practice problem
If birth weights in a population are
normally distributed with a mean of 109
oz and a standard deviation of 13 oz,
a. What is the chance of obtaining a birth
weight of 141 oz or heavier when
sampling birth records at random?
b. What is the chance of obtaining a birth
weight of 120 or lighter?
20. Answer
a. What is the chance of obtaining a birth
weight of 141 oz or heavier when
sampling birth records at random?
46
.
2
13
109
141
?
?
?
Z
From the chart or SAS ? Z of 2.46 corresponds to a right tail (greater
than) area of: P(Z≥2.46) = 1-(.9931)= .0069 or .69 %
21. Answer
b. What is the chance of obtaining a birth
weight of 120 or lighter?
From the chart or SAS ? Z of .85 corresponds to a left tail area of:
P(Z≤.85) = .8023= 80.23%
85
.
13
109
120
?
?
?
Z
22. Looking up probabilities in the
standard normal table
What is the area to the
left of Z=1.51 in a
standard normal curve?
Z=1.51
Z=1.51
Area is 93.45%
23. Normal probabilities in SAS
data _null_;
theArea=probnorm(1.5);
put theArea;
run;
0.9331927987
And if you wanted to go the other direction (i.e., from the area to the Z
score (called the so-called “Probit” function?
data _null_;
theZValue=probit(.93);
put theZValue;
run;
1.4757910282
The “probnorm(Z)” function gives you
the probability from negative infinity to
Z (here 1.5) in a standard normal curve.
The “probit(p)” function gives you the
Z-value that corresponds to a left-tail
area of p (here .93) from a standard
normal curve. The probit function is also
known as the inverse standard normal
function.
24. Probit function: the inverse
?(area)= Z: gives the Z-value that goes with the probability you want
For example, recall SAT math scores example. What’s the score that
corresponds to the 90th percentile?
In Table, find the Z-value that corresponds to area of .90 ? Z= 1.28
Or use SAS
data _null_;
theZValue=probit(.90);
put theZValue;
run;
1.2815515655
If Z=1.28, convert back to raw SAT score ?
1.28 =
X – 500 =1.28 (50)
X=1.28(50) + 500 = 564 (1.28 standard deviations above the mean!)
`
50
500
?
X
25. Are my data “normal”?
? Not all continuous random variables are
normally distributed!!
? It is important to evaluate how well the
data are approximated by a normal
distribution
26. Are my data normally
distributed?
1. Look at the histogram! Does it appear bell
shaped?
2. Compute descriptive summary measures—are
mean, median, and mode similar?
3. Do 2/3 of observations lie within 1 std dev of
the mean? Do 95% of observations lie within
2 std dev of the mean?
4. Look at a normal probability plot—is it
approximately linear?
5. Run tests of normality (such as Kolmogorov-
Smirnov). But, be cautious, highly influenced
by sample size!
27. Data from our class…
Median = 6
Mean = 7.1
Mode = 0
SD = 6.8
Range = 0 to 24
(= 3.5 ?)
28. Data from our class…
Median = 5
Mean = 5.4
Mode = none
SD = 1.8
Range = 2 to 9
(~ 4 ?)
29. Data from our class…
Median = 3
Mean = 3.4
Mode = 3
SD = 2.5
Range = 0 to 12
(~ 5 ?)
30. Data from our class…
Median = 7:00
Mean = 7:04
Mode = 7:00
SD = :55
Range = 5:30 to 9:00
(~4 ?)
31. Data from our class…
7.1 +/- 6.8 =
0.3 – 13.9
0.3 13.9
40. Data from our class…
7:04+/- 0:55 =
6:09 – 7:59
6:09 7:59
41. Data from our class…
7:04+/- 2*0:55
=
5:14 – 8:54
5:14
8:54
42. Data from our class…
7:04+/- 2*0:55
=
4:19 – 9:49
4:19
9:49
43. The Normal Probability Plot
? Normal probability plot
? Order the data.
? Find corresponding standardized normal quantile
values:
? Plot the observed data values against normal
quantile values.
? Evaluate the plot for evidence of linearity.
area
tail
-
left
particular
a
to
s
correspond
that
value
Z
the
gives
which
function,
probit
the
is
where
)
1
n
i
(
quantile
?
?
?
?
th
i
48. Formal tests for normality
? Results:
? Coffee: Strong evidence of non-normality
(p<.01)
? Writing love: Moderate evidence of non-
normality (p=.01)
? Exercise: Weak to no evidence of non-
normality (p>.10)
? Wakeup time: No evidence of non-normality
(p>.25)
49. Normal approximation to the
binomial
When you have a binomial distribution where n is
large and p is middle-of-the road (not too small, not
too big, closer to .5), then the binomial starts to look
like a normal distribution? in fact, this doesn’t even
take a particularly large n?
Recall: What is the probability of being a smoker among
a group of cases with lung cancer is .6, what’s the
probability that in a group of 8 cases you have less
than 2 smokers?
50. Normal approximation to the
binomial
When you have a binomial distribution where
n is large and p isn’t too small (rule of thumb:
mean>5), then the binomial starts to look like
a normal distribution?
Recall: smoking example…
1 4 5
2 3 6 7 8
0
.27 Starting to have a normal
shape even with fairly small
n. You can imagine that if n
got larger, the bars would get
thinner and thinner and this
would look more and more
like a continuous function,
with a bell curve shape. Here
np=4.8.
51. Normal approximation to
binomial
1 4 5
2 3 6 7 8
0
.27
What is the probability of fewer than 2 smokers?
Normal approximation probability:
?=4.8
?=1.39
2
39
.
1
8
.
2
39
.
1
)
8
.
4
(
2
?
?
?
?
?
?
Z
Exact binomial probability (from before) = .00065 + .008 = .00865
P(Z<2)=.022
52. A little off, but in the right ballpark… we could also use the value
to the left of 1.5 (as we really wanted to know less than but not
including 2; called the “continuity correction”)…
37
.
2
39
.
1
3
.
3
39
.
1
)
8
.
4
(
5
.
1
?
?
?
?
?
?
Z
P(Z≤-2.37) =.0069
A fairly good approximation of
the exact probability, .00865.
53. Practice problem
1. You are performing a cohort study. If the probability
of developing disease in the exposed group is .25 for
the study duration, then if you sample (randomly)
500 exposed people, What’s the probability that at
most 120 people develop the disease?
55. Proportions…
? The binomial distribution forms the basis of
statistics for proportions.
? A proportion is just a binomial count divided
by n.
? For example, if we sample 200 cases and find 60
smokers, X=60 but the observed proportion=.30.
? Statistics for proportions are similar to
binomial counts, but differ by a factor of n.
56. Stats for proportions
For binomial:
)
1
(
)
1
(
2
p
np
p
np
np
x
x
x
?
?
?
?
?
?
?
?
For proportion:
n
p
p
n
p
p
n
p
np
p
p
p
p
)
1
(
)
1
(
)
1
(
?
2
2
?
?
?
?
?
?
?
?
?
?
?
?
P-hat stands for “sample
proportion.”
Differs by
a factor of
n.
Differs
by a
factor
of n.
57. It all comes back to Z…
? Statistics for proportions are based on a
normal distribution, because the
binomial can be approximated as
normal if np>5