Professional Documents
Culture Documents
1 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
improveandinnovate
Thoughts and ideas on Quality , Improvement and
Innovation
The full content of CSSBB Tutorial Series is now available for sale in the form of a E Book titled :
Lean Six Sigma Black Belt : A Complete Guide
Key Features of this book are :
Complete CSSBB BoK coverage 350+ pages , easy to read format/ fonts
Hundreds of real world examples / worked out exercises with graphics to demonstrate Lean Six Sigma
BB tools and concepts
Statistical Tools explained in most simple language with Minitab & MS Excel
Format : pdf no content copying / printing allowed.
Annexures include summary of formulas and all statistical tables
Complete Glossary in alphabetical order
No exam related quizzes / question bank are included in the book .
Email assistance to clarify doubts related to any CSSBB related topic upto 3 months from date of
purchase
For price and commercial information , please email : gouritg@yahoo.com
Leave a comment
10/5/2014 10:52 PM
2 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08
/picture13.jpg)
source : the internet
Leave a comment
Hypothesis Testing, null hypothesis, Parametric tests, Power of a hypothesis test, Producers Risk, Type I
Error, Type II Error
CSSBB Tutorial Series : Lesson 11
The Analyze Phase Part 2
Topics Covered :
10/5/2014 10:52 PM
3 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Hypothesis Testing
Basic Concepts Statistical and Practical Signicance
Hypothesis Testing and Errors
Power of a hypothesis test
Types Parametric and Non- Parametric
Alice -Would you tell me, please, which way I ought to go from here?
The Cheshire Cat -That depends a good deal on where you want to get to.
Alice -I dont much care where
The Cheshire Cat Then it doesnt maer which way you go.
Lewis Carroll (hps://www.goodreads.com/author/show/8164.Lewis_Carroll), Alice in Wonderland
(hps://www.goodreads.com/work/quotes/2933712)
Introduction
A hypothesis is an assumption about a population parameter. The parameter may be the mean , the
proportion , the standard deviation etc. In statistics , we may make assumptions about population
parameters based on the sample statistic . These assumptions need to be proved ( or disproved) at
required levels of signicance ( or condence levels) . This is known as hypothesis testing.
The most common example of a hypothesis test is that of a trial process in a court of law . The assumption
in a trial is the defendant is innocent This assumption is called the null hypothesis . A contradiction to
this assumption is the defendant is not innocent. This is called the alternative hypothesis. Hence the
hypothesis statement is framed as :
Ho : The defendant is innocent
Ha : The defendant is not innocent
Note : Is it incorrect to state the alternative hypothesis as : Ha : The defendant is guilty ? Think about
it !
Hypothesis tests can be two tailed or single- tailed ( le or right tailed) . Fig. 11-1 below shows
acceptance and rejection regions for each of these possibilities. This concept is best understood with the
help of examples discussed later.
10/5/2014 10:52 PM
4 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08/slide21.jpg)
Fig 11-1 : Two- Tailed and Single- Tailed Hypothesis Tests.
Steps in Hypothesis Testing : Following is the sequence of steps in hypothesis testing.
1. Dene the problem and objectives ( Ex. : Establish that the new product is lighter than the old product ).
2. Frame the hypothesis ( Is it a two tailed or a single tailed test ? )
1. State the null hypothesis (Ho : New Product is lighter ).
2. State the alternative hypothesis (Ha : New product is not lighter )
3. Select the appropriate hypothesis test (depending on the assumed probability distribution: z, t, or F ,
chi- squared etc).
4. Select the required level of signicance (typically 5%).
5. Select the required level of (typically 10%).
6. Establish the sample size depending on the precision ,E, required ( the dierence between the
population parameters required to be detected).
7. Carry out data collection according to the sampling data.
8. Compute the test statistic using the formula for the probability function
9. From the tables , determine the critical value of the probability distribution for the selected value of
and degrees of freedom . This denes the rejection region for the null hypothesis.
10. Compare values obtained in steps 8 and 9 .
11. If the test statistic lies in the rejection region , reject Ho and accept Ha , else accept the null
hypothesis.
12. Interpret the statistical conclusion with reference to the objective dened in Step 1.
Hypothesis Testing and Errors
Since decisions about accepting or rejecting hypotheses are based on sample statistics, there is always a
possibility of errors when arriving at such decisions. This could happen either due to a bad sample or
due to inadequate sample sizes . Errors in hypothesis testing can be of two types :
Type I Error : A decision to reject the null hypothesis when it is true is called a Type I Error . The
probability associated with a Type I error is the level of signicance , , It is also called the risk.
10/5/2014 10:52 PM
5 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
This means when we select a value of ( say 0.05) , there is a 5% risk of wrongly rejecting the null
hypothesis when it is true. This also means the decision about the hypothesis is made with 95%
condence .
Type II Error : A decision to accept a null hypothesis when it is false is called a Type II Error. The
probability associated with a Type II error is called the risk. This means when we select a value of
( say 0.1) , there is a 10% risk of wrongly accepting the null hypothesis when it is false.
Power of the Test : The ability of a test to correctly reject the hypothesis when it is false is given by 1 . This is also called the power of the test.
Example : A vendor supplies a consignment of 1000 components to the customer and a random sample is drawn
by the Quality Manager for inspection and acceptance . If , for some reason , the sample is not representative of
the population ( the 1000 components in this case ) , the following decisions are possible :
The sample indicates a bad consignment when the consignment is actually good . The Quality
Manager will reject a good consignment .This is a Type I Error , also called the Producers Risk;
The sample indicates a good consignment , when the consignment is actually bad . The Quality
Manager will accept the bad consignment . This is a Type II Error, also called the Consumers Risk.
Fig. 11- 2 explains the concept of Type I and Type errors .
(hps://improveandinnovate.les.wordpress.com
/2014/08/slide31.jpg)
Selecting Values of and
A major challenge in hypothesis testing is keeping both and as low as possible.
Example : A Type I error will be preferred over a Type II error when the risk of accepting a bad consignment is
high . In such a case , the decision maker will have to set a high value of , even if it means some good product
will get rejected. The same argument holds when the risk of rejecting a good consignment ( Type I error) is higher
than that of accepting a bad consignment ( Type II error). In this case , the decision maker will have to set a low
value of to ensure no good product gets rejected even at the cost of some bad product geing accepted.
Hence , there is always a trade o between the two errors : one can reduce the probability of making a
Type I error only at the cost of a higher probability of making a Type II error and vice -versa. Therefore,
the choice of levels of and will depend on the costs associated with each type of error.
Types of Hypothesis Testing :
10/5/2014 10:52 PM
6 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Depending on the population parameter to be tested , there are several types of hypothesis tests.
However ,these tests can be classied into two broad categories viz. : Parametric Tests and NonParametric Tests
Parametric Tests are used when the underlying population from which the sample was drawn follows a
known probability distribution such as Normal , t , F , ch- squared etc. The following table ( Tabe 11 3)
shows some of the commonly used parametric tests.
(hps://improveandinnovate.les.wordpress.com
/2014/08/slide41.jpg)
Non Parametric Tests : Also known as distribution free tests , non parametric tests are used when the
population from which the samples were drawn does not follow any known distribution. Non
parametric test are based on ranks or signs or both . Table 11-4 shows some of the commonly used non
parametric tests and the equivalent parametric test .
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide51.jpg)
Suggested Reading
Statistics For Management by Levin and Rubin
Dr. Jurans Quality Handbook.
10/5/2014 10:52 PM
7 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Leave a comment
(hps://improveandinnovate.les.wordpress.com/2014/08/picture12.jpg)
Leave a comment
10/5/2014 10:52 PM
8 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08/picture11.jpg)
Leave a comment
Alice -Would you tell me, please, which way I ought to go from here?
The Cheshire Cat -That depends a good deal on where you want to get to.
Alice -I dont much care where
The Cheshire Cat Then it doesnt maer which way you go.
Lewis Carroll (hps://www.goodreads.com/author/show/8164.Lewis_Carroll), Alice in Wonderland
(hps://www.goodreads.com/work/quotes/2933712)
10/5/2014 10:52 PM
9 of 16
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide2.jpg)
Example 10-1 : A consumer goods rm is interested in estimating sales of a new product in a particular locality.
The rm believes that one of the factors aecting sales is the purchasing power of the residents in the locality.
300 families were surveyed for this purpose and their annual incomes averaged $65000 . Past data shows a
standard deviation of $ 27500. Construct a 95% condence interval for the mean .
Solution :
n = 300 , x(bar) = $ 65000 and = $ 27500 , = 0.05
From normal probability tables ( refer any book on basic Statistics)
Z (/2 = 0.025) = 1.96
Standard error of the mean ,
10 of 16
(hps://improveandinnovate.les.wordpress.com/2014/08/slide3.jpg)
= 1587.7
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide4.jpg)
Fig. 1
Hence a 95 % condence interval is ( The area of interest is shown in Fig. 1)
= 65000 (1.96 *1587.7 ) <= <= 65000 + (1.96 *1587.7)
= $ 61888 < = < = $ 68112
This means , the rm is 95% condent that the average annual income of the population is between $
61888 and $ 68112
DIY Exercise 1 : Reconstruct and compare the above condence intervals with 1) 90% condence and 2) with
99% condence levels
Using MS Excel : The z- values for = 0.025 can also be obtained with MS Excel using the formula
NORMSINV( 0.025) .
The Finite Population Multiplier : In the above example we assumed a large sample ( almost innite) .
However , in real life we cannot imagine an innite population in a certain locality ! If the population is
nite , we need to modify the equation for standard error of the mean as given :
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide5.jpg)
Note : A population is considered nite if n/N >= 0.05
11 of 16
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Example 10-2: In the earlier example ( Example 10 1 ) , if the population was known to be 5000 , the revised
condence interval would be computed as follows :
(hps://improveandinnovate.les.wordpress.com/2014/08/slide6.jpg)
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide7.jpg)
Fig. 3 shows a comparison of t distribution with 3 deg. of freedom and a normal distribution.
12 of 16
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide8.jpg)
Fig 3. t- distribution vs. Normal Distribution
Example 10 3 : A sample of 10 patients in a hospital showed the average waiting time to be 50 mins and a
standard deviation of 15 mins. Construct a 90% condence interval for the mean waiting time.
Solution :
n = 10 , x(bar) = 50 mins. s = 15 mins ,
For a 90% condence interval, = 0.1. From statistical tables , look for t = 0.05 with ( n-1 = 9 ) degrees of
freedom. This corresponding value is is 2.2622 .
Hence the condence interval is computed as :
(hps://improveandinnovate.les.wordpress.com/2014/08
/slide9.jpg)
Note :
t- value can also be found using the Excel formula TINV(0.1,9) .
The t dist. table, unlike the Z table discussed earlier , shows combined areas under the le and
right tail for a given and also this area is outside the critical values of t .
Sample Size Determination for Means
The formula for estimating sample size for determining condence intervals for means is derived from the
margin of error discussed earlier :
13 of 16
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
(hps://improveandinnovate.les.wordpress.com
/2014/08/slide10.jpg)
Example 4 : In the earlier example ( Example 6- 1) of the consumer goods rm estimating average annual
salaries, what would be a statistically signicant sample size if the rm wanted the estimate to be within +/- $
3000 with a 95% condence?
Solution :
(hps://improveandinnovate.les.wordpress.com/2014/08/slide11.jpg)
Suggested Reading For Interval Estimates for Proportions & Variances :
1. Statistics For Management by Levin and Rubin
2. Jurans Quality Handbook
14 of 16
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Leave a comment
(hps://improveandinnovate.les.wordpress.com/2014/08/picture1.jpg)
source : internet
Leave a comment
improveandinnovate (hp://improveandinnovate.wordpress.com/)
Blog at WordPress.com (hps://wordpress.com/?ref=footer_blog). The Big Brother Theme
(hps://wordpress.com/themes/big-brother/).
Follow
15 of 16
10/5/2014 10:52 PM
http://improveandinnovate.wordpress.com/?goback=.gde_3151110_memb
Follow improveandinnovate
16 of 16
10/5/2014 10:52 PM