Professional Documents
Culture Documents
Hypothesis Testing
Learning Objectives
Type, number, and the level of data may provide a platform for
deciding the statistical test.
Step 3: Set the Level of Significance
Critical region is the area under the normal curve, divided into two mutually
exclusive regions. These regions are termed as acceptance region (when the null
hypothesis is accepted) and the rejection region or critical region (when the null
hypothesis is rejected).
Two-Tailed Test of Hypothesis
It is not advisable to collect the data first and then decide on the
stages of hypothesis testing.
Step 6: Analyse the data
In this step, the researcher has to compute the test statistic. This
involves selection of an appropriate probability distribution for a
particular test.
Some of the commonly used testing procedures are z, t, F, and
χ2.
Step 7: Arrive at a Statistical Conclusion
and Business Implication
A marketing research firm conducted a survey 10 years ago and found that
the average household income of a particular geographic region is Rs
10,000. Mr. Gupta, who has recently joined the firm as a vice president has
expressed doubts about the accuracy of the data. For verifying the data,
the firm has decided to take a random sample of 200 households that
yield a sample mean (for household income) of Rs 11,000. Assume that
the population standard deviation of the household income is Rs 1200.
Verify Mr. Gupta’s doubts using the seven steps of hypothesis testing. Let
α = 0.05 (5%).
Example (Solution)
Hypothesis Testing for a Single Population
Mean Using the T Statistic (Case of a
Small Random Sample When N < 30)
When a researcher draw a small random sample (n < 30) to estimate
the population mean μ and when the population standard deviation
is unknown and population is normally distributed, t-test can be
applied.
Example
Royal Tyres has launched a new brand of tyres for tractors and claims
that under normal circumstances the average life of the tyres is
40,000 km. A retailer wants to test this claim and has taken a
random sample of 8 tyres. He tests the life of the tyres under normal
circumstance. The results obtained are presented in Table 10.4.
Example (Solution)
Figure : Computed and critical t values for Example 10.4
Lets Do It !!
When sample size is large (n1, n2 > 30) and samples are independent
(not related) and the population standard deviation is known, the Z
statistic can be used to test the hypothesis for difference between
two population means.
Hypotheseis Testing for the Difference Between
Two Population Means Using the Z-Statistic (Case
of a large Random Sample, n1, n2 > 30)
LET’S DO IT !
When sample size is small (n1, n2 < 30) and samples are
independent (not related) and the population standard deviation is
unknown, the t statistic can be used to test the hypothesis for
difference between two population means.
Hypotheseis Testing for the Difference Between
Two Population Means Using the t Statistic (Case
of a Small Random Sample, n1, n2 < 30, When
Population Standard Deviation Is Unknown)
LET’S DO IT !
Anmol Constructions is a leading company in the construction
sector in India. It wants to construct flats in Raipur and Dehradun,
the capitals of the newly formed states of Chattisgarh and
Uttarakhand, respectively. The company wants to estimate the
amount that customers are willing to spend on purchasing a flat
in the two cities. It randomly selected 25 potential customers
from Raipur and 27 customers from Dehradun and posed the
question, “how much are you willing to spend on a flat?” The
data collected from the two cities is shown in Table 11.2(a) and
Table 11.2(b). The company assumes that the intention to
purchase of the customers is normally distributed with equal
variance of 15.39 in the two cities taken for the study. On the
basis of the samples taken for the study, estimate the difference
in population means taking 95% as the confidence level.
Example 11.2 (Contd.)
Solution (Example 11.2)
Statistical Inference About the Difference
Between the Means of Two Related Populations
(Matched Samples)
A plant has installed two machines producing polythene bags. During the
installation, the manufacturer of the machine has stated that the capacity of the
machine is to produce 20 bags in a day. Owing to various factors such as
different operators working on these machines, raw material, etc. there is a
variation in the number of bags produced at the end of the day. The company
researcher has taken a random sample of bags produced in 10 days for machine
1 and 13 days for machine 2, respectively. The following data gives the number
of units of an item produced on a sampled day by the two machines:
How can the researcher determine whether the variance is from the same
population (population variances are equal) or it comes from different
populations (population variances are not equal)? Take α = 0.05 as the
confidence level.
Solution (Example 11.5)