Professional Documents
Culture Documents
c) What are the errors associated with decisions based on hypothesis testing?
Ans: Hypothesis Testing is one of the most important aspects of the theory of decision making. It
consists of decisions rules required for drawing probabilistic inferences about the population
parameters. It often involves deciding at given point of time whether a given population
parameter is the same as before, as claimed or changed.
Null hypothesis
A statistical hypothesis or assumption made about the population parameter to testing its validity
for the purpose of possible acceptance is called null hypothesis. Null hypothesis is also called
hypothesis of no difference. We should adopt neutral or null regarding the outcome of the sample
while setting up the null hypothesis. The null hypothesis is usually denoted by Ho or H sub
Zero. For example, the null hypothesis may be set up as follows.
I. If we are denoted to test the significance of the difference between a sample statistic and
population parameter or between two sample statistics, then we set up the null hypothesis
that there is no significant between the sample statistic and the population parameter or
between two sample statistics. This means that the difference is just due to fluctuation of
sampling.
II. If we want to test any statement about the population parameter, we set up the null
hypothesis that it is true. For example, if the population mean has specified value o,
then set up the null hypothesis as
Ho; = o. That is, the population mean has some specified value o. In other words,
there is no significant difference between sample mean and population mean ().
Alternate hypothesis
II. H1: > o. That is, the population mean is greater than o.
The decision to accept or reject the null hypothesis Ho is made on the basis of the information
supplied by the sample data. There are the following four possibilities in testing of hypothesis.
III. Accepting the null hypothesis when the null hypothesis is false.
IV. Rejecting the null hypothesis when the null hypothesis is false.
Accept Ho Reject Ho
Probability =
Probability = 1-
In the above case, decision i and decision iv are correct decision while decisions ii and iii
are wrong.
In the testing of hypothesis, we may commit two types of errors. The error
committed in rejecting null hypothesis Ho when it is true is called type I error or the
error of the first kind and its probability is denoted by . The error committed in
accepting null hypothesis Ho when it is false is called type II error or the error of second
kind and it is denoted by . Thus,
While inspecting the quality of manufactured lot, the type I error amounts to
reject a good lot and type II error amounts to accepting a bad lot. Accordingly,
Where and are also known as producers risk and consumers risk respectively.
Though efforts are made to reduce both type I and type II errors. But it is not
possible to reduce both at the same time. The probability of making one type of error can
be reducing both at same time. The probability of making one type of error can be
reduced only if we are willing to increase the probability of making the other type of
error.
Level of significance.
The maximum size of the type I error that we are prepared to risk is called the level of
significance. In other words, the probability of rejecting a true null hypothesis is called level of
significance and is denoted by . Symbolically, it is defined as
= P (type I error)
Hypothesis generally is tested at 1% or 5% level of significance. But, the most commonly used
level of significance in practice is 5%. If we adopt = 5% level of significance. It shows that in 5
true samples out of 100, we are likely to reject a correct Ho
The critical region may represent by a portion of area under normal probability curve of the
sampling distribution of the statistic in two ways:
II. One tail or side under the curve which is either the right or left tail.
The testing of hypothesis which is based on critical region represented by tails under the
normal curve is called two tail tests. In other words a two tail test is a hypothesis in which the
null hypothesis is rejected if the sample value significantly higher or lower than the hypothesized
vale of the population parameter.
A critical value is a line on a graph that splits the graph into sections. One or two of the sections
is the "rejection region"; if your test value falls into that region, then you reject the null
hypothesis. A one tailed test with the rejection in one tail.
Sampling Distribution
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.5
-5 -4 -3 -2 -1 0 1 2 3 4 5
A one tailed test with the rejection in one tail. The critical value is the black bold line of the left
of that region.
Specifically, the four steps involved in using the critical value approach to conducting
any hypothesis test are:
2. Using the sample data and assuming the null hypothesis is true, calculate the value of the test
statistic. To conduct the hypothesis test for the population mean , we use the t statistic which
follows a t-distribution with n - 1 degrees of freedom.
3. Determine the critical value by finding the value of the known distribution of the test statistic
such that the probability of making a Type I error which is denoted (Greek letter "alpha")
and is called the "significance level of the test " is small (typically 0.01, 0.05, or 0.10).
4. Compare the test statistic to the critical value. If the test statistic is more extreme in the
direction of the alternative than the critical value, reject the null hypothesis in favor of the
alternative hypothesis. If the test statistic is less extreme than the critical value, do not reject the
null hypothesis.
Test statistic
A test statistic is a statistic (a quantity derived from the sample) used in statistical
hypothesis testing. A hypothesis test is typically specified in terms of a test statistic, considered
as a numerical summary of a data- set that reduces the data to one value that can be used to
perform the hypothesis test. In general, a test statistic is selected or defined in such a way as to
quantify, within observed data, behaviors that would distinguish the null from the alternative
hypothesis, where such an alternative is prescribed, or that would characterize the null
hypothesis if there is no explicitly stated alternative hypothesis.
An important property of a test statistic is that its sampling distribution under the null
hypothesis must be calculable, either exactly or approximately, which allows p-values to be
calculated. A test statistic shares some of the same qualities of a descriptive statistic, and many
statistics can be used as both test statistics and descriptive statistics. However, a test statistic is
specifically intended for use in statistical testing, whereas the main quality of a descriptive
statistic is that it is easily interpretable. Some informative descriptive statistics, such as the
sample range , do not make good test statistics since it is difficult to determine their sampling
distribution.
Example
For example, suppose the task is to test whether a coin is fair (i.e. has equal probabilities
of producing a head or a tail). If the coin is flipped 100 times and the results are recorded, the
raw data can be represented as a sequence of 100 heads and tails. If there is interest in the
marginal probability of obtaining a head, only the number T out of the 100 flips that produced a
head needs to be recorded. But T can also be used as a test statistic in one of two ways:
The exact sampling distribution of T under the null hypothesis is the binomial distribution
with parameters 0.5 and 100.
the value of T can be compared with its expected value under the null hypothesis of 50,
and since the sample size is large a normal distribution can be used as an approximation
to the sampling distribution either for T or for the revised test statistic T 50.
Using one of these sampling distributions, it is possible to compute either a one-tailed or two-
tailed p-value for the null hypothesis that the coin is fair. Note that the test statistic in this case
reduces a set of 100 numbers to a single numerical summary that can be used for testing.
Problem
The mean life of a particular battery is 75 hours. A sample of 9 light bulbs is chosen and
found to have a standard deviation of 10 hours and a mean of 80 hours. Find the standardized
test statistic.
solution;
The population standard deviation isnt known,so Im going to use the t-score
formula .
xx = sample mean = 80
0 = population mean = 75
n = sample size = 9
This means that the standardized test statistic (in this case, the t-score) is 1.5.
1. Simplification
5. It can be used again and again for similar problems or can be modified.
Population: A population is any entire collection of people, animals, plants or things on which
we may collect data. It is the entire group of interest, which we wish to describe or about which
we wish to draw conclusions. In the above figure the life of the light bulbs manufactured say by
GE, is the concerned population.
Qualitative and Quantitative Variables: Any object or event, which can vary in
successive observations either in quantity or quality, is called a "variable." Variables are
classified accordingly as quantitative or qualitative. A qualitative variable, unlike a quantitative
variable does not vary in magnitude in successive observations. The values of quantitative and
qualitative variables are called"Variates" and "Attributes", respectively. Variable: A characteristic
or phenomenon, which may take different, values, such as weight, gender since they are different
from individual to individual.
Business owners like to know how their decisions will affect their business. Before making
decisions, managers may explore the benefits of hypothesis testing, the experimentation of
decisions in a "laboratory" setting. By making such tests, managers can have more confidence in
their decisions.
Essentially good hypotheses lead decision- makers like you to new and better ways to
achieve your business goals. When you need to make decisions such as how much you should
spend on advertising or what effect a price increase will have your customer base, its easy to
make wild assumptions or get lost in analysis paralysis. A business hypothesis solves this
problem, because, at the start, its based on some foundational information. In all of science,
hypotheses are grounded in theory. Theory tells you what you can generally expect from a
certain line of inquiry. A hypothesis based on years of business research in a particular area, then,
helps you focus, define and appropriately direct your research. You wont go on a wild goose
chase to prove or disprove it. A hypothesis predicts the relationship between two variables. If you
want to study pricing and customer loyalty, you wont waste your time and resources studying
tangential areas.
Much of running a small business is a gamble, buoyed by boldness, intuition and guts.
But wise business leaders also conduct formal and informal research to inform their business
decisions. Good research starts with a good hypothesis, which is simply a statement making a
prediction based on a set of observations. For example, if youre considering offering flexible
work hours to your employees, you might hypothesize that this policy change will positively
affect their productivity and contribute to your bottom line. The ultimate job of the hypothesis in
business is to serve as a guidepost to your testing and research methods.
Conclusion
a research work. During hypothesis formulation, it is important to keep the statement simple,
Precise and clear, and derive it from an existing body of knowledge. Two types of hypothesis