Professional Documents
Culture Documents
ASSIGNMENT
Name
Dhananjay
Kumar
Roll No. 510911381
Course MBA-Semester-3
Research
Subject
Methodology
Subject Code MB0035-Set-2
For instance, one might want to test the claim that a certain drug reduces the chance of
having a heart attack. One would choose the null hypothesis "this drug does not reduce
the chances of having a heart attack" (or perhaps "this drug has no effect on the chances
of having a heart attack"). One should then collect data by observing people both taking
the drug and not taking the drug in some sort of controlled experiment. If the data is very
unlikely under the null hypothesis one would reject the null hypothesis, and conclude that
its negation is true. That is, one would conclude that the drug does reduce the chances of
having a heart attack. Here "unlikely data" would mean data where the percentage of
people taking the drug who had heart attack was much less then the percentage of people
not taking the drug who had heart attacks. Of course one should use a known statistical
test to decide how unlikely the data was and hence whether or not to reject the null
hypothesis.
The results of exploratory research are not usually useful for decision-making by
themselves, but they can provide significant insight into a given situation. Although the
results of qualitative research can give some indication as to the "why", "how" and
"when" something occurs, it cannot tell us "how often" or "how many."
c. Random Sampling is that part of statistical practice concerned with the selection
of an unbiased or random subset of individual observations within a population of
individuals intended to yield some knowledge about the population of concern, especially
for the purposes of making predictions based on statistical inference. Sampling is an
important aspect of data collection.
Researchers rarely survey the entire population for two reasons (Adr, Mellenbergh, &
Hand, 2008): the cost is too high, and the population is dynamic in that the individuals
making up the population may change over time. The three main advantages of sampling
are that the cost is lower, data collection is faster, and since the data set is smaller is
possible to ensure homogeneity and to improve the accuracy and quality of the data.
Each observation measures one or more properties (such as weight, location, color) of
observable bodies distinguished as independent objects or individuals. In survey
sampling, survey weights can be applied to the data to adjust for the sample design.
Results from probability theory and statistical theory are employed to guide practice. In
business and medical research, sampling is widely used for gathering information about a
population.
d. Rank-order correlation - the most commonly used method of computing a
correlation coefficient between the ranks of scores on two variables. In statistics,
Spearman's rank correlation coefficient or Spearman's rho, named after Charles
Spearman and often denoted by the Greek letter (rho) or as rs, is a non-parametric
measure of statistical dependence between two variables. It assesses how well the
relationship between two variables can be described using a monotonic function. If there
are no repeated data values, a perfect Spearman correlation of +1 or 1 occurs when each
of the variables is a perfect monotone function of the other.
The Spearman correlation coefficient is often thought of as being the Pearson correlation
coefficient between the ranked variables. In practice, however, a simpler procedure is
normally used to calculate . The n raw scores Xi, Yi are converted to ranks xi, yi, and the
differences di = xi yi between the ranks of each observation on the two variables are
calculated.
If tied ranks exist, Pearson's correlation coefficient between ranks should be used for the
calculation:
One has to assign the same rank to each of the equal values. It is an average of their
positions in the ascending order of the values.
3. Reference Material
Bibliography
Appendix
Copies of data collection instruments
Technical details on sampling plan
Complex tables
Glossary of new terms used.
Mechanics of Writing:
A research report requires clear organization. Each chapter may be divided into
two or more sections with appropriate heading and in each section margin headings and
paragraph headings may be used to indicate subject shifts. Physical presentation is
another aspect of organization. A page should not be fully filled in from top to bottom.
Wider margins should be provided on both sides and on top and bottom as well.
Centered section heading is provided in the center of the page and is usually in solid font
size. It is separated from other textual material by two or three line space.
Marginal heading is used for a subdivision in each section. It starts from the left side
margin without leaving any space.
Paragraph heading is used to head an important aspect of the subject matter discussed in a
subdivision. There is some space between the margin and this heading.
Presentation should be free form spelling and grammar errors. If the writer is not strong
in grammar, get the manuscript corrected by a language expert.
Use the rules of punctuations.
Use present tense for presenting the findings of the study and for stating generalizations
Do not use masculine nouns and pronouns when the content refers to both the genders.
Do not abbreviate words in the text; spell out them in full. Footnote citation is indicated
by placing an index number, i.e., a superscript or numeral, at the point of reference.
Reference style should have a clear format and used consistently.
Case study is a method of exploring and analyzing the life of a social unit or
entity, be it a person, a family, an institution or a community. Case study would depend
upon wit, commonsense and imagination of the person doing the case study. The
investigator makes up his procedure as he goes along. Efforts should be made to ascertain
the reliability of life history data through examining the internal consistency of the
material.. A judicious combination of techniques of data collection is a prerequisite for
securing data that are culturally meaningful and scientifically significant. Case study of
particular value when a complex set of variables may be at work in generating observed
results and intensive study is needed to unravel the complexities. The case documents
hardly fulfill the criteria of reliability, adequacy and representativeness, but to exclude
them form any scientific study of human life will be blunder in as much as these
documents are necessary and significant both for theory building and practice. In-depth
analysis of selected cases is of particular value to business research when a complex set
of variables may be at work in generating observed results and intensive study is needed
to unravel the complexities.
Let us discuss the criteria for evaluating the adequacy of the case
history or life history which is of central importance for case study.
John Dollard has proposed seven criteria for evaluating such adequacy as
follows:
i) The subject must be viewed as a specimen in a cultural series. That is, the
case drawn out from its total context for the purposes of study must be
considered a member of the particular cultural group or community. The
scrutiny of the life histories of persons must be done with a view to identify
the community values, standards and their shared way of life.
ii) The organic motto of action must be socially relevant. That is, the action of the
individual cases must be viewed as a series of reactions to social stimuli or situation. In
other words, the social meaning of behaviour must be taken into consideration.
iii) The strategic role of the family group in transmitting the culture must be
recognized. That is, in case of an individual being the member of a family, the role of
family in shaping his behaviour must never be overlooked.
iv) The specific method of elaboration of organic material onto social
behaviour must be clearly shown. That is case histories that portray in detail
how basically a biological organism, the man, gradually blossoms forth into a
social person, are especially fruitful.
v) The continuous related character of experience for childhood through
adulthood must be stressed. In other words, the life history must be a
configuration depicting the inter-relationships between thee persons various
experiences.
vi) Social situation must be carefully and continuously specified as a factor.
One of the important criteria for the life history is that a persons life must be
shown as unfolding itself in the context of and partly owing to specific social
situations.
vii) The life history material itself must be organised according to some
conceptual framework.
4. Give the importance of frequency tables and discuss the principles of table
construction, frequency distribution and class intervals determination:
A frequency distribution is said to be skewed when its mean and median are different.
The kurtosis of a frequency distribution is the concentration of scores at the mean, or how
peaked the distribution appears if depicted graphicallyfor example, in a histogram. If
the distribution is more peaked than the normal distribution it is said to be leptokurtic; if
less peaked it is said to be platykurtic.
Letter frequency distributions are also used in frequency analysis to crack codes and refer
to the relative frequency of letters in different languages.
In musical set theory, an interval class (often abbreviated: ic), also known as unordered
pitch-class interval, interval distance, undirected interval, or (completely incorrectly)
interval mod 6 (Rahn 1980, 29; Whittall 2008, 27374), is the shortest distance in pitch
class space between two unordered pitch classes. For example, the interval class between
pitch classes 4 and 9 is 5 because 9 4 = 5 is less than 4 9 = 5 7 (mod 12). See
modular arithmetic for more on modulo 12. The largest interval class is 6 since any
greater interval n may be reduced to 12 n.
The concept of interval class was created to account for octave, enharmonic, and
inversion equivalency
Ans.
a.Type I error and type II error
In statistics, the terms type I error (also, error, false alarm rate (FAR) or false
positive) and type II error ( error, miss rate or a false negative) are used to describe
possible errors made in a statistical decision process. In 1928, Jerzy Neyman (1894-1981)
and Egon Pearson (1895-1980), both eminent statisticians, discussed the problems
associated with "deciding whether or not a particular sample may be judged as likely to
have been randomly drawn from a certain population" (1928/1967, p. 1), and identified
"two sources of error", namely:
Type I (): reject the null hypothesis when the null hypothesis is true, and
Type II (): fail to reject the null hypothesis when the null hypothesis is false
Type I error, also known as an "error of the first kind", an error, or a "false
positive": the error of rejecting a null hypothesis when it is actually true. Plainly
speaking, it occurs when we are observing a difference when in truth there is none, thus
indicating a test of poor specificity. An example of this would be if a test shows that a
woman is pregnant when in reality she is not. Type I error can be viewed as the error of
excessive credulity.
Type II error, also known as an "error of the second kind", a error, or a "false
negative": the error of failing to reject a null hypothesis when it is in fact not true. In
other words, this is the error of failing to observe a difference when in truth there is one,
thus indicating a test of poor sensitivity. An example of this would be if a test shows that
a woman is not pregnant, when in reality, she is. Type II error can be viewed as the error
of excessive skepticism.
TWO-TAILED t-TESTS
A two-tailed t-test divides a in half, placing half in the each tail. The null hypothesis in
this case is a particular value, and there are two alternative hypotheses, one positive and
one negative. The critical value of t, tcrit, is written with both a plus and minus sign ( ).
For example, the critical value of t when there are ten degrees of freedom (df=10) and a is
set to .05, is tcrit= 2.228. The sampling distribution model used in a two-tailed t-test is
illustrated below:
ONE-TAILED t-TESTS
There are really two different one-tailed t-tests, one for each tail. In a one-tailed t-test, all
the area associated with a is placed in either one tail or the other. Selection of the tail
depends upon which direction tobs would be (+ or -) if the results of the experiment came
out as expected. The selection of the tail must be made before the experiment is
conducted and analyzed.
The value tcrit would be negative. For example, when a is set to .05 with ten degrees of
freedom (df=10), tcrit would be equal to -1.812.
1. If tOBS = 3.37, then significance would be found in the two-tailed and the positive one-
tailed t-tests. The one-tailed t-test in the negative direction would not be significant,
because was placed in the wrong tail. This is the danger of a one-tailed t-test.
2. If tOBS = -1.92, then significance would only be found in the negative one-tailed t-test.
If the correct direction is selected, it can be seen that one is more likely to reject the null
hypothesis. The significance test is said to have greater power in this case.
The selection of a one or two-tailed t-test must be made before the experiment is
performed. It is not "cricket" to find a that tOBS = -1.92, and then say "I really meant to do
a one-tailed t-test." Because reviewers of articles submitted for publication are sometimes
suspicious when a one-tailed t-test is done, the recommendation is that if there is any
doubt, a two-tailed test should be done.
The p value identifies the likelihood tt a particular outcome may have occurred by
chance.
X(Ht)-cm 17 17 17 17 17 18 18 18 18 193
4 5 6 7 8 2 3 6 9
Y (Wt)- 61 65 67 68 72 74 80 87 92 95
Kg
The above formula defines the population correlation coefficient, commonly represented
by the Greek letter (rho). Substituting estimates of the covariances and variances based
on a sample gives the sample correlation coefficient, commonly denoted r :
An equivalent expression gives the correlation coefficient as the mean of the products of
the standard scores. Based on a sample of paired data (Xi, Yi), the sample Pearson
correlation coefficient is
where
are the standard score, sample mean, and sample standard deviation.