Professional Documents
Culture Documents
15-2
2) Frequency Distribution
3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability
15-3
15-4
15-5
Frequency Distribution
In a frequency distribution, one variable is considered at a time. A frequency distribution for a variable produces a table of frequency counts, percentages, and cumulative percentages for all the values associated with that variable.
15-6
15-7
8 7 6
Frequency Histogram
Frequency
5 4 3 2 1 0 2 3 4 5
Familiarity
15-8
X The mean, or average value, is the most commonly used measure of central tendency. The mean, ,is given by X = S X i /n
i =1
Where, Xi = Observed values of the variable X n = Number of observations (sample size) The mode is the value that occurs most frequently. It represents the highest peak of the distribution. The mode is a good measure of location when the variable is inherently categorical or has otherwise been grouped into categories.
15-9
15-10
Xsmallest.
The interquartile range is the difference between the 75th and 25th percentile. For a set of data points arranged in order of magnitude, the pth percentile is the value that has p% of the data points below it and (100 - p)% above it.
15-11
The variance is the mean squared deviation from the mean. The variance can never be negative. The standard deviation is the square root of the variance. n (Xi - X)2 sx = i =1 n - 1
The coefficient of variation is the ratio of the standard deviation to the mean expressed as a percentage, and is a unitless measure of relative variability.
CV = sx/X
15-12
15-13
Skewed Distribution
15-14
Cross-Tabulation
While a frequency distribution describes one variable at a time, a cross-tabulation describes two or more variables simultaneously. Cross-tabulation results in tables that reflect the joint distribution of two or more variables with a limited number of categories or distinct values.
15-15
15-16
15-17
15-18
15-19
15-20
15-21
College Degree
College Degree
No College Degree
15-22
15-23
15-24
15-25
Family size Small Large Yes 65% 65% No 35% 35% Column totals 100% 100% Number of respondents 250 250
15-26
If the calculated value is less than the critical value, accept the null hypothesis otherwise reject it
Hypothesis Tests
Non-parametric Tests (Nonmetric Tests) One Sample * * * * Chi-Square K-S Runs Binomial Two or More Samples
* * * *
15-28