Professional Documents
Culture Documents
(population mean)
(sample mean)
N= total number of observations in the population
n=total number of observations in the sample
Grouped Mean
n
x f
mean
m
=
.
Ungrouped Data: The median is the midpoint of the data array. Before finding this
value, the data is arranged in order, from least to greatest or vice versa. The median will
either be a specific value or will fall between two values.
Grouped Median
( )
md
L w
f
cf
n
Md median +
|
|
|
|
.
|
\
|
=
2
) (
Where n sum of frequencies
cf cumulative frequency of the class preceding/before the median
class
f frequency of the median class
w class width
L
md
lower boundary of the median class
The median class is the one that contains the midpoint of data.
Ungrouped Data: The modeis the value that occurs most often in the data set. A data
can have more than one or none at all.
Grouped Mode
w
d d
d
L Mo e
Mo
|
|
.
|
\
|
+
+ =
2 1
1
) ( mod
Where L
Mo
lower boundary of the modal class
w class width
d
1
difference of the frequency of the modal class and the class
preceding it
d
2
difference of the frequency of the modal class and the class
succeeding it
5
The modal class is the class with the largest frequency.
Midrange: This is a rough estimate of the middle value.
()
Weighted Mean: This is used to find the mean of the values of the data set that are not
equally represented. The weighted average can be found by multiplying the value by its
corresponding weight and dividing the sum of the products by the sum of their weights.
Geometric Mean:
Harmonic Mean:
Example 4 : A recent survey of a new cola reported the ff. percentages of people who
liked the taste. Find the weighted mean of the percentages.
Area %favored No. surveyed
1 40 1000
2 30 3000
3 50 800
Shapes of Distribution
a. Positively Skewed Distribution the majority of the data values falls to
the left of the mean and clusters to the lower end of the distribution.
b. Symmetrical Distribution the data values are evenly distributed on
both sides of the mean. Also, when the distribution is unimodal, the
mean, median, and mode are the same and are at the center of the
distribution.
c. Negatively Skewed Distribution the majority of the data values falls
to the right of the mean and clusters at the upper end of the
distribution.
Mean
Median
Mode
Symmetrical
y
0
x
Positively skewed
y
0
x
mode
median
mean
Negatively skewed
y
0
x
mode
median
mean
6
Measures of Variation for Grouped Data
Range difference between the largest and the smallest value in a given
data.
Variance and Standard Deviation
Ungrouped Data:
()
(population variance)
(standard deviation)
Unbiased estimator of the population variance:
( )
(sample variance)
Where ; x = observed value; n = sample size
s = sample standard deviation =
Grouped Data:
( )
( ) 1
2
2
2
=
n n
x f x f n
s
Where : x = class midpoint
Example 5: For 108 randomly selected high school students, the following IQ
frequency distribution were obtained.
Class Limits Frequency
90-98 6
99-107 22
108-116 43
117-125 28
126-134 9
Coefficient of Variation a statistic that allows us to compare two different data sets that
have different units of measurement.
For samples: % 100 =
x
s
CV
For populations: % 100
o
= CV
The data with larger CV is more variable.
Coefficient of Skewness
A measure to determine the skewness of a distribution is called Pearson
coefficient of skewness. The formula is
7
s
Md X
SK
|
|
.
|
\
|
=
__
3
Where
__
X -mean, Md-median, s standard deviation
When the distribution is symmetrical, the coefficient is zero; when the
distribution is positively skewed, the coefficient is positive; when the distribution
is negatively skewed, the coefficient is negative.
Example 6: Find the coefficient of skewness of a distribution with mean 10,
median 8 and standard deviation 3.
3.2.5 Measure of Kurtosis
Even if the curves of distributions have the same coefficient of skewness,
these curves may still differ in the sharpness of their peaks. The following figures
show different types of symmetrical curves.
Ungrouped Data:
( )
Grouped Data:
( )
Where x = class midpoint and s = sample st
d dev
A distribution is said to be : Mesokurtic if K=3
Leptokurtic if K>3
Platykurtic if K<3
Example 7: Calculate the measure of kurtosis for the data Example 5.
Measures of Position for Grouped Data
Standard Scores or Z scores measures the distance an observation and the mean,
measured in units of standard deviation.
s
x x
z
_
deviation standard
mean value
=
=
Mesokurtic
(normal)
Leptokurtic
(more peak)
Platykurtic
(flat-topped)
8
If z score is positive, the score is above the mean. If z =0, score = mean. If z <0,
score < mean.
Example 8: An IQ test has a mean of 105 and a standard deviation of 20. Find the
corresponding z score for each IQ.
a) 88 b) 122 c) 110
Grouped Data
The quartiles, deciles, percentiles can be determined using the following
formula.
w
f
cf kn
L |
.
|
\
|
+
Where k is equal to: i/4 for quartiles; i/10 for deciles; i/100 for percentiles
i ith quartile, decile, or percentile
L lower boundary of the quartile, decile or percentile class
n total number of observations
w class width
cf
p
frequency of the preceding class
f frequency of the quartile, decile or percentile
Example 9: Find the third quartile, 4
th
decile and 7
th
percentile for the given
frequency distribution below.
Class Boundaries Frequency cf
52.5-63.5 6 6
63.5-74.5 12 18
74.5-85.5 25 43
85.5-96.5 28 71
96.5-107.5 14 85
107.5-118.5 5 90