Professional Documents
Culture Documents
b. The variables in this plot are the swim times (in minutes) of
the competitors, a numerical variable, and the gender of the
competitors, a categorical variable. The plots involved are
side-by-side plots of dot plots and box plots on the same
scale. In this plot, it can be seen that there are fewer female
(164) than male (545) competitors. Both distributions are
roughly symmetrical, and unimodal. There is lesser spread in
the female competitors times, as there is a standard
deviation of 7.694 compared to 8.492 in the male competitors
times. Male competitors on average clocked a time of 39.81
minutes, swimming faster than female competitors, who
clocked a mean of 42.21 minutes. Contrary to expectations,
the slowest female competitor swam faster than the slowest
male competitor. This could be due to the male competitor
meeting with an accident or such unfortunate events.
c. The variables in this plot are the swim times (in minutes) of
the competitors, and the bike times (in minutes), both
numerical variables. The plot used is a scatter plot. There is a
strong linear positive association between the swim times and
the bike times The scatter is non-constant as there is more
scatter as the swim times increase. There is an outlier, as the
competitor with the longest swim time completed the bike
section of the race in a relatively short time. This can be
attributed to a one-off incident in the water, which the
competitor recovered from when he resumed the bike portion
of the race.
d. The variables in this plot are the swim times (in minutes) of
the competitors, the bike times (in minutes) of the
competitors, both numerical variables, and the gender of the
competitors which is a categorical variable. The plots used are
side-by-side scatter plots. There is a similar distribution of the
swim versus bike times of both males and females, with both
having strong positive linear associations between their swim
and bike times. The scatter pattern is also consistent in both
females and males, with both having non-constant scatter,
and increasing scatter as swim times increase. However, in
general, there is more scatter in the distribution of the female
5.
a. pr(female competitors) = Females/Total competitors =
164/709 = 0.2313
b. pr(male competitors who were from Australia) = pr(Australian
given male) = 92/545 = 0.169
c. Australia. pr(18-34 of competitors from Australia) = 39/121 =
0.3223
d. pr(female professional competitors) = pr(female given
professional) = 8/27 = 0.296
6.
a.
7.
a.
i. For study 1, the treatment being compared is the level
of blood alcohol. For study 2, the factor of interest being
compared is the gender.
ii. For study 1, the reaction time as measured by the
stimulator was the variable being measured. For study
2, the number of correctly identified emotions out of five
photos shown was the variable being measured.
iii. Study 1 is an experiment. This is because the subjects
were given varying amounts of alcohol and treated
differently by the researcher to yield the results. Study 2
is an observational study. An experiment could not have
been carried out in this case, as gender cannot be
assigned to test subjects; it is an inherent trait each
subject has.
b. The results cannot be used to argue that the overall difference
in the reaction time is due to the amount of alcohol
consumed. When the 80 adult volunteers were randomly
allocated into the five different groups, it should have
eliminated any differences between the groups that could
possibly account for the difference in reaction time such as
height, gender, or socioeconomic factors. However, the
sample size might be too small as there are only 16 units in