STATS101 Assignment 1

1
Minghan Yong (478510511)

1. I have attempted the Diagnostic Quiz. ID number: 478510511
2.
a. The minimum percentage required in the exam to pass the
course is 45%.
b. The remaining 80% of the grade is determined by taking the
higher score of these two combinations: 10% from the MidSemester Test and 70% from the Exam, or 20% from the MidSemester Test and 60% from the Exam.
3. I have attempted the iNZight Quiz. ID number: 478510511
4.
a. The variable in this plot is the swim times (in minutes) of the
competitors and it is numerical. The plot used is a dot plot and
a box plot. The distribution of swim times is roughly
symmetrical, with the competitors swimming a mean of 40.36
minutes. There is also a small spread. However, there are a
few outliers, with the slowest competitor clocking a swim time
of 87.92 minutes. This could be due to a chance mishap that
occurred on the day of the competition such as a cramp, or
veering off course.
b. The variables in this plot are the swim times (in minutes) of
the competitors, a numerical variable, and the gender of the
competitors, a categorical variable. The plots involved are
side-by-side plots of dot plots and box plots on the same
scale. In this plot, it can be seen that there are fewer female
(164) than male (545) competitors. Both distributions are
roughly symmetrical, and unimodal. There is lesser spread in
the female competitors times, as there is a standard
deviation of 7.694 compared to 8.492 in the male competitors
times. Male competitors on average clocked a time of 39.81
minutes, swimming faster than female competitors, who
clocked a mean of 42.21 minutes. Contrary to expectations,
the slowest female competitor swam faster than the slowest
male competitor. This could be due to the male competitor
meeting with an accident or such unfortunate events.
c. The variables in this plot are the swim times (in minutes) of
the competitors, and the bike times (in minutes), both
numerical variables. The plot used is a scatter plot. There is a
strong linear positive association between the swim times and
the bike times The scatter is non-constant as there is more
scatter as the swim times increase. There is an outlier, as the
competitor with the longest swim time completed the bike
section of the race in a relatively short time. This can be
attributed to a one-off incident in the water, which the
competitor recovered from when he resumed the bike portion
of the race.
d. The variables in this plot are the swim times (in minutes) of
the competitors, the bike times (in minutes) of the
competitors, both numerical variables, and the gender of the
competitors which is a categorical variable. The plots used are
side-by-side scatter plots. There is a similar distribution of the
swim versus bike times of both males and females, with both
having strong positive linear associations between their swim
and bike times. The scatter pattern is also consistent in both
females and males, with both having non-constant scatter,
and increasing scatter as swim times increase. However, in
general, there is more scatter in the distribution of the female
competitors than the males. Also, there are no outliers in the

female competitors distribution.
5.
a. pr(female competitors) = Females/Total competitors =
164/709 = 0.2313
b. pr(male competitors who were from Australia) = pr(Australian
given male) = 92/545 = 0.169
c. Australia. pr(18-34 of competitors from Australia) = 39/121 =
0.3223
d. pr(female professional competitors) = pr(female given
professional) = 8/27 = 0.296
6.
a.
b. pr(gamers favourite game type is puzzle) = 1241/10000 =

0.1241
c. pr(females of gamers whose favourite game type is puzzle) =
pr(female given puzzle) = 658/1241 = 0.53
7.
a.
i. For study 1, the treatment being compared is the level
of blood alcohol. For study 2, the factor of interest being
compared is the gender.
ii. For study 1, the reaction time as measured by the
stimulator was the variable being measured. For study
2, the number of correctly identified emotions out of five
photos shown was the variable being measured.
iii. Study 1 is an experiment. This is because the subjects
were given varying amounts of alcohol and treated
differently by the researcher to yield the results. Study 2
is an observational study. An experiment could not have
been carried out in this case, as gender cannot be
assigned to test subjects; it is an inherent trait each
subject has.
b. The results cannot be used to argue that the overall difference
in the reaction time is due to the amount of alcohol
consumed. When the 80 adult volunteers were randomly
allocated into the five different groups, it should have
eliminated any differences between the groups that could
possibly account for the difference in reaction time such as
height, gender, or socioeconomic factors. However, the
sample size might be too small as there are only 16 units in
each group. This might have resulted in a difference being

detected only by chance. The only concern with generalizing
the results is that the sample size is too small and might not
be accurate.
c. Study 1 used a control group. A control group is needed to
determine that there are no such effects when the treatment
is absent, which would indicate that any difference is due to
the alcohol level difference.
d. The study could utilise the randomized block design, which
groups the adults by gender first to make sure that each
group has an equal number of both males and females, so as
to eliminate any differences caused by gender.

STATS101 Assignment 1

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

STATS101 Assignment 1

Uploaded by

Copyright:

Available Formats

1

Minghan Yong (478510511)

competitors than the males. Also, there are no outliers in the

b. pr(gamers favourite game type is puzzle) = 1241/10000 =

each group. This might have resulted in a difference being

You might also like