
Biostatistics 201

MOCK EXAM
Second Long Exam
Instructions: Write TRUE if the statement is correct. If the statement is
false, change the underlined word to make the statement correct.
1. In the sampling distribution of the mean, $\sigma_{\bar{x}} = \sigma / \sqrt{n}$.
2. The sampling distribution of the mean, with N=30,000, of a moderately
negatively skewed distribution is approximately normal.
3. The range of the upper bound and lower bound limit increases when the
proportion of successes approaches 0.50.
4. Comparison of the effects of a drug between two different groups of
patients involves statistical inference.
5. s denotes the standard deviation when the population variance is
unknown.
6. The probability distribution of the mean of all samples of size n is called
the sampling distribution of the mean.
7. The determination of the sample mean is not an example of statistical
inference.
8. If the population variance is known, we should use the t-test.
9. Hypothesis testing is a type of descriptive statistics.
10. [expression in $\bar{x}$ lost in the source] is true for the sampling distribution of the mean.
11. In estimating proportions, nP<5 and nQ>5.
12. The population has a mean of 14 and a standard deviation of 3. The
sample size of your sampling distribution is N=10. The mean of the sampling
distribution of the mean is 14.
13. The Central Limit Theorem provides that the sampling distribution of the
mean of any independent random variable will be normal or nearly normal, if
the sample size is large enough.
14. The proportion of successes in the population is 0.4. The proportion of
successes in the sampling distribution of the proportion is 0.3.
15. Estimation is not statistical inference.
16. If the population variance is unknown, we should use the z-test.
17. When estimating proportions, the standard error is equal to the square
root of the proportion of successes in the sample multiplied by the proportion
of failures in the sample divided by the sample size.
18. Out of 300 students in the school, 225 passed an exam. The mean of the
sampling distribution of the proportion of students who passed the exam in
the school is 0.75.
19. The lower bound of a sample with n = 50 and p = 0.4 at the 95% confidence
level is 0.35.
20. Out of 300 students in the school, 225 passed an exam. The standard
error of the sampling distribution of the proportion of students who passed
the exam in the school is 0.137.
21. The proportion of failures is equal to one minus the proportion of
successes.
22. The upper bound of a sample with n = 50, a mean of 2, and a standard
deviation of 1 at the 95% confidence level is 4.6.
23. The interval estimates of a sample proportion can be computed even if
only the sample size and the point estimate are known.
24. Generalizing the mean of a sample to a population is statistical inference.
25. The population has a mean of 30 and a standard deviation of 6. The
sample size of your sampling distribution is N=9. The variance of the
sampling distribution of the mean is 4.
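Several of the statements above (items 18 to 20, 22, and 25) turn on standard-error and interval arithmetic. The following is a minimal sketch of that arithmetic, not an official answer key. The function names are hypothetical, and the multiplier z = 1.96 for a 95% confidence level is an assumption taken from the standard normal table.

```python
import math

def se_mean(sigma, n):
    """Standard error of the sample mean: sigma / sqrt(n)."""
    return sigma / math.sqrt(n)

def se_proportion(p, n):
    """Standard error of a sample proportion: sqrt(p * (1 - p) / n)."""
    return math.sqrt(p * (1 - p) / n)

def interval(estimate, se, z=1.96):
    """Two-sided interval estimate: estimate -/+ z * se (z = 1.96 assumed for 95%)."""
    return estimate - z * se, estimate + z * se

# Item 25: population SD 6, sample size 9 -> variance of the sampling distribution
var_of_mean = se_mean(6, 9) ** 2

# Items 18 and 20: 225 of 300 students passed
p_pass = 225 / 300
se_pass = se_proportion(p_pass, 300)

# Item 19: n = 50, p = 0.4
ci_prop = interval(0.4, se_proportion(0.4, 50))

# Item 22: n = 50, mean 2, SD 1
ci_mean = interval(2, se_mean(1, 50))

print(f"variance of sampling distribution (item 25): {var_of_mean:.2f}")
print(f"p = {p_pass:.2f}, standard error (item 20): {se_pass:.3f}")
print(f"95% interval for the proportion (item 19): {ci_prop[0]:.3f} to {ci_prop[1]:.3f}")
print(f"95% interval for the mean (item 22): {ci_mean[0]:.3f} to {ci_mean[1]:.3f}")
```

Comparing these values with the figures quoted in each statement shows which statements need correcting.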
Instructions: Read the problem, then answer the following questions. Please
write the letter of the correct answer on your paper.
For numbers 26-30, please refer to the following choices:
A. Find the critical region
B. State the statistical decision
C. Set the level of significance
D. State the null and alternative hypothesis
E. Choose the test statistic
F. None of the above
26. After this, the test statistic will be computed.
27. In this step, we decide whether we accept or reject the null hypothesis.
28. Hypothesis testing begins with this step.
29. This step tells how confident the researcher is about the study.
30. Arrange the steps in chronological order (except for F).
For numbers 31-35, please refer to this problem:


The Centers for Disease Control (CDC) reported on trends in weight, height
and body mass index from the 1960s through 2002. In 2002, the mean
weight for men was reported at 191 pounds. To determine whether the mean
weight for men today is still the same as that of 2002, a researcher recruited
100 men and measured their body weights. The mean weight of the sample
was 197.1 pounds and the standard deviation was 25.6. (Use alpha = 0.05)
31. What is the primary variable of interest?
A. Weight of men
B. Mean weight for men
C. Weight of the sample
D. Weight in 2002
E. None of the above
32. What is the alternative hypothesis in statement form?
A. The mean weight for men today is equal to that in 2002.
B. The mean weight for men today is not equal to that in 2002.
C. The mean weight for men today is greater than that in 2002.
D. The mean weight for men today is less than that in 2002.
E. None of the above.
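33. What formula should be used in order to answer the problem?
A-D. [formula options garbled in the source; the surviving fragments show z and t statistics written in terms of $\bar{x}$, $s$, and $n$]
E. None of the above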
34. What is the critical region?
A. z < -1.64; z > 1.64
B. z < -1.96; z > 1.96
C. z < -2.58; z > 2.58
D. t < -1.98; t > 1.64
E. t < -1.98; t > 1.98
35. What is the statistical decision?


A. Accept the null hypothesis
B. Reject the alternative hypothesis
C. Do not reject the null hypothesis
D. Reject the null hypothesis.
E. None of the above
For numbers 36-40, please refer to this problem:
A research study measured the pulse rates of 57 college men and found a
mean pulse rate of 70.42 beats per minute. Researchers want to know if the
mean pulse rate for all college men is different from the current standard of
72 beats per minute. From a previous study, it was known that the standard
deviation of the pulse rates of the population was 9.95 beats per minute. (Use
alpha = 0.05)
36. What is the critical region?
A. z < -1.64; z > 1.64
B. z < -1.96; z > 1.96
C. z < -2.58; z > 2.58
D. t < -1.98; t > 1.64
E. t < -1.98; t > 1.98
37. What is the null hypothesis in notation form?
A-D. [four statements relating $\mu$ to 72; the relational symbols are lost in the source]
E. None of the above
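38. What formula should be used in order to solve this problem?
A. [z statistic; formula garbled in the source]
B. [t statistic; formula garbled in the source]
C. [t statistic; formula garbled in the source]
D. [z statistic; formula garbled in the source]
E. Any of the above.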
39. What is the conclusion?


A. The mean pulse rate of college men is not equal to the current standard.

B. There is no sufficient evidence to say that the mean pulse rate of college
men is not equal to the current standard.
C. The mean pulse rate of college men is equal to the current standard.
D. There is sufficient evidence to say that the mean pulse rate of college men
is equal to the current standard.
E. There is no sufficient evidence to say that the mean pulse rate of college
men is equal to the current standard.
40. What is the computed value of the test statistic?
A. -1.5
B. -1.9
C. -1.2
D. 1.2
E. 0.12
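The test statistics for the two one-sample problems above (numbers 31-35 and 36-40) share the same form: the difference between the sample mean and the hypothesized mean, divided by the standard error. The sketch below is a hedged illustration, not the exam's answer key. The function name is hypothetical, and the cutoffs 1.96 and 1.98 are taken from the answer choices listed in the critical-region questions.

```python
import math

def one_sample_statistic(xbar, mu0, sd, n):
    """(sample mean - hypothesized mean) / (sd / sqrt(n)).

    With a known population SD this is a z statistic;
    with a sample SD it is a t statistic on n - 1 degrees of freedom.
    """
    return (xbar - mu0) / (sd / math.sqrt(n))

# Weight problem: n = 100, sample mean 197.1, sample SD 25.6, hypothesized mean 191
weight_stat = one_sample_statistic(197.1, 191, 25.6, 100)

# Pulse problem: n = 57, sample mean 70.42, population SD 9.95, hypothesized mean 72
pulse_stat = one_sample_statistic(70.42, 72, 9.95, 57)

# Two-sided cutoffs at alpha = 0.05, as they appear among the answer choices
T_CUTOFF = 1.98   # t cutoff (assumed appropriate for about 99 degrees of freedom)
Z_CUTOFF = 1.96   # standard normal cutoff

print(f"weight: statistic = {weight_stat:.2f}, in critical region: {abs(weight_stat) > T_CUTOFF}")
print(f"pulse:  statistic = {pulse_stat:.2f}, in critical region: {abs(pulse_stat) > Z_CUTOFF}")
```

A statistic inside the critical region leads to rejecting the null hypothesis; otherwise the null hypothesis is not rejected.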
For numbers 41-45, please refer to this problem:
In a study of the television viewing habits of children, a developmental psychologist selects a random
sample of 300 first graders - 100 boys and 200 girls. Each child is asked which of the following TV
programs they like best: The Lone Ranger, Sesame Street, or The Simpsons. Results are shown in the
contingency table below.

                      Viewing Preferences
                Lone Ranger   Sesame Street   The Simpsons   Row total
Boys                 50             30              20           100
Girls                50             80              70           200
Column total        100            110              90           300

(Use a 0.05 level of significance.)

41. What is the null hypothesis?


A. Boys and girls have the same viewing preferences.
B. Boys and girls have different viewing preferences.

C. Girls have different viewing preferences.


D. Boys have same viewing preferences
E. None of the above

42. What are the degrees of freedom?


A. 2
B. 3
C. 4
D. 5
E. 6

43. What kind of test should be used?


A. Test for two proportions
B. Chi-square test of association
C. Chi-square test of homogeneity
D. Correlation analysis
E. Test of two means

44. What is the statistical decision?


A. Accept the null hypothesis
B. Reject the alternative hypothesis
C. Do not reject the null hypothesis
D. Reject the null hypothesis.
E. None of the above

45. Can a z-test be used to solve this problem?


A. Yes
B. No
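For the viewing-preferences table, a chi-square statistic can be computed directly from the observed counts. The following is a minimal sketch in plain Python, assuming the usual expected count (row total times column total, divided by the grand total). The function name is hypothetical, and the cutoff 5.991 is the standard chi-square table value for 2 degrees of freedom at alpha = 0.05.

```python
def chi_square(observed):
    """Return (statistic, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count under the null hypothesis of no difference
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (obs - expected) ** 2 / expected

    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return statistic, df

# Rows: boys, girls. Columns: Lone Ranger, Sesame Street, The Simpsons.
viewing = [
    [50, 30, 20],
    [50, 80, 70],
]

chi2_stat, chi2_df = chi_square(viewing)
CRITICAL_05_DF2 = 5.991  # table value, chi-square with 2 df at alpha = 0.05

print(f"chi-square = {chi2_stat:.3f}, df = {chi2_df}")
print(f"in critical region: {chi2_stat > CRITICAL_05_DF2}")
```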
For numbers 46-50, please refer to this problem:
In the Physicians' Health Study subjects were treated with either aspirin or a
placebo. The main recorded outcome was whether or not the subject suffered
a heart attack.
Below is a table representing the data and its expected frequencies (printed
below observed counts)

             Yes          No             Total
Aspirin      104          10933          11037
             146.52       10890.48
Placebo      189          10845          11034
             146.48       10887.52
Total        293          21778          22071

46. Can a z-test be used to solve this problem?


A. Yes
B. No
47. Will this data pass the requirements for the test that will be used?
A. Yes
B. No
C. Not enough information to answer this question
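48. What formula can be used to solve this problem?
A. [z statistic; formula garbled in the source]
B. [t statistic; formula garbled in the source]
C. [statistic built from $(O - E)^2 / E$; formula garbled in the source]
D. A and B
E. A and C
F. A, B and C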

49. What is the statistical decision?


A. Accept the null hypothesis
B. Reject the alternative hypothesis
C. Do not reject the null hypothesis
D. Reject the null hypothesis.
E. None of the above
50. What test was used for this problem?
A. Chi-square test of homogeneity
B. Chi-square test of association
C. Simple linear regression analysis
D. Correlation analysis
E. Test for two means
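Because the aspirin table is 2 by 2, the comparison can also be framed as a test of two proportions. The sketch below is an illustration under stated assumptions, not the exam's prescribed method: it uses the pooled two-proportion z statistic (function name hypothetical) and also recomputes one expected frequency from the table. For a 2 by 2 table, the square of this z statistic equals the chi-square statistic computed without a continuity correction.

```python
import math

def two_proportion_z(x1, n1, x2, n2):
    """Pooled z statistic for the difference between two sample proportions."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se

# Heart attacks: 104 of 11037 on aspirin, 189 of 11034 on placebo
z_stat = two_proportion_z(104, 11037, 189, 11034)

# Expected count for the (Aspirin, Yes) cell: row total * column total / grand total
expected_aspirin_yes = 11037 * 293 / 22071

print(f"z = {z_stat:.2f}, z squared = {z_stat ** 2:.2f}")
print(f"expected (Aspirin, Yes) = {expected_aspirin_yes:.2f}")
```

The recomputed expected count can be checked against the value printed below the observed count in the table.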
For numbers 51-60 please refer to this problem:
A researcher was interested in studying factors that may influence income
status. Below are the findings of his research:

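. reg income age

      Source |       SS       df       MS              Number of obs =     294
-------------+------------------------------           F(  1,   292) =   11.14
       Model |  2516.98267     1  2516.98267           Prob > F      =  0.0010
    Residual |  65982.8711   292  225.968737           R-squared     =  0.0367
-------------+------------------------------           Adj R-squared =  0.0334
       Total |  68499.8537   293  233.787897           Root MSE      =  15.032

------------------------------------------------------------------------------
      income |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         age |  -.1620605   .0485581    -3.34   0.001    -.2576286   -.0664924
       _cons |   27.77274   2.328084    11.93   0.000     23.19079    32.35469
------------------------------------------------------------------------------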

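. reg income sex

      Source |       SS       df       MS              Number of obs =     294
-------------+------------------------------           F(  1,   292) =    9.81
       Model |  2226.25486     1  2226.25486           Prob > F      =  0.0019
    Residual |  66273.5989   292   226.96438           R-squared     =  0.0325
-------------+------------------------------           Adj R-squared =  0.0292
       Total |  68499.8537   293  233.787897           Root MSE      =  15.065

------------------------------------------------------------------------------
      income |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         sex |  -5.676414   1.812448    -3.13   0.002    -9.243531   -2.109297
       _cons |   29.78452   3.069062     9.70   0.000     23.74424    35.82481
------------------------------------------------------------------------------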

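. reg income educ

      Source |       SS       df       MS              Number of obs =     294
-------------+------------------------------           F(  1,   292) =   65.85
       Model |  12605.0595     1  12605.0595           Prob > F      =  0.0000
    Residual |  55894.7943   292  191.420528           R-squared     =  0.1840
-------------+------------------------------           Adj R-squared =  0.1812
       Total |  68499.8537   293  233.787897           Root MSE      =  13.835

------------------------------------------------------------------------------
      income |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
      educat |   5.004095   .6166622     8.11   0.000     3.790429    6.217761
       _cons |   3.162623   2.292435     1.38   0.169    -1.349168    7.674413
------------------------------------------------------------------------------

Income refers to the annual income in dollars.
Age is the age in years (output 1).
Sex is whether male or female (output 2).
Educ is the educational attainment: high school undergrad, high school grad,
college undergrad, or college grad (output 3).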

51. Interpret r-squared for the first output


A. 3.67% of the variability in income is explained by the variability in age.
B. 3.67% of the variability in income is explained by the variability in age.
C. 0.0367 of the variables explains income status.
D. 3.67% of the variables explains income status.
E. None of the above

52. What is the coefficient of correlation in the first output?


A. .0001
B. .1916
C. .4612
D. .2046
E. .2127
53. Is sex a significant predictor of income?
A. Yes
B. No
C. Sometimes
D. Insufficient data to answer this problem
E. None of the above
54. Is education a significant predictor of income?
A. Yes
B. No
C. Sometimes
D. Insufficient data to answer this problem
E. None of the above
55. Is age a significant predictor of income?
A. Yes
B. No
C. Sometimes
D. Insufficient data to answer this problem
E. None of the above
56. Describe the correlation in the first output.
A. Moderately positive
B. Moderately negative
C. Weak positive
D. Weak negative
E. Strong positive
57. What is the statistical decision for the third output?
A. Accept the null hypothesis
B. Reject the null hypothesis
C. Do not reject the null hypothesis
D. Reject the alternative hypothesis
E. None of the above
58. What is the null hypothesis for the second output?
A. Sex is a significant predictor of income.
B. Age is a significant predictor of income
C. Sex is associated with income.
D. Sex does not vary among the rich and the poor.
E. None of the above
59. What is the conclusion for the third output?
A. Education is related to income.
B. Education is associated with income.
C. Educated people have more income.

D. Education varies among the rich and the poor.


E. Education is a significant predictor of income.
60. Interpret the Beta coefficient in the first output.
A. A 1-year decrease in age results in a -0.16 decrease in income (in thousands of dollars)
B. A 1-year increase in age results in a -0.16 decrease in income (in thousands of dollars)
C. 16% of the age is related to income
D. A and C
E. None of the above
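The quantities printed in the first regression output are related to one another, which helps when answering numbers 51, 52, 55, and 56. The following is a hedged sketch that recomputes them from the sums of squares, coefficient, and standard error shown in that output. It relies on the simple-regression identity that the correlation coefficient is the square root of R-squared carrying the sign of the slope. Variable names are hypothetical.

```python
import math

# Values copied from the first output (reg income age)
ss_model, ss_residual, ss_total = 2516.98267, 65982.8711, 68499.8537
df_model, df_residual = 1, 292
coef_age, se_age = -0.1620605, 0.0485581

# R-squared: share of the total sum of squares explained by the model
r_squared = ss_model / ss_total

# F statistic: model mean square divided by residual mean square
f_stat = (ss_model / df_model) / (ss_residual / df_residual)

# t statistic for the slope: coefficient divided by its standard error
t_stat = coef_age / se_age

# Correlation coefficient: sqrt(R-squared) with the sign of the slope
r = math.copysign(math.sqrt(r_squared), coef_age)

print(f"R-squared = {r_squared:.4f}")
print(f"F = {f_stat:.2f}, t = {t_stat:.2f}, t squared = {t_stat ** 2:.2f}")
print(f"r = {r:.4f}")
```

With a single predictor, the squared t statistic for the slope matches the F statistic, so the two p-values in the output agree.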
TRUE OR FALSE: Nos. 61-65
Instruction: Write TRUE if the statement is true, FALSE if otherwise.
61. Pearson's r gives us the strength only of the relationship.
62. Correlation tells whether a relationship exists between two variables.
63. A strong positive correlation is denoted by an r>0.70.
64. Only two variables can be run using correlation analysis.
65. r values are from -1 to 1.

For numbers 66-70 please refer to this problem:


A researcher wants to know whether gestational age is related to weight or
not.
gest_age weight
34.7 1895
36 2030
29.3 1440
40.1 2835
35.7 3090
42.4 3827
40.3 3260
37.3 2690
40.9 3285
38.3 2920
38.5 3430
41.4 3657
39.7 3685
39.7 3345
41.1 3260
38 2680
38.7 2005

             |   weight gest_age
-------------+------------------
      weight |   1.0000
             |
    gest_age |   0.8175   1.0000
             |   0.0001
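The correlation coefficient reported for the gestational age data can be recomputed from the 17 listed pairs. This is a minimal sketch in plain Python using the usual Pearson formula. The function name is hypothetical, and the t statistic on n - 2 degrees of freedom is the standard test of a zero population correlation, included here as an assumption about how the reported significance value was obtained.

```python
import math

gest_age = [34.7, 36, 29.3, 40.1, 35.7, 42.4, 40.3, 37.3, 40.9,
            38.3, 38.5, 41.4, 39.7, 39.7, 41.1, 38, 38.7]
weight = [1895, 2030, 1440, 2835, 3090, 3827, 3260, 2690, 3285,
          2920, 3430, 3657, 3685, 3345, 3260, 2680, 2005]

def pearson_r(x, y):
    """Pearson correlation: Sxy / sqrt(Sxx * Syy)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

r_gest = pearson_r(gest_age, weight)

# t statistic for testing a zero population correlation, df = n - 2
n_pairs = len(gest_age)
t_gest = r_gest * math.sqrt(n_pairs - 2) / math.sqrt(1 - r_gest ** 2)

print(f"r = {r_gest:.4f}, t = {t_gest:.2f} on {n_pairs - 2} degrees of freedom")
```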

66. Describe the magnitude and direction of the correlation?


A. Strong positive
B. Weak negative
C. Moderately positive
D. Moderately negative
E. None of the above

67. What is the null hypothesis in notation form?
A-D. [four statements relating the population correlation coefficient to 0; the symbols are lost in the source]
E. None of the above
68. What is the statistical decision?
A. Accept the null hypothesis
B. Reject the null hypothesis
C. Do not reject the null hypothesis
D. Accept the alternative hypothesis
E. None of the above
69. What is the conclusion?
A. There is a linear relationship between gestational age and birthweight.
B. There is no linear relationship between gestational age and birthweight.
C. There is a linear relationship between gestational age and birthweight. In fact, it is a strong positive
correlation.
D. Gestational age is associated with birthweight
E. None of the above.
70. What is the conclusion in statement form?
A-D. [four statements relating the population correlation coefficient to 0; the symbols are lost in the source]
E. None of the above