2. Suppose you are interested in knowing the impact on starting salary of a job candidate's
educational background and whether or not the job candidate had previous work
experience. Educational background was categorized as arts or science/engineering.
a. Which test would you use?
1. Simple linear regression
2. Multiple regression
3. Logistic regression
4. ANOVA
5. Two-way ANOVA
6. Repeated measures ANOVA
5. Researchers wanted to predict whether someone was hired or not based on their age in
years, gender, and years of education.
a. Which test would you use?
1. Simple linear regression
2. Multiple regression
3. Logistic regression
4. ANOVA
5. Two-way ANOVA
6. Repeated measures ANOVA
6. Regression analysis was used to predict the cost of milk in cents from the cost of a
barrel of corn in dollars. The resulting least-squares equation is y = 25 + 5x. The actual
cost of milk is 500 cents when corn costs 100 dollars a barrel. What is the residual?
9. Use the SPSS output shown below to answer all parts of this question.
Engineers want to use three pond characteristics: depth, surface strength, and surface
area, to predict whether ice type was landfast or not.
a. What is the regression equation?
d. Describe how the odds of pond ice being landfast change with depth.
10. Use the SPSS output shown below to answer all parts of this question:
Researchers investigated whether caffeine helps to counteract the effects of alcohol
consumption. They randomly assigned students to four groups that received alcohol (A),
alcohol with caffeine (AC), a placebo that looked and tasted like alcohol (P), or nothing
(AR). After 25 minutes, their performance on a memory task was recorded.
a. Does the data satisfy the assumptions of ANOVA?
11. Researchers investigated the mean whipping capacity for eggs that were randomly
selected from chickens that were raised in four different types of housing: cage, barn, free
range, and organic. The eggs were also classified as being either from a medium or large
weight class. The results from a two-way ANOVA are shown below.
a. Is there a significant main effect of weight class?
c. Is there an interaction?
12. The following questions refer to the SPSS output below. Researchers wanted to
investigate a program to help obsessive-compulsive disorder. They randomly selected 13
subjects with the mental illness and measured their percentage of time spent obsessing.
This was measured on entrance into the program, and every two months after for 6
months.
a. Are the assumptions for this test met?
b. Is there a significant effect of obsessive thoughts?
c. If you wanted to find out if there was a significant difference between the first and
last rating of obsessive thoughts, what test would you conduct?
13. Researchers wanted to test whether taking a GMAT prep course would improve
subjects' scores on the GMAT. They had students take the GMAT before the prep
course, midway through the course, and after the course was over. They then
compared their scores.
a. Which test would you use?
1. Simple linear regression
2. Multiple regression
3. Logistic regression
4. ANOVA
5. Two-way ANOVA
6. Repeated measures ANOVA
14. Researchers wanted to investigate how the amount spent on homes purchased in
Boston varied by whether it was cape or colonial style, and whether it was sold in the
spring, summer, fall, or winter. They randomly selected 100 of the homes sold in the last 2
years and recorded the season and style of the home.
a. What are the factors?
15. Researchers conducted an experiment investigating whether the average SAT score
differed for people with high or low short-term memory capacity. They randomly selected
100 incoming freshmen and tested their short-term memory. 50 were classified as having
low short-term memory capacity and 50 were classified as having high short-term memory
capacity. When the researchers plotted the data, they noticed it was not normally
distributed. To overcome this, they used the Wilcoxon rank sum test.
a. Find W.
b. If the Wilcoxon rank sum statistic for students with a low short-term memory capacity
is 1500 and the standard error is 145.1, what is the value of the test statistic?
c. Use the value you found for the test statistic to write your conclusion and conclusion
in context.
16. Researchers wanted to test whether staying up all night affected memory recall. They
randomly assigned subjects to three groups: one group stayed up all night, one group
stayed up for half of the night, and the third group slept normally. The next morning they
recorded their performance on a memory test and their averages were tallied.
a. Which test would you use?
1. Simple linear regression
2. Multiple regression
3. Logistic regression
4. ANOVA
5. Two-way ANOVA
6. Repeated measures ANOVA
Solutions
1. False
2. a. two-way ANOVA
b. H0: $\mu_{AW} = \mu_{ANW} = \mu_{SW} = \mu_{SNW}$; Ha: the means are not all equal.
3. a. simple linear regression
b. H0: $\beta_1 = 0$; Ha: $\beta_1 \neq 0$
4. a. P(tails) = 1 - P(heads) = 1 - .65 = .35
b. The odds of getting heads = P(heads) / (1 - P(heads)) = .65/(1 - .65) = .65/.35 = 1.857
c. odds ratio for heads = $\dfrac{\text{odds of heads}}{\text{odds of tails}} = \dfrac{.65/.35}{(1 - .65)/(1 - .35)} = \dfrac{1.857}{.538} = 3.449$
d. The odds of getting heads are 3.45 times the odds of getting tails.
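The arithmetic in solution 4 can be checked with a short Python sketch. The helper name `odds` and the variable names are illustrative assumptions; the only input taken from the solution is P(heads) = .65.

```python
def odds(p: float) -> float:
    """Return the odds p / (1 - p) for a probability strictly between 0 and 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must be strictly between 0 and 1")
    return p / (1.0 - p)


p_heads = 0.65                 # given in the question
p_tails = 1.0 - p_heads        # part a

odds_heads = odds(p_heads)     # part b
odds_tails = odds(p_tails)
odds_ratio = odds_heads / odds_tails   # part c: odds of heads relative to odds of tails

print(f"P(tails)      = {p_tails:.2f}")
print(f"odds of heads = {odds_heads:.3f}")
print(f"odds of tails = {odds_tails:.3f}")
print(f"odds ratio    = {odds_ratio:.3f}")
```

Rounded to three decimals, the printed values match the figures in parts a through c.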
5. a. Logistic regression
b. H0: $\beta_{\text{age}} = \beta_{\text{gender}} = \beta_{\text{education}} = 0$; Ha: the betas are not all zero
6. residual = observed value - predicted value = 500 - 525 = -25 cents
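As a quick check of solution 6, the sketch below plugs the corn price into the least-squares equation y = 25 + 5x and subtracts the prediction from the observed value. The function name `predict_milk_cents` is a hypothetical label, not something from the question.

```python
def predict_milk_cents(corn_dollars: float) -> float:
    """Predicted milk cost in cents from the fitted line y = 25 + 5x."""
    return 25 + 5 * corn_dollars


observed = 500                          # actual milk cost in cents
predicted = predict_milk_cents(100)     # corn at 100 dollars a barrel
residual = observed - predicted         # residual = observed - predicted

print(f"predicted = {predicted} cents, residual = {residual} cents")
```

A negative residual means the fitted line overpredicts the milk cost at this corn price.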
7. a. multiple regression
b. H0: $\beta_{\text{gender}} = \beta_{\text{age}} = \beta_{\text{parent}} = \beta_{\text{education}} = 0$; Ha: the betas are not all zero
8. a. $y = 30{,}855.911 - 191.567\,x_{\text{temp}}$
b. H0: $\beta_{\text{temp}} = 0$; Ha: $\beta_{\text{temp}} \neq 0$.
F = 107.323 and p is close to 0. Since p is small, we can reject the null hypothesis and
conclude that there is a relationship between temperature and failtime. These results
suggest that the model explains a significant amount of the variation in failtime. This is
supported by the high adjusted $R^2$ of .835, which suggests that the model accounts for
83.5% of the variation in failtime. However, the scatterplot of the data suggests that the
relationship between temperature and failtime might be curvilinear, and the residuals
do not appear to be normally distributed.
c. I would not recommend the above model. Even though the model is significant (p < .05),
the scatterplot of the data suggests a curvilinear relationship and the residual plot violates
the assumption of equal variance, as the residual plot appears to thicken.
12. a. Normality:
H0: the data is normally distributed
Ha: the data is not normally distributed.
The KS test of normality is nonsignificant, with all p-values greater than alpha of .05.
Therefore, we fail to reject the null hypothesis that the data is normally distributed and we
can conclude that the assumption of normality is met.
Sphericity:
H0: the variances of the pairwise differences are equal.
Ha: the variances of the pairwise differences are not equal.
Since Mauchly's test has a small p-value (.001), we reject the null hypothesis that the
variances of the differences are equal. Therefore, the assumption of sphericity is not met.
Independence:
We can treat this assumption as met because the study description states that
subjects were randomly selected.
We can proceed with the ANOVA analysis with caution, and use the Greenhouse-Geisser
corrected results to assess significance since the sphericity assumption is not met.
b. H0: $\mu_1 = \mu_2 = \mu_3 = \mu_4$
Ha: the means are not all equal.
Using the Greenhouse-Geisser results, the main effect for obsessive thoughts has a small
p-value of .018, and so we can reject the null hypothesis that the means are equal. This
suggests that the treatment program resulted in a change in percentage of time devoted
to obsessive thoughts.
c. Since the groups are not independent, a paired t-test is appropriate.
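For question 15b, the standardized Wilcoxon rank sum statistic can be sketched in Python as follows. This is a minimal sketch, assuming the usual large-sample normal approximation without a continuity correction; the variable names are illustrative. The group sizes (50 and 50), W = 1500, and the standard error of 145.1 come from the question.

```python
import math

n_low, n_high = 50, 50          # group sizes from the question
n_total = n_low + n_high
w_low = 1500                    # rank sum for the low-capacity group (given)
se_given = 145.1                # standard error stated in the question

# Mean and standard deviation of W under the null hypothesis
# (assumption: normal approximation, no continuity correction).
mu_w = n_low * (n_total + 1) / 2
sigma_w = math.sqrt(n_low * n_high * (n_total + 1) / 12)

z = (w_low - mu_w) / se_given

# Two-sided p-value from the standard normal distribution.
p_two_sided = math.erfc(abs(z) / math.sqrt(2))

print(f"mu_W = {mu_w}, sigma_W = {sigma_w:.1f}, z = {z:.2f}")
print(f"two-sided p-value = {p_two_sided:.3g}")
```

The computed standard deviation rounds to the 145.1 given in the question. A z value this far from zero corresponds to a very small p-value under the normal approximation.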
16. a. ANOVA
b. H0: $\mu_{\text{nosleep}} = \mu_{\text{halfsleep}} = \mu_{\text{fullsleep}}$
Ha: the means are not all equal.