Starting from:

$29.99

ISE529 Midterm Exam Solution


Linear Model Analysis
For this problem we will be working with the following dataset:


First, we create three models using X1, X2, and the combination of X1 & X2 to predict Y:

1A) For the two simple (single-predictor) models, are the predictors X1 & X2 significant?
1B) For the multiple regression model, which predictors are significant?
1C) How do you interpret what is going on here?
Now we incorporate the categorical variable into the model by creating a dummy variable “Blue” and incorporate it into the model as shown:

1D) Does adding this categorical variable to the model improve it’s overall performance? Why or why not?
1E) Looking at this color-coded scatterplot of X1 vs Y, do you see any indication of an interaction effect between X1 and X3? Why or why not?

1E) Looking at these model results, do you see any indication of an interaction effect between X1 and X3? Why or why not?

After completing your modeling analysis, you decide to use the model shown below:

1F) Write out the algebraic expression for this model (you do not need to include the error term):
1G) Write out the simplified algebraic expression for this model for the Blue observations
1H) Write out the simplified algebraic expression for this model for the Red observations

• 𝑋𝑃: Population of the city (in thousands of people)
• 𝑋𝐼: Average income of the city (in thousands of dollars per adult)
• 𝑋𝑇: Type of store (1 for downtown store, 0 for a mall store)
• 𝑋𝑃𝐼: Interaction between population and average income (in thousands)
• 𝑋𝐼𝑇: Interaction between average income (in thousands) and store type
In the cities we are evaluating, the average income is generally less than $100,000 and the cities are in the size range of 0 – 500,000 people
After fitting this model using a linear regression, we get the following coefficients: መ = 20, 𝛽መ𝐼 = 50, 𝛽መ𝑇 = 350, 𝛽መ𝑃𝐼 = 0.05, 𝛽መ𝐼𝑇 = - 5 𝛽መ0 = 10, 𝛽𝑃
2a) Which answer is correct:
a) For a fixed value of population and average income, a downtown store would on average have greater sales than a mall store
b) For a fixed value of population and average income, a mall store would on average have greater sales than a downtown store
c) For a fixed value of population and average income, a downtown store would on average have more sales than a mall store provided that the average income is high enough
d) For a fixed value of population and average income, a mall store would on average have more sales than a downtown store provided that the average income is high enough Response:
2B) What is the predicted sales for a downtown store in a city with a population of 100,000 and an average income of $50,000?
• $
2C) Is this statement true or false and why: “Since the coefficient of the interaction term between population and average income is very small, there is very little evidence of an interaction effect:
2D) Which predictor has the larger impact on sales, income or city population? Explain your answer

Problem 3
You are assessing two candidate models (M1 through M4). You try training the models ten different times with different population samples and then assessing those models against test partitions by calculating their mean squared errors (MSE). The results of those tests are summarized on the following page.
Complete the figure on the bottom of the following page with one model for each of the four boxes.
Problem 3

Low Variance High Variance
Low Bias
High Bias
Problem 4
4A) Explain in your own words how k-fold cross-validation is implemented
Problem 4
4B) Provide one advantage and one disadvantage of k-fold cross validation relative to:
• The validation set approach?
– Advantage:
– Disadvantage:
• Leave-Out-One-Cross-Validation?
– Advantage:
– Disadvantage:
Problem 5
Residuals Analysis
The following pages present a residuals diagram and a residuals histogram for each of six different models. For each model, identify the apparent problem(s) with the model and provide one technique that you might use to remediate (correct) the problem.

Residuals Analysis

Model Issue:
Possible remediation:
Residuals Analysis

Model Issue:
Possible remediation:
Residuals Analysis

Model Issue:
Possible remediation:
Residuals Analysis

Model Issue:
Possible remediation:
Residuals Analysis

Model Issue:
Possible remediation:
Residuals Analysis

Model Issue:
Possible remediation:

More products