$30
1. Practice with MME
(a) The Gamma(x, y) distribution has mean π₯ β π¦ and variance π₯ β π¦ . Find MME for π₯ and π¦.
(b) Find MME π and π for the Uniform(a, b) distribution. Express your final answer in terms of the
sample mean, π ∑ π /π, and sample variance, π ∑ π /π π .
Consistency of MLE
Let π , π , … , π be distributed as Exponential(1/β), all i.i.d. Show that the MLE(π½) will converge to the unknown parameter β. Prove this by showing that bias(π½) and se(π½ tends to 0 as n tends to ∞. You can use the fact that the mean and variance of Exponential(λ) are 1/λ and 1/λ2, respectively.
Practice with MLE
(a) Let π , π , … , π be distributed i.i.d. as Poisson(λ). Find the MLE of λ. (3 points)
(b) Let π , π , … , π be distributed i.i.d. as Normal(µ, σ2). Show that the MLE of µ and σ2 is the same as
the sample mean and (uncorrected) sample variance, respectively. (4 points) (c) Let π , π , … , π ~ Normal(θ, 1). Let δ =πΈ πΌ . Use the Equivariance property to show that the
MLE of δ is π· ∑ π , where π· is the CDF of the standard Normal. You can use the MLE of the
Normal as provided in 3(b).
Parametric Inference with Data Samples
2 π€ππ‘β ππππ π
Let π , where π is unknown. Let D = {2, 3, 2} be drawn i.i.d. from X.
3 ππ‘βπππ€ππ π
(a) Derive π using D as the sample data. Clearly show all your steps. (3 points)
(b) Provide a numerical estimate of the 95%ile confidence intervals for π . Start by deriving π π π : first derive π π π in terms of π, and then estimate π π π , as in class. Show all
your steps. Your final answer should be a numerical range. (4 points) (c) Derive π using D as the sample data. Clearly show all your steps.
MME versus MLE using real data
For this question, we will use the acceleration, model, and mpg data from the Autoβmpg dataset (https://www.kaggle.com/uciml/autompgβdataset). Please use the data files on the class website. We will assume that acceleration is Normal(μ, σ2) distributed, model year is Uniform(a, b) distributed, and mpg is Exponential(λ) distributed. You are to find the MME and MLE estimates of the parameters of the distributions for all 3 datasets. For the Normal MME and Uniform MLE, you can directly use the results from class. For the Normal MLE, use the result from Q3(b); for Uniform MME, use the result from Q1(b).
For the Exponential, we will first derive the estimates.
(a) For the Exp(λ) distribution, find the π .
(b) For the Exp(λ) distribution, find the π .
(c) For the 3 datasets, find the MME estimates. That is, find the MME for μ and σ2 for the acceleration dataset, a and b for the model dataset, and λ for the mpg dataset. Provide your answer as a number
with 3 significant digits.
(d) Same as part (c), but this time find the MLE estimates.
Clinical Testing
Consider the sick patient example from class. In a clinical trial of a new disease detection test, there were 100 healthy patients and 100 sick patients. The test correctly identified 98 out of the 100 healthy patients as healthy. The test also correctly identified 99 of the 100 sick patients as sick. The remaining patients were incorrectly classified.
(a) What is the precision of the test?
(1 point)
(b) What is the recall of the test?
(1 point)
(c) What is the Type I error of the test?
(1 point)
(d) What is the Type II error of the test?
(
Wald’s test
(a) Suppose the null hypothesis is H0: θ = θ0, but the true value of θ is θ*. Show that, under Wald’s test, the probability of a Type II error is π· ∗ π§/ π· ∗ π§/ .
(Hints: (i) might help to draw a figure; (ii) think about the distribution of the estimate.)
(b) You observe 46 successes in 100 trials of a coin. If the null hypothesis is that the coin is unbiased, use the Wald’s test with the MLE or MME with α = 0.05 to Reject/Accept the null. What if the null
hypothesis is that the coin has p=0.7?
More on Wald’s test
(a) Use q8_a.csv dataset and assume it is distributed as Normal(θ, σ2). Apply the Wald’s test with α =
0.02 to check whether the true mean is π 0.5. Use sample mean to obtain π and corrected
sample variance estimator for obtaining π .
(b) Use q8_b_X.csv and q8_b_Y.csv available at the class website for this question. Each contains 750 samples for X and Y drawn from two independent Normal distributions. Without worrying about the applicability of the test, use Wald’s 2βpopulation test with α = 0.05 to test whether the population means of X and Y are same (null) or not (alternative). Is this test applicable here?