$25
Curse of Dimensionality
Task 1: Let denote the p-dimensional hypercube of edge length d, centered at the origin.
• Assume X to be uniformly distributed in C1. Determine d in dependence of p and q ∈ [0,1], such that
Pr(X ∈ Cd) = q
holds.
• Let the components of the p-dimensional random variable Xp be independent and have the standard normal distribution. It is known that Pr(|X1| ≤ 2.576) = 0.99. For an arbitrary p, determine the probability Pr(kXpk∞ > 2.576) for any of the components of Xp to lie outside of the interval [−2.576,2.576]. Evaluate the value for p = 2, p = 3 and p = 500.
Task 2: Provide the PYTHON code to the following tasks (the code needs to be commented properly):
• Sample 100 uniformly distributed random vectors from the box [−1,1]d for d = 2.
• For each of the 100 vectors determine the minimum angle to all other vectors. Then compute the average of these minimum angles. Note that for two vectors x, y the cosine of the angle between the two vectors is defined as
.
• Repeat the above for dimensions d = 1,...,1000 and use the results to plot the average minimum angle against the dimension.
• Give an interpretation of the result. What conclusions can you draw for 2 randomly sampled vectors in a d-dimensional space?
• Does the result change if the sample size increases?
Statistical Decision Making
Task 3: Answer the following questions. All answers must be justified.
• The numbers in Figure 1 show the probability of the respective event to happen (e.g. the probability for the event X = 1 and Y = 1 is 0.02). Is this table a probability table? If so, why?
• Based on Figure 1 give the conditional expectation EY |X=2[Y ] and the probability of the event X = 1 under the condition that Y = 3.
• Is the function p(x,y) given by
(
1 for p(x,y) =
0 otherwise
a joint density function for two random variables?
• For two random variables X and Y the joint density function is given by
(2e−(x+y) for 0 ≤ x ≤ y, 0 ≤ y
p(x,y) =
0 otherwise.
What are the marginal density functions for X and Y respectively?
• Let the joint density function of two random variables X and Y be given by
for 0 < x < 3, 0 < y < 1
p x,y
0 otherwise.
Determine the probability for X ≤ 2 under the condition that .
Task 4: Show that the covariance matrix C of any random variable X ∈ Rp is symmetric positive semidefinite, i.e. C = C> and x>Cx ≥ 0 for any covariance matrix C ∈ Rp×p and any x ∈ Rp.
Figure 1: Task 3