Starting from:

$39.99

ISC 4241 – Activity #1 Solution

Team Number:
Team Captain:
Team Members:




Activity on
PART I (20 Points) Programming
Problem 1.1 (8 Points) Read the EXCLE file “COVID_08312020.csv”
Problem 1.2 (8 Points) Produce a scatter plot using “TotalCases” and “TotalDeaths” and impose a loess line on the top of the data.

Problem 1.3 (8 Points) Produce a scatter plot using “ToTCases_1M” and “TotDeath_MPOP” and impose a loess line on the top of the data.

Problem 1.4 (8 Points) Produce a table with the following summary statistic including minimum, mean, median, variance, standard deviation, maximum, and skewness for the following five variables “ToTCases_1M”, “TotDeath_MPOP”, “TotalCases”, “TotalDeaths”, and “TotalTested”. (Note: Display only three decimal place)

Problem 1.5 (8 Points) Obtain both the Spearman correlation and the Pearson correlation between the following variables “ToTCases_1M”, “TotDeath_MPOP”, “TotalCases”,
“TotalDeaths”, and “TotalTested”.



1

ISC 4241 – Activity #1
PART II (10 Points) Fill In Blank
1. Suppose that {𝑥1, 𝑥2, 𝑥3, ⋯ , 𝑥𝑛} be a set of data and 𝑥(15) = 5, 𝑥(16) = 7, and 𝑥(17) = 8, the median of this data set is if n = 30 and the median of this data is if n = 31.
2. Suppose that {𝑥1, 𝑥2, 𝑥3, ⋯ , 𝑥𝑛} be a set of data and ̅ and n = 26, the sample variance of this data set is .
3. The points at distances 1.5 times of IQR (Inner Quartile Range) from each hinge mark the fences of the data set.
4. Tom is interested in finding out the salary of students graduated from UCF in the past three years. He collected data from one thousand students graduated from UCF. The data he collected including their major, their graduation year, their gender, their salary, and their
GPA. Tom’s study is a with
observations and predictors.
5. Jennifer has a data set to perform an analysis; however, you cannot find any response variable in this set of data. The analysis performed by Jennifer should be a (supervised learning / non-supervised learning).
6. Steve fit a model on a set of data. After perform data exploration analysis, he decided to assume that the data come from normal population and the relationship between the response variable and a set of predictors should be linear. The analysis perform by Steve should be (parametric analysis / nonparametric analysis / cluster analysis).
7. Lori likes to know the relationship between a given predictor and the response variable. Lori is interested in (prediction / inference) problem.

2

More products