Mathematics, 21.03.2020 10:58 plumagirl

The file CommunityCrime.csv is a dataset containing 319 observations on 123 variables. The observations are communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from the 1995 FBI Uniform Crime Reporting program. A detailed description of all variables is available at http://archive.ics.uci.edu/ml/machine-learning-databases/communities/communities.names. We seek to predict the variable ViolentCrimesPerPop, the total number of violent crimes per 100,000 people.

Note: when asked to perform cross validation to select a tuning parameter, be sure to conduct this cross validation on the training data only, then see how well your cross validated tuning parameter does on the test data.
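
The note above can be sketched as follows. This is a minimal illustration, not a solution to the exercise: it uses synthetic data in place of CommunityCrime.csv, a closed-form ridge fit written in NumPy, 10 folds, and an arbitrary λ grid. The helper names (ridge_fit, mse, cv_select_lambda) are hypothetical. The key point is that the folds are drawn from the training rows only, and the test rows are touched once, at the end.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge on centered data; the intercept is not penalized."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X.shape[1]), Xc.T @ yc)
    return beta, y_mean - x_mean @ beta

def mse(X, y, beta, intercept):
    """Mean squared prediction error."""
    return float(np.mean((y - (X @ beta + intercept)) ** 2))

def cv_select_lambda(X, y, lambdas, k=10, seed=1):
    """Return the lambda with the lowest mean K-fold validation MSE."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    scores = []
    for lam in lambdas:
        fold_mse = []
        for i in range(k):
            val = folds[i]
            fit = np.concatenate([folds[j] for j in range(k) if j != i])
            beta, b0 = ridge_fit(X[fit], y[fit], lam)
            fold_mse.append(mse(X[val], y[val], beta, b0))
        scores.append(np.mean(fold_mse))
    return lambdas[int(np.argmin(scores))]

# Synthetic stand-in for the real data (assumption: 319 rows, 20 predictors).
rng = np.random.default_rng(1)
n, p = 319, 20
X = rng.normal(size=(n, p))
y = X[:, :3] @ np.array([1.0, -0.5, 0.25]) + rng.normal(scale=0.5, size=n)

# 90% training / 10% test split.
perm = rng.permutation(n)
n_train = int(0.9 * n)
train, test = perm[:n_train], perm[n_train:]

# Cross-validate on the training rows only, then score once on the test rows.
lambdas = np.logspace(-3, 3, 25)
best_lam = cv_select_lambda(X[train], y[train], lambdas)
beta, b0 = ridge_fit(X[train], y[train], best_lam)
test_mse = mse(X[test], y[test], beta, b0)

# Least-squares baseline: the same fit with lambda = 0.
ols_beta, ols_b0 = ridge_fit(X[train], y[train], 0.0)
ols_test_mse = mse(X[test], y[test], ols_beta, ols_b0)

print(f"lambda chosen by CV: {best_lam:.4g}")
print(f"ridge test MSE: {test_mse:.4f}, least-squares test MSE: {ols_test_mse:.4f}")
```

Setting λ to 0 in the same function gives the least-squares fit, so one helper covers both the baseline and the penalized model. The numbers printed here describe the synthetic data only and say nothing about the crime dataset.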

(a). Set a seed of 1 and split the data into a 90% training set and a 10% test set.

(b). Fit a linear model using least squares on the training set. Report the test error obtained.

(c). Fit a ridge regression model on the training set with λ chosen by cross-validation. Report the test error obtained.

(d). Fit a lasso model on the training set with λ chosen by cross-validation. Report the test error obtained, along with the number of non-zero coefficient estimates.

(e). Fit a PCR model on the training set with M chosen by cross-validation. Report the test error obtained, along with the value of M selected by cross-validation.

(f). Fit a PLS model on the training set with M chosen by cross-validation. Report the test error obtained, along with the value of M selected by cross-validation.

(g). Comment on the above parts and how well you believe we can predict violent crime rate using these methods.
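
For part (d), one way to see why the lasso yields a count of non-zero coefficients is a small coordinate-descent sketch. This is an illustration under stated assumptions, not the method the exercise prescribes: the data are synthetic, the function names are hypothetical, the objective is taken as (1/(2n))·||y − Xβ||² + λ·||β||₁ on centered data, and λ is fixed at an arbitrary 0.1 here, whereas the exercise asks for λ chosen by cross-validation on the training set.

```python
import numpy as np

def soft_threshold(value, threshold):
    """Shrink value toward zero; anything within [-threshold, threshold] becomes 0."""
    return np.sign(value) * max(abs(value) - threshold, 0.0)

def lasso_coordinate_descent(X, y, lam, n_sweeps=200):
    """Minimize (1/(2n))*||y - X b||^2 + lam*||b||_1 for centered X and y."""
    n, p = X.shape
    beta = np.zeros(p)
    col_norm = (X ** 2).sum(axis=0) / n
    residual = y.copy()  # equals y - X @ beta throughout
    for _ in range(n_sweeps):
        for j in range(p):
            # Correlation of column j with the residual that excludes j's own term.
            rho = X[:, j] @ residual / n + col_norm[j] * beta[j]
            new_bj = soft_threshold(rho, lam) / col_norm[j]
            residual += X[:, j] * (beta[j] - new_bj)
            beta[j] = new_bj
    return beta

# Synthetic stand-in: 30 predictors, only the first 3 truly matter (assumption).
rng = np.random.default_rng(1)
n, p = 319, 30
X = rng.normal(size=(n, p))
true_beta = np.zeros(p)
true_beta[:3] = [2.0, -1.5, 1.0]
y = X @ true_beta + rng.normal(scale=0.5, size=n)

# 90% training / 10% test split.
perm = rng.permutation(n)
train, test = perm[:287], perm[287:]

# Center with training-set means only, then fit on the training rows.
x_mean, y_mean = X[train].mean(axis=0), y[train].mean()
beta = lasso_coordinate_descent(X[train] - x_mean, y[train] - y_mean, lam=0.1)

pred = (X[test] - x_mean) @ beta + y_mean
test_mse = float(np.mean((y[test] - pred) ** 2))
n_nonzero = int(np.count_nonzero(beta))
print(f"test MSE: {test_mse:.4f}, non-zero coefficients: {n_nonzero} of {p}")
```

The soft-thresholding step sets a coefficient to exactly zero whenever its partial correlation with the residual is at most λ in absolute value, which is what makes the non-zero count meaningful. Ridge shrinks coefficients but does not zero them out, so the count is only reported for the lasso.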
