

Machine Learning, 2022S: HW6
CS XXXXXXXXXX Machine Learning: Homework 6
Spring 2022
Due: Sunday, May 1, 2022 (End of day)
We are coming back to the dataset that we used in homework 5. Namely, we will be using a UCI
simulated electrical grid stability data set that is available here:
https://archive.ics.uci.edu/ml/datasets/Electrical+Grid+Stability+Simulated+Data+.
This dataset has 10,000 examples using 11 real-valued attributes, with a binary target (stable vs.
unstable). The target value that you are predicting is the last column in the dataset.
Remark 1 (Cross-Entropy). For the cross-entropy values that you want to report in the questions
below, please use the following formula (empirical risk using the cross-entropy loss):
\hat{R}_S(h, c) = -\frac{1}{m} \sum_{i=1}^{m} \big[ y_i \ln(p_i) + (1 - y_i) \ln(1 - p_i) \big],
where
m is the size of the sample S on which we evaluate our hypothesis h,
y_i ∈ {0, 1} is the true label c(x_i) of the instance x_i, and
p_i is the probability that our hypothesis assigns the positive label to instance x_i.
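As a minimal illustration (a hypothetical helper, not part of the handout), the empirical risk above can be computed from a vector of predicted probabilities in a few lines of NumPy:

```python
import numpy as np

def cross_entropy(y_true, p_pred, eps=1e-15):
    """Empirical risk under the cross-entropy loss from Remark 1.

    y_true: true labels y_i in {0, 1}
    p_pred: predicted probabilities p_i of the positive class
    eps:    clipping constant to avoid ln(0); an implementation detail,
            not part of the formula in the handout
    """
    y = np.asarray(y_true, dtype=float)
    p = np.clip(np.asarray(p_pred, dtype=float), eps, 1 - eps)
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
```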
Exercise 1 – Preprocessing (10 points). You have already done this part in homework 4. However, since you may need to refresh your memory of what you did, this part is worth a few points.
(a) Remove columns 5 and 13 (labeled p1 and stab); p1 is non-predictive and stab is a target column that is exactly correlated with the binary target you are trying to predict (if this column is negative, the system is stable).
(b) Change the target variable to a number. If the value is stable, change it to 1, and if the value
is unstable, change it to 0.
(c) Remove 20% of the examples and keep them for testing. You may assume that all examples are
independent, so it does not matter which 20% you remove. However, the testing data should
not be used until after a model has been selected.
(d) Split the remaining examples into training (75%) and validation (25%). Thus, you will train
with 60% of the full dataset (75% of 80%) and validate with 20% of the full dataset (25% of
80%).
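A minimal sketch of steps (a)-(d), assuming the CSV downloaded from the UCI page is saved as Data_for_UCI_named.csv and uses the column labels p1, stab, and stabf (check these against your copy):

```python
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv("Data_for_UCI_named.csv")

# (a) Drop the non-predictive p1 column and the stab column (co-linear with the target).
df = df.drop(columns=["p1", "stab"])

# (b) Encode the target: stable -> 1, unstable -> 0.
df["stabf"] = (df["stabf"] == "stable").astype(int)

X, y = df.drop(columns=["stabf"]), df["stabf"]

# (c) Hold out 20% for testing; it is not used again until a model has been selected.
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.20, random_state=0)

# (d) Split the remaining 80% into 75% training / 25% validation (60% / 20% of the full data).
X_train, X_val, y_train, y_val = train_test_split(X_rest, y_rest, test_size=0.25, random_state=0)
```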
Exercise 2 – Artificial Neural Network (20 points). You may use
sklearn.neural_network.MLPClassifier.
(a) Fit an artificial neural network to the training data using 1 hidden layer of 20 units as well as
another neural network that has 2 hidden layers of 10 units each.
(b) For each model made in (a), make a probabilistic prediction for each validation example. Report the cross-entropies between the predictions and the true labels in your writeup.
(c) Which neural network performs better on the validation data? Report this in your writeup. Train a new neural network, using the better of the two architectures, on the combined training and validation data. Make a probabilistic prediction for each testing example using this model and save the predictions for later.
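A sketch of parts (a) and (b), reusing the split (X_train, y_train, X_val, y_val) and the cross_entropy helper sketched earlier (both hypothetical names); max_iter and random_state are illustrative choices, not requirements of the handout:

```python
from sklearn.neural_network import MLPClassifier

# (a) Two candidate architectures: one hidden layer of 20 units,
#     and two hidden layers of 10 units each.
nets = {
    "1 x 20": MLPClassifier(hidden_layer_sizes=(20,), max_iter=1000, random_state=0),
    "2 x 10": MLPClassifier(hidden_layer_sizes=(10, 10), max_iter=1000, random_state=0),
}

# (b) Probabilistic predictions on the validation set and their cross-entropies.
for name, net in nets.items():
    net.fit(X_train, y_train)
    p_val = net.predict_proba(X_val)[:, 1]   # probability of the positive (stable) class
    print(name, cross_entropy(y_val, p_val))
```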
Exercise 3 – Decision Trees (20 points). For this problem you can use the scikit-learn method
sklearn.tree.DecisionTreeClassifier.
(a) Fit a decision tree to the training data using the Gini impurity index and max tree depth of 5.
(b) Using the model created in part (a), make a probabilistic prediction for each validation example. What is the cross-entropy between these predictions and the true labels? Put this value in your writeup.
(c) Fit a decision tree to the training data using information gain and max tree depth of 5.
(d) Using the model created in part (c), make a probabilistic prediction for each validation example. What is the cross-entropy between these predictions and the true labels? Put this value in your writeup.
(e) Which model performed better on the validation data? Report this in your writeup. Train a new decision tree on the training and validation data using whichever measure created the best model in (a)-(d), with a max tree depth of 5. Make a probabilistic prediction for each testing example and save the predictions for later.
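A sketch of parts (a)-(d) under the same assumptions as above; in scikit-learn, the Gini impurity index and information gain correspond to criterion="gini" and criterion="entropy" respectively:

```python
from sklearn.tree import DecisionTreeClassifier

# (a) and (c): one tree per splitting measure, both with max depth 5.
trees = {
    "gini": DecisionTreeClassifier(criterion="gini", max_depth=5, random_state=0),
    "entropy": DecisionTreeClassifier(criterion="entropy", max_depth=5, random_state=0),
}

# (b) and (d): validation cross-entropy for each measure.
for name, tree in trees.items():
    tree.fit(X_train, y_train)
    p_val = tree.predict_proba(X_val)[:, 1]
    print(name, cross_entropy(y_val, p_val))
```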
Exercise 4 – Boosting (20 points). For this problem you may use
sklearn.ensemble.AdaBoostClassifier.
(a) Fit boosted decision stumps (max tree depth of 1) to the training data allowing at most 20, 40,
and 80 decision stumps (base estimators) in each model.
(b) For each model trained in (a), make a probabilistic prediction for each validation example.
Report the cross-entropies between the predictions and the true labels in your writeup.
(c) Which upper bound on the number of allowed base classifiers generates the best-performing model? Report this in your writeup. Train a new AdaBoost classifier with this bound on the maximum number of allowed base classifiers, using the training and validation data. Make a probabilistic prediction for each testing example using this model and save the predictions for later.
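A sketch of parts (a) and (b) under the same assumptions; note that AdaBoostClassifier's default base estimator is already a depth-1 decision tree (a stump), and the keyword for setting it explicitly differs across scikit-learn versions:

```python
from sklearn.ensemble import AdaBoostClassifier

# (a) Boosted decision stumps with at most 20, 40, and 80 base estimators.
for n in (20, 40, 80):
    ada = AdaBoostClassifier(n_estimators=n, random_state=0).fit(X_train, y_train)
    # (b) Validation cross-entropy for this bound on the number of stumps.
    p_val = ada.predict_proba(X_val)[:, 1]
    print(n, cross_entropy(y_val, p_val))
```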
Exercise 5 – ROC Curve (30 points). For this exercise you must write your own code; no
scikit-learn, except maybe to compute AUC.
For each model produced in Exercises 2-4 do the following:
(a) Determinize the testing predictions made above, using 1001 different probability thresholds (0.000, 0.001, 0.002, ..., 0.999, 1.000). “Determinization” means converting the probability to a deterministic class label (0 or 1). Use equation (1) below for determinization, where p^* is the critical threshold, p_i is the predicted probability for example i, and P_i is the resulting deterministic prediction:

P_i = \begin{cases} 1, & \text{if } p_i \ge p^*, \\ 0, & \text{otherwise} \end{cases} \qquad (1)
(b) At each of the 1001 probability thresholds, compute the true positive rate (TPR) and false
positive rate (FPR). Recall that these values are easily computed from the confusion matrix.
(You would have to re-calculate the confusion matrix for each one of these thresholds, for each
model.)
(c) Plot the ROC (receiver operating characteristic) curve, using the 1001 points created in part (b). If you have forgotten what a ROC curve looks like, see our notes on model evaluation. The
ROC curve must contain a point at the bottom left (0, 0) and top right (1, 1). Also, it must
contain the dashed grey line, indicating the performance of a random predictor. Include the
ROC curve for each model in your write-up.
(d) Find the probability threshold yielding the highest Youden index (TPR − FPR). Report the Youden index and the corresponding probability threshold for each model.
(e) Compute the AUC (area under the curve) for each model. You may use the function
sklearn.metrics.roc_auc_score for this part.
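A sketch of parts (a)-(e) for one model, assuming y_test holds the held-out labels and p_test the model's saved test-set probabilities (hypothetical names); only the AUC uses scikit-learn, as the handout permits:

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import roc_auc_score

def roc_points(y_true, p_pred):
    """TPR and FPR at the 1001 thresholds 0.000, 0.001, ..., 1.000 (parts (a) and (b))."""
    y = np.asarray(y_true)
    p = np.asarray(p_pred)
    thresholds = np.linspace(0.0, 1.0, 1001)
    tpr, fpr = [], []
    for t in thresholds:
        pred = (p >= t).astype(int)              # determinization, equation (1)
        tp = np.sum((pred == 1) & (y == 1))
        fp = np.sum((pred == 1) & (y == 0))
        fn = np.sum((pred == 0) & (y == 1))
        tn = np.sum((pred == 0) & (y == 0))
        tpr.append(tp / (tp + fn))
        fpr.append(fp / (fp + tn))
    return thresholds, np.array(tpr), np.array(fpr)

thresholds, tpr, fpr = roc_points(y_test, p_test)

# (c) ROC curve with the dashed grey random-predictor diagonal.
plt.plot(fpr, tpr)
plt.plot([0, 1], [0, 1], linestyle="--", color="grey")
plt.xlabel("False positive rate (FPR)")
plt.ylabel("True positive rate (TPR)")
plt.show()

# (d) Threshold with the highest Youden index (TPR - FPR).
best = np.argmax(tpr - fpr)
print("Youden index:", tpr[best] - fpr[best], "at threshold", thresholds[best])

# (e) AUC, computed with scikit-learn as permitted.
print("AUC:", roc_auc_score(y_test, p_test))
```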

Solution

AUC-ROC Curve – The Star Performer!
You’ve built your machine learning model – so what’s next? You need to evaluate it and validate how good (or bad) it is, so you can then decide on whether to implement it. That’s where the AUC-ROC curve comes in.
The name might be a mouthful, but it is just saying that we are calculating the “Area Under the Curve” (AUC) of the “Receiver Operating Characteristic” (ROC). Confused? I feel you! I have been in your shoes. But don’t worry, we will see what these terms mean in detail and everything will be a piece of cake!
For now, just know that the AUC-ROC curve helps us visualize how well our machine learning classifier is performing. Although it works for only binary classification problems, we will see towards the end how we can extend it to evaluate multi-class classification problems too.
We’ll cover topics like sensitivity and specificity as well since these are key topics behind the AUC-ROC curve.
I suggest going through the article on Confusion Matrix as it will introduce some important terms which we will be using in this article.
Table of Contents
· What are Sensitivity and Specificity?
· Probability of Predictions
· What is the AUC-ROC Curve?
· How Does the AUC-ROC Curve Work?
· AUC-ROC in Python
· AUC-ROC for Multi-Class Classification
 
What are Sensitivity and Specificity?
This is what a confusion matrix looks like (rows are actual classes, columns are predicted classes):

                    Predicted Positive     Predicted Negative
Actual Positive     True Positive (TP)     False Negative (FN)
Actual Negative     False Positive (FP)    True Negative (TN)
From the confusion matrix, we can derive some important metrics that were not discussed in the previous article. Let’s talk about them here.
 
Sensitivity / True Positive Rate / Recall
Sensitivity tells us what proportion of the positive class got correctly classified.
A simple example would be to determine what proportion of the actual sick people were correctly detected by the model.
 
False Negative Rate
False Negative Rate (FNR) tells us what proportion of the positive class got incorrectly classified by the classifier.
A higher TPR and a lower FNR are desirable, since we want to correctly classify the positive class.
 
Specificity / True Negative Rate
Specificity tells us what proportion of the negative class got correctly classified.
Taking the same example as in Sensitivity, Specificity would mean determining the...
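As a small illustration (hypothetical counts, not taken from the dataset), these rates follow directly from the four confusion-matrix cells:

```python
# Hypothetical confusion-matrix counts: TP, FN, FP, TN.
tp, fn, fp, tn = 40, 10, 5, 45

sensitivity = tp / (tp + fn)   # TPR / recall: proportion of positives correctly classified
fnr = fn / (tp + fn)           # false negative rate: proportion of positives missed
specificity = tn / (tn + fp)   # TNR: proportion of negatives correctly classified

print(sensitivity, fnr, specificity)
```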