{ "nbformat": 4, "nbformat_minor": 0, "metadata": { "colab": { "provenance": [] }, "kernelspec": { "name": "python3", "display_name": "Python 3" }, "language_info": { "name": "python" } }, "cells": [ { "cell_type": "markdown", "source": [ "# CS640 Homework 1: ROC Analysis and Ethics\n", "\n", "In this assignment, you will answer some questions regarding ROC analysis.\n", "\n", "## Collaboration\n", "You are allowed to work in a team of at most **three** on the coding part (**Q1**), but you must run the experiments and answer written questions independently.\n", "\n", "## Instructions\n", "\n", "### General Instructions\n", "In an ipython notebook, to run code in a cell or to render [Markdown](https://en.wikipedia.org/wiki/Markdown)+[LaTeX](https://en.wikipedia.org/wiki/LaTeX) press `Ctrl+Enter` or `[>|]`(like \"play\") button above. To edit any code or text cell (double) click on its content. To change cell type, choose \"Markdown\" or \"Code\" in the drop-down menu above.\n", "\n", "Most of the written questions are followed by a cell for you enter your answers. If not, please insert one. Your answers and the questions should **not** be in the same cell.\n", "\n", "### Instructions on Math\n", "Some questions require you to enter math expressions. To enter your solutions, put down your derivations into the corresponding cells below using LaTeX. Show all steps when proving statements. If you are not familiar with LaTeX, you should look at some tutorials and at the examples listed below between \\$..\\$.\n", "\n", "Alternatively, you can scan your work from paper and insert the image(s) in a text cell.\n", "\n", "## Submission\n", "Once you are ready, save the notebook as PDF file (File -> Print -> Save as PDF) and submit to Gradescope. Make sure all outputs are visible." ], "metadata": { "id": "wWhk6f79x6MH" } }, { "cell_type": "markdown", "source": [ "## Q0: Name(s)\n", "\n", "Please write your name in the next cell. If you are collaborating with someone, please list their names as well." ], "metadata": { "id": "yH_FqNM805X5" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "ow6uTaD708Du" } }, { "cell_type": "markdown", "source": [ "## Q1: Confusion Matrix\n", "\n", "In the next cell, write code to manually compute the confusion matrix for a **binary** classification task and run the cell to test your code.\n", "\n", "The function `confusion_matrix` from sklearn is used to verify your answers, but you should **not** use it in your implemention." ], "metadata": { "id": "2OUZOOoA08ks" } }, { "cell_type": "code", "source": [ "import numpy as np\n", "from sklearn.metrics import confusion_matrix\n", "\n", "def compute_confusion_matrix(YTrue, YPredict):\n", " \"\"\"\n", " Computes the confusion matrix as a numpy matrix. For convention, the\n", " vertical axis represent YTrue while the horizontal axis represent YPredict.\n", " \"\"\"\n", " cm = np.zeros((2, 2))\n", "\n", " ############################ start of your code ############################\n", "\n", " ############################# end of your code #############################\n", "\n", " return cm\n", "\n", "is_correct = True\n", "for _ in range(10):\n", " rng = np.random.default_rng()\n", " YTrue = rng.choice([0, 1], size = 100)\n", " YPredict = rng.choice([0, 1], size = 100)\n", " if (compute_confusion_matrix(YTrue, YPredict) != confusion_matrix(YTrue, YPredict)).any():\n", " is_correct = False\n", " break\n", "print(is_correct)" ], "metadata": { "id": "V-6CofDE2Ziu" }, "execution_count": null, "outputs": [] }, { "cell_type": "markdown", "source": [ "## Q2: Metrics" ], "metadata": { "id": "ZqzxSFz29AYa" } }, { "cell_type": "markdown", "source": [ "### Q2.1\n", "Suppose that we are given the confusion matrices and the ROC curves of two models **A** and **B**.\n", "\n", "![Screenshot 2023-09-02 210029.png]()\n", "\n", "For each of the two models,\n", "\n", "1. compute true positive rate (TPR), false negative rate (FNR), false positive rate (FPR), and true negative rate (TNR); and\n", "2. compute accuracy, precision, recall and F-1 score.\n", "\n", "You can do it either by hand or by code (do **not** use existing functions). Either way, show details of your calculations in a new cell using variables TP, FN, FP, and TN. Round your answers to four digits if you choose to use decimals." ], "metadata": { "id": "hKBkTdeZ_6xz" } }, { "cell_type": "markdown", "source": [ "### Q2.2\n", "\n", "The two ROC curves are labeled as 1 and 2 in the following figure.\n", "\n", "![Screenshot 2023-09-02 183858.png]()\n", "\n", "1. Which one represents a better model? Briefly explain.\n", "2. Based on your results in Q2.1, which ROC curve is more likely to be associated with model **A** and which is more likely to be associated with model **B**? Briefly explain." ], "metadata": { "id": "6zFvugVNBIKM" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "dzbw4MD-nakF" } }, { "cell_type": "markdown", "source": [ "## Q3: Threshold\n", "\n", "Now suppose that we have a state-of-the-art model to predict earthquakes (negative vs positive) and by default the threshold value is set to 0.5. If we don't want to trigger too many false alarms, how should we tune the threshold value (i.e., should we increase it or decrease it)? Briefly explain." ], "metadata": { "id": "HpHXoQjIDNNV" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "1YCiUJIEnbLn" } }, { "cell_type": "markdown", "source": [ "## Q4: AI and Ethics\n", "\n", "For each the following three questions, provide an answer with **no more than three** sentences." ], "metadata": { "id": "FL0q1xyHv8B0" } }, { "cell_type": "markdown", "source": [ "### Q4.1\n", "\n", "Watch Joy Buolamwini's TED talk on [How I'm fighting bias in algorithms](https://www.ted.com/talks/joy_buolamwini_how_i_m_fighting_bias_in_algorithms). Joy mentions that judges use machine-generated risk scores to determine how long an individual is going to spend in prison. The state of Wisconsin uses the AI system Northpointe for making sentencing and parole decisions. Research this issue on the internet and provide one link to material you have read." ], "metadata": { "id": "SIzhe35JwS7b" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "JWQe9y3RwVtV" } }, { "cell_type": "markdown", "source": [ "### Q4.2\n", "\n", "Use some the bias terminology of Suresh and Guttag, 2021 (see lecture slides), to discuss whether you think the Northpointe system is a responsible use of AI." ], "metadata": { "id": "y8agoGL8wV5W" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "q81oP1N8waO1" } }, { "cell_type": "markdown", "source": [ "### Q4.3\n", "\n", "Research at BU and MIT by Canetti et al. ([arxiv.org/pdf/1810.02003.pdf](http://arxiv.org/pdf/1810.02003.pdf)) considers the scenario that decisions whether a prisoner should stay or leave prison can be deferred. In the publicly-available Northpointe data, decisions about Caucasian prisoners are deferred for 9% of prisoners, while decisions about African-Americans are deferred for 20% of the prisoners. Canetti et al. propose \"equalizing methods\" to change the AI system so that deferral decisions would be made at about the same rate for both racial groups. Briefly discuss what you think about adding an equalizing method to Northpointe's system. Would this yield a fair AI system? You may refer to Suresh and Guttag's terminology." ], "metadata": { "id": "Tpf2wWh6wadr" } }, { "cell_type": "markdown", "source": [], "metadata": { "id": "r_7l8dcBwtaf" } } ] }