add basic stat metric, and model performance report #136
Conversation
# Model Performance Metrics
def multicategory_confusion_matrix(
It would be good to add an example in the docstring for reference; I had to look at the test to fully understand it.
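For reference, a docstring example along these lines could be what's being asked for here; the exact return format is an assumption, inferred from the parameter descriptions later in this review (rows are true labels, columns are predicted labels):

```python
# A hypothetical docstring example -- the return format shown is an assumption,
# based on ":return: Matrix JSON where rows refer to true labels, and columns
# refer to predicted labels" later in this review.
#
#     >>> import pandas as pd
#     >>> true_labels = pd.Series(["cat", "dog", "cat", "bird"])
#     >>> predicted_labels = pd.Series(["cat", "cat", "dog", "bird"])
#     >>> multicategory_confusion_matrix(true_labels, predicted_labels, ["bird", "cat", "dog"])
#     {"bird": {"bird": 1, "cat": 0, "dog": 0},
#      "cat": {"bird": 0, "cat": 1, "dog": 1},
#      "dog": {"bird": 0, "cat": 1, "dog": 0}}
```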
src/smclarify/bias/report.py
label_name: str,
metrics: List[MetricResult],
binary_confusion_matrix: List[float],
confusion_matrix: Optional[Dict] = None,
Needs a better name. Is this the multiclass confusion matrix?
I agree this is not a good name, but changing it would also change the output JSON key, so I'll keep them consistent and clarify this in the docstring.
assert label_dist == basic_stats.observed_label_distribution(dfB[0], sensitive_facet_index, dfB_pos_label_idx)


def test_performance_metrics():
Could we also add a test which computes these metrics directly from dfBinary and dfMulticategory?
Per offline discussion, I need to test that the right number of TP/TN/FP/FN is calculated from the df directly, so I moved that logic to a helper function.
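A minimal sketch of what such a helper could look like, assuming binary labels held in pandas Series columns of the test df (the function name and signature here are hypothetical, not the ones in this PR):

```python
import pandas as pd


def count_binary_outcomes(label_series: pd.Series, predicted_label_series: pd.Series, positive_label) -> dict:
    """Count TP/TN/FP/FN directly from the true and predicted label columns."""
    actual_pos = label_series == positive_label
    predicted_pos = predicted_label_series == positive_label
    return {
        "TP": int((actual_pos & predicted_pos).sum()),
        "TN": int((~actual_pos & ~predicted_pos).sum()),
        "FP": int((~actual_pos & predicted_pos).sum()),
        "FN": int((actual_pos & ~predicted_pos).sum()),
    }


# Example: compute the counts directly from a small test DataFrame
df = pd.DataFrame({"y": [1, 0, 1, 1, 0], "yhat": [1, 0, 0, 1, 1]})
assert count_binary_outcomes(df["y"], df["yhat"], positive_label=1) == {"TP": 2, "TN": 1, "FP": 1, "FN": 1}
```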
:param label_series: Label Data Series
:param predicted_label_series: Predicted Label Data Series
:param unique_label_values: List of unique label values computed from the set of true and predicted labels
:return: Matrix JSON where rows refer to true labels, and columns refer to predicted labels
What are the values in each column? Are they normalized? Looking at the code, it does not look like the values are normalized, but in confusion_matrix we normalize them. Why the difference?
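To make the raw-versus-normalized distinction being asked about concrete, here is a rough sketch; this is not the PR's actual implementation, and the function names are illustrative only:

```python
import pandas as pd


def raw_confusion_counts(label_series: pd.Series, predicted_label_series: pd.Series, unique_label_values) -> dict:
    """Unnormalized matrix: entry [true][pred] is the number of rows with that label pair."""
    matrix = {t: {p: 0 for p in unique_label_values} for t in unique_label_values}
    for true_val, pred_val in zip(label_series, predicted_label_series):
        matrix[true_val][pred_val] += 1
    return matrix


def normalized_confusion_counts(label_series: pd.Series, predicted_label_series: pd.Series, unique_label_values) -> dict:
    """Normalized matrix: each entry is divided by the total number of rows."""
    total = len(label_series)
    counts = raw_confusion_counts(label_series, predicted_label_series, unique_label_values)
    return {t: {p: c / total for p, c in row.items()} for t, row in counts.items()}
```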
)
df.columns = ["x", "y", "z", "yhat"]

expected_value = {
Same comment as the previous one: red is not found in the observed labels.
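For context, the docstring earlier in this review says the unique label values are computed from the set of true and predicted labels, so a label such as red that appears only among the predictions would still show up in the matrix; a quick illustration with made-up data (only the y and yhat column names are borrowed from the test above):

```python
import pandas as pd

# Hypothetical data: "red" never occurs among the observed (true) labels,
# but does occur among the predicted labels.
df = pd.DataFrame({"y": ["blue", "green", "blue"], "yhat": ["blue", "red", "green"]})

# Union of true and predicted labels, per the docstring quoted earlier
unique_label_values = sorted(set(df["y"]) | set(df["yhat"]))
print(unique_label_values)  # ['blue', 'green', 'red']
```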
Issue #, if available:
Cherry-picked Pranav's changes and fixed a typo.
Description of changes:
See details in #135
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.