bias_variance_decomp bug: numpy.zeros truncating predictions #743

Closed
johnnybarrels opened this issue Oct 18, 2020 · 1 comment · Fixed by #749

@johnnybarrels

Bug description

The predictions matrix all_pred, initialised via np.zeros(..., dtype=np.int) on line 73 of bias_variance_decomp(), truncates predictions by casting them to integers:

all_pred = np.zeros((num_rounds, y_test.shape[0]), dtype=np.int)

Example of numpy behaviour causing the issue:

import numpy as np
np.__version__  # 1.19.2 (current latest)

all_pred = np.zeros((2, 3), dtype=np.int)
all_pred[0] = [0.25, 0.5, 0.75]  # float assignments are silently truncated to int
all_pred[1] = [1.3, 1.6, 1.9]
all_pred
array([[0, 0, 0],
       [1, 1, 1]])

This causes wildly inaccurate results when the target variable is small, since predictions are truncated to integers. Regardless of scale, casting predictions to integers doesn't strike me as a desired feature of the bias_variance_decomp() function.
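For comparison, here is the same toy example if dtype=np.int is simply dropped so that np.zeros falls back to its default float64 dtype (the change whose results are shown further below):

import numpy as np

all_pred = np.zeros((2, 3))  # default dtype is float64, so predictions are stored exactly
all_pred[0] = [0.25, 0.5, 0.75]
all_pred[1] = [1.3, 1.6, 1.9]
all_pred
# array([[0.25, 0.5 , 0.75],
#        [1.3 , 1.6 , 1.9 ]])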

See this gist for a full reproducible example of this, but below are the differences in results in a regression case with a small target variable:

Unchanged function results:

print(avg_expected_loss)
print(avg_bias)
print(avg_var)
0.2826888888888888
0.2698977777777778
0.012791111111111112

Results after removing dtype=np.int from np.zeros() in all_pred initialisation:

print(avg_expected_loss)
print(avg_bias)
print(avg_var)
0.039183805200284395
0.03825420409046315
0.0009296011098212146

Steps/Code to Reproduce

See this gist.
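The gist itself is not reproduced here; as an illustration only, a minimal reproduction along the following lines (with a made-up small-valued regression target and model, not the exact data from the gist) shows the effect:

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor
from mlxtend.evaluate import bias_variance_decomp

# Illustrative data only: any regression target with values well below 1
# makes the integer truncation obvious.
rng = np.random.RandomState(123)
X = rng.rand(500, 3)
y = 0.3 * X[:, 0] + 0.1 * rng.rand(500)  # targets roughly in [0, 0.4]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=123)

avg_expected_loss, avg_bias, avg_var = bias_variance_decomp(
    DecisionTreeRegressor(random_state=123),
    X_train, y_train, X_test, y_test,
    loss='mse', num_rounds=50, random_seed=123)

print(avg_expected_loss)
print(avg_bias)
print(avg_var)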

Versions

MLxtend 0.17.3
macOS-10.15.6-x86_64-i386-64bit
Python 3.8.3 (v3.8.3:6f8c8320e9, May 13 2020, 16:29:34)
[Clang 6.0 (clang-600.0.57)]
Scikit-learn 0.23.2
NumPy 1.19.2
SciPy 1.5.2

@rasbt (Owner) commented Nov 10, 2020

Wow, good catch. Yeah, the examples and unit tests for the MSE loss were all with relatively large numbers so I didn't notice that. That's going to be fixed via #749. Many thanks.
