Updated unit tests to include mean_bias_significance metric

in 19 minutes and 55 seconds, using 0 compute credits, and was queued for 1 second