Count by class is wrong when input array contains values that are not number / non finite number

Consider the following code:

```
let discr = require("statsbreaks")

let data = [1, 1, 2, 2, 3, 3, 3, 3, 4, 5, 5, 5, 6, 7, 8, 'foo', -Infinity, NaN]
let series = new discr.JenksClassifier(data, 2);
let bks = series.classify(3);
let count = series.countByClass();
```

I think `count` should be `[8, 5, 2]` (as if we used `[1, 1, 2, 2, 3, 3, 3, 3, 4, 5, 5, 5, 6, 7, 8]` as input array) instead of `[9, 5, 2, NaN]`.

The breaks returned are correct (because the input array is filtered in the inner classification function) but in Classifier classes we store the input array before it is filtered : https://github.com/riatelab/statsbreaks/blob/c016c68e99234fa1b5b20e95f9ba5194d6c9afb5/src/classifier.js#L25

A quick fix is simply to store the filtered input array in the line of code shown below (but we'll be redoing this filtering for nothing in the internal classification function).

A better fix might be to avoid doing this filtering twice (and to avoid creating too many new arrays, since doing `array.filter(/* some code */).map(/* some code */)` creates two new arrays). However, in most cases this shouldn't make any noticeable difference to performance.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Count by class is wrong when input array contains values that are not number / non finite number #41

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Count by class is wrong when input array contains values that are not number / non finite number #41

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions