SF#364 type promotion in whistogram is based upon the index, not the weight #45

perldl-bot · 2015-03-02T21:49:49Z

From http://sourceforge.net/p/pdl/bugs/364 (from @djerius)

mohawk2 · 2022-01-01T21:03:18Z

The input was never an "index", it was just input data. The operation's "type-promotion" (really, determination) was never based on only the first parameter. The output's type is written "float+", which is what happened here (the output got promoted to float), which can be seen easily by using this updated code:

use PDL;

srand(372);
my $wt = random( PDL::double, 1000 );
my $lidx = zeroes( PDL::long, 1000 );
my $fidx = zeroes( PDL::float, 1000 );
my $didx = zeroes( PDL::double, 1000 );

my $lh = whistogram( $lidx, $wt, 1, 0, 1 );
my $fh = whistogram( $fidx, $wt, 1, 0, 1 );
my $dh = whistogram( $didx, $wt, 1, 0, 1 );

my $exp = $wt->dsum;
print "$_->[0]=", $_->[1]->info, "\n" for ['lh',$lh], ['fh',$fh], ['dh',$dh], ['exp',$exp];

print "delta, long index: ", $lh - $exp, "\n";
print "delta, float index: ", $fh - $exp, "\n";
print "delta, double index: ", $dh - $exp, "\n";

You get an identical difference between a "long" input type, and a "float" input type. The difference between those and double is simply the result of the difference between single and double precision. There is no bug here.

mohawk2 · 2022-04-15T13:11:01Z

@djerius writes:

Sorry for the late reply, but I believe that there's still an issue here.

Here's the output of your script (With a slight fix on PDL 2.038 setting $exp = PDL($wt->dsum), as dsum doesn't return a piddle):

lh=PDL: Float D [1]
fh=PDL: Float D [1]
dh=PDL: Double D [1]
exp=PDL: Double D []
delta, long index: [0.00069109649]
delta, float index: [0.00069109649]
delta, double index: [0]

As you point out, the difference is due to the difference in precision between float and double, but that's the actual problem. In each case the weight parameter is double precision, but the determined type of the output histogram does not take that into account, leading to a catastrophic loss of precision.

You state "the operation's "type-promotion" (really, determination) was never based on only the first parameter", but that seems to be exactly what is happening. There are only two parameters of import, the input data and the weight, and it doesn't seem to pay attention to the latter.

mohawk2 · 2022-04-15T13:22:28Z

(With a slight fix on PDL 2.038 setting $exp = PDL($wt->dsum), as dsum doesn't return a piddle)

pdl> p sequence(2)->dsum->info
PDL: Double D []

Looks quite a lot like an ndarray (not piddle) to me?

mohawk2 · 2022-04-15T13:57:26Z

You state:

There are only two parameters of import, the input data and the weight, and it doesn't seem to pay attention to the latter.

The signature of whistogram, with two ndarray input parameters, one of which has a type-qualifier of float+:

Signature: (in(n); float+ wt(n);float+[o] hist(m); double step; double min; int msize => m)

Now, a quote from https://metacpan.org/pod/PDL::PP#Type-conversions-and-the-signature (you did look in the docs?):

As we had already seen for the int, float and double qualifiers, a pdl marked with a type+ qualifier does not influence the datatype of the pdl operation.

If you consider getting back a value that is less than double-precision a "catastrophic loss", then you'd better pass the in parameter as double-precision. This is how PDL has operated for really quite a long time (at least 1.99987, from 1998). I am open to the idea that there is indeed "an issue here", but so far I am not clever enough to see it.

zmughal added sf-import sf:critical bug labels Mar 2, 2015

zmughal added this to the PDL v2.009 milestone Mar 15, 2015

zmughal modified the milestones: PDL v2.012, PDL v2.013 Jul 29, 2015

zmughal modified the milestones: PDL v2.013, PDL v2.014 Aug 11, 2015

zmughal added sf:normal and removed sf:critical labels Aug 17, 2015

mohawk2 closed this as completed Jan 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SF#364 type promotion in whistogram is based upon the index, not the weight #45

SF#364 type promotion in whistogram is based upon the index, not the weight #45

perldl-bot commented Mar 2, 2015 •

edited by mohawk2

mohawk2 commented Jan 1, 2022

mohawk2 commented Apr 15, 2022 •

edited

mohawk2 commented Apr 15, 2022

mohawk2 commented Apr 15, 2022

SF#364 type promotion in whistogram is based upon the index, not the weight #45

SF#364 type promotion in whistogram is based upon the index, not the weight #45

Comments

perldl-bot commented Mar 2, 2015 • edited by mohawk2

mohawk2 commented Jan 1, 2022

mohawk2 commented Apr 15, 2022 • edited

mohawk2 commented Apr 15, 2022

mohawk2 commented Apr 15, 2022

perldl-bot commented Mar 2, 2015 •

edited by mohawk2

mohawk2 commented Apr 15, 2022 •

edited