
torch.mode when input has nans #46225

Open
Tracked by #61417
nikhilmishra000 opened this issue Oct 13, 2020 · 2 comments
Labels
module: docs (Related to our documentation, both in docs/ and docblocks)
module: NaNs and Infs (Problems related to NaN and Inf handling in floating point)
module: numpy (Related to numpy support, and also numpy compatibility of our operators)
module: reductions
module: sorting and selection
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

nikhilmishra000 commented Oct 13, 2020

🐛 Bug

torch.mode has inconsistent behavior when the input contains NaNs:

  • The torch docs do not say what the NaN policy is, whereas the scipy equivalent (scipy.stats.mode) lets the user choose via its nan_policy argument (see the sketch after this list)
  • On CPU, torch.mode behaves like scipy's nan_policy="omit"
  • On CUDA, it returns a nonsensical result
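
For reference, a minimal sketch of the scipy behavior mentioned above; it only relies on scipy.stats.mode and its documented nan_policy argument ('propagate', 'omit', 'raise'), for which torch.mode has no equivalent:

import numpy as np
from scipy import stats

a = np.array([1.0, 1.0, 2.0, np.nan])

# nan_policy='omit' drops NaNs before computing the mode,
# which is what torch.mode currently does on CPU.
print(stats.mode(a, nan_policy='omit'))

# The default, nan_policy='propagate', lets NaNs influence the result
# instead (the exact output depends on the scipy version).
print(stats.mode(a, nan_policy='propagate'))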

To Reproduce

import numpy as np
import torch

def test(device):
    # 1000 integers drawn uniformly from {0, ..., 4}.
    x = torch.rand(1000).mul(5).long().to(device)
    # Values sorted by frequency, most common first.
    s = torch.bincount(x, minlength=5).argsort(descending=True)

    mode = x.mode().values
    print(f'w/o nans, got {mode}, expected {s[0]}')

    # Replace every occurrence of the mode with NaN; if NaNs are omitted,
    # the second most common value should now be the mode.
    y = x.clone().float()
    y[y == mode] = np.nan
    mode = y.mode().values.long()
    print(f'w nans, got {mode}, expected {s[1]}')

When running test("cpu"), both lines always give the expected result:

In [17]: test('cpu')
w/o nans, got 3, expected 3
w nans, got 0, expected 0

whereas when running test("cuda"), the first line always gives the expected result, but the second line gives something seemingly random:

In [26]: test('cuda')
w/o nans, got 2, expected 2
w nans, got 4, expected 0

Expected behavior

Environment

Output of collect_env.py:

Collecting environment information...
PyTorch version: 1.6.0
Is debug build: False
CUDA used to build PyTorch: 10.2
ROCM used to build PyTorch: N/A

OS: Ubuntu 18.04.3 LTS (x86_64)
GCC version: (Ubuntu 7.5.0-3ubuntu1~18.04) 7.5.0
Clang version: Could not collect
CMake version: version 3.10.2

Python version: 3.7 (64-bit runtime)
Is CUDA available: True
CUDA runtime version: 10.0.130
GPU models and configuration:
GPU 0: TITAN RTX
GPU 1: TITAN RTX
GPU 2: GeForce RTX 2080 Ti
GPU 3: GeForce GTX 1080 Ti

Nvidia driver version: 440.64.00
cuDNN version: /usr/lib/x86_64-linux-gnu/libcudnn.so.7.6.5
HIP runtime version: N/A
MIOpen runtime version: N/A

Versions of relevant libraries:
[pip3] msgpack-numpy==0.4.4.3
[pip3] numpy==1.16.1
[pip3] numpy-quaternion==2020.10.2.17.17.31
[pip3] numpy-stl==2.10.1
[pip3] torch==1.6.0
[pip3] torchvision==0.6.0
[conda] msgpack-numpy             0.4.4.3                  pypi_0    pypi
[conda] numpy                     1.16.1                   pypi_0    pypi
[conda] numpy-quaternion          2020.10.2.17.17.31          pypi_0    pypi
[conda] numpy-stl                 2.10.1                   pypi_0    pypi
[conda] torch                     1.6.0                    pypi_0    pypi
[conda] torchvision               0.6.0                    pypi_0    pypi

cc @brianjo @mruberry @rgommers @heitorschueroff @ezyang @gchanan @zou3519 @bdhirsh @ejguan @jlin27

@bdhirsh added the high priority, module: docs, module: numpy, and shadow review labels Oct 13, 2020
bdhirsh (Contributor) commented Oct 13, 2020

Tested this on latest master and confirmed that this is still occurring (in particular, the results on CPU and CUDA differ).

@mruberry added the module: sorting and selection and module: NaNs and Infs labels Oct 13, 2020
mruberry (Collaborator) commented

Thanks for reporting this issue, @nikhilmishra000, we should definitely fix this and would take a PR updating our behavior. We should propagate NaNs in this case and note that it's a BC-breaking change to do so.
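
For illustration, a minimal sketch of what the NaN-propagating behavior suggested above could look like, assuming parity with reductions such as torch.max that already propagate NaN (this is not an existing torch.mode option):

import torch

x = torch.tensor([1.0, 1.0, 2.0, float('nan')])

# Existing reductions already propagate NaN:
print(torch.max(x))  # tensor(nan)

# Today this prints tensor(1.) on CPU (NaN is ignored, per the report above)
# and an arbitrary value on CUDA; under the proposed behavior it would
# print tensor(nan) on both devices.
print(x.mode().values)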

@ezyang removed the shadow review label Oct 13, 2020
@ezyang added the triaged label Oct 20, 2020