normal v2 pretrained model not working #53

Open
yuyingyeh opened this issue Jul 19, 2023 · 6 comments

@yuyingyeh

Hi! I have tried your code to predict surface normals using the latest v2 model checkpoint, but the outputs are all NaN. I have tried both the model downloaded by the script below and the one from Google Drive.
sh ./tools/download_surface_normal_models.sh

The results can be reproduced with this command:
python demo.py --task normal --img_path assets/demo/test1.png --output_path assets/

I have uncommented the lines below to use the v1 model instead and there is no issue. Could you check the released v2 weights? Thank you!

# pretrained_weights_path = root_dir + 'omnidata_unet_normal_v1.pth'
# model = UNet(in_channels=3, out_channels=3)
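
A quick way to tell whether the NaNs come from the released checkpoint itself or only appear during the forward pass is to inspect the downloaded weights directly. A minimal sketch (the checkpoint path is an assumption based on the download script; adjust it to wherever the weights were saved):

import torch

# Assumed location of the v2 normal checkpoint fetched by download_surface_normal_models.sh
ckpt_path = './pretrained_models/omnidata_dpt_normal_v2.ckpt'

# Load on CPU so the check is independent of any GPU or driver issue
checkpoint = torch.load(ckpt_path, map_location='cpu')
state_dict = checkpoint['state_dict'] if isinstance(checkpoint, dict) and 'state_dict' in checkpoint else checkpoint

# If any weight tensor already contains NaN, the released file is broken;
# otherwise the NaNs are produced at inference time
bad = [k for k, v in state_dict.items() if torch.is_tensor(v) and torch.isnan(v).any()]
print('tensors containing NaN:', bad if bad else 'none')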

@jens-nau

jens-nau commented Jul 21, 2023

I have the same problem. Loading the model to the CPU instead of the GPU seems to work, but is very slow.
After some testing, I found that the problem only seems to occur on GPUs with certain, perhaps older, architectures. On my Ampere-based RTX 3080 everything works fine, but running the same code on a Pascal-based GTX 1050 Ti produces NaN predictions.
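
For reference, a minimal sketch of that workaround (illustrative only; the model and input variable names are placeholders, not the actual demo.py code): run the forward pass on the GPU first and fall back to the CPU if the result contains NaN.

import torch

def predict_normals(model, img_tensor):
    # Prefer the GPU when available
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    model = model.to(device).eval()
    with torch.no_grad():
        out = model(img_tensor.to(device))
    # On some GPUs the output is all NaN; retry on the CPU (slow but correct)
    if device.type == 'cuda' and torch.isnan(out).any():
        model = model.to('cpu')
        with torch.no_grad():
            out = model(img_tensor.to('cpu'))
    return out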

@alexsax
Collaborator

alexsax commented Jul 21, 2023 via email

@yuyingyeh
Author

> I have the same problem. Loading the model to the CPU instead of the GPU seems to work, but is very slow. After some testing, I found that the problem only seems to occur on GPUs with certain or perhaps old architectures. On my Ampere-based RTX 3080 everything works fine, but when running the same code on a Pascal-based GTX 1050 Ti the model predicts NaN values.

Thanks for tracking down the problem! I have also tested on another machine and it works there!

The command used to test:

cd omnidata/omnidata_tools/torch
python demo.py --task normal --img_path assets/demo/test1.png --output_path assets/

What I have tried:

  1. [Not working] Ubuntu docker + Turing-based RTX 2080 Ti
  2. [Working] Windows 11 + Ada Lovelace-based RTX 4090 + Anaconda

@zzt76

zzt76 commented Nov 1, 2023

I face the same problem when using the normal model:

  1. [WORKING] Win11 + RTX 4060
  2. [NOT WORKING] Ubuntu + V100

@Totoro97

I face the same problem:

[WORKING] Ubuntu + RTX 3090, PyTorch 2.0.1, CUDA 11.8
[NOT WORKING] Ubuntu + V100, PyTorch 2.0.1, CUDA 11.8

@haotongl

[NOT WORKING] Ubuntu + V100, PyTorch 2.0.1, CUDA 11.8
