-
Notifications
You must be signed in to change notification settings - Fork 266
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] pynvml.smi.DeviceQuery() errors when run in the Intro01 demo notebook due to bad device brand (10) returned #338
Comments
@Riebart , thanks for this issue. Can you share with me the output of pynvml is an external library, so it may be good to send the details of this issue to gpuopenanalytics who owns pynvml: https://github.com/gpuopenanalytics/pynvml |
This problem is already reported in two issues on pynvml: here and here. Output of
Output on the RTX3080 Mobile (I'll update this comment later and include it). |
Awesome. I'll track this. As this is a pynvml issue, would you be able to remove that cell form your workflow or prefer that we comment out or remove that cell and replace it with the standard |
For our use case (training and hands-on workshops), we can comment out/remove that cell, as we're doing other automated transformations on the notebooks to change sample counts to match the size of the MIG slices we're using anyway (since we usually don't have 16GB of VRAM per participant). It might be worth commenting it out until pynvml fixes the issue to avoid confusing new users at the very beginning of the very first intro notebook. |
Sorry for the late response here, but I'd like to declare a bit of a warning that the |
@rjzamora That makes sense to me. It's important to note that the |
@taureandyernv Still no joy, but a different error this time. This is related to issues we've observed in other areas, such as
|
Aww man. okay, i'll check that out Monday, unless its P0. Does this can you send me your environment? |
Just to clarify, rapidsai/dask-cuda#674 has been merged and MIG devices should now be supported by Dask-CUDA. However, it's still not the most user-friendly interface, the only way to enable MIG devices at this time is to specify each MIG instance to With MIG, you can't use |
Describe the bug
When using
rapidsai/rapidsai:21.06-cuda11.2-runtime-ubuntu20.04-py3.8
on either an RTX 3080 Mobile or A100 MIG partition and when running the intro_01 notebook, thenvmlDeviceGetBrand()
call invoked from thepynvml
library returns an known device brand.Steps/Code to reproduce bug
docker run --gpus all --rm -it -p 8888:8888 -p 8787:8787 -p 8786:8786 rapidsai/rapidsai:21.06-cuda11.2-runtime-ubuntu20.04-py3.8
pynvml.smi
.Expected behavior
The
.DeviceQuery()
succeeds without error.Environment details (please complete the following information):
rapidsai/rapidsai:21.06-cuda11.2-runtime-ubuntu20.04-py3.8
) usingnvidia-docker2
installed from official repo as runtime.The text was updated successfully, but these errors were encountered: