
Hello!
Since today I get AssertionError: Torch not compiled with CUDA enabled for PyTorch 1.8.
Tasks that I submitted yesterday to the queue are also not working, even though they ran yesterday. PyTorch 1.7 based tasks work fine. Any idea what I could have done wrong?
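
A quick way to narrow this kind of error down is to ask the installed PyTorch which CUDA toolkit it was built against. The following is only a minimal sketch: `describe_torch_build` is a hypothetical helper name, while `torch.version.cuda` and `torch.cuda.is_available()` are standard PyTorch attributes. A CPU-only build reports `None` for `torch.version.cuda`, which is usually what sits behind "Torch not compiled with CUDA enabled".

```python
import importlib.util


def describe_torch_build() -> dict:
    """Report whether the installed PyTorch is a CUDA build (hypothetical helper)."""
    # Avoid an ImportError if PyTorch is not installed in this environment.
    if importlib.util.find_spec("torch") is None:
        return {"installed": False}

    import torch

    return {
        "installed": True,
        "torch_version": torch.__version__,
        # None means a CPU-only build was installed.
        "built_with_cuda": torch.version.cuda,
        # False if the build is CPU-only or no usable GPU/driver is found.
        "cuda_available": torch.cuda.is_available(),
    }


if __name__ == "__main__":
    print(describe_torch_build())
```

Running this inside the environment the agent created for the task should show whether a CPU-only package was resolved.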

  
  
Posted 3 years ago

Answers 161


Perfect! 🙂

  
  
Posted 3 years ago

Okay found it 🙂 it returns 11020 instead of 112
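
For context: the CUDA driver API reports its version as a single integer, 1000 * major + 10 * minor, so 11020 stands for 11.2. A rough sketch of turning that into the short form used in this thread follows; `short_cuda_version` is a made-up name for illustration, not the clearml-agent function.

```python
def short_cuda_version(raw: int) -> str:
    """Convert a driver-style integer (1000*major + 10*minor) to e.g. '112'."""
    major, rest = divmod(raw, 1000)
    minor = rest // 10
    return f"{major}{minor}"


if __name__ == "__main__":
    for raw in (11000, 11010, 11020):
        print(raw, "->", short_cuda_version(raw))
```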

  
  
Posted 3 years ago

Type "help", "copyright", "credits" or "license" for more information.
>>> from clearml_agent.helper.gpu.gpustat import get_driver_cuda_version
>>> get_driver_cuda_version()
'110'
  
  
Posted 3 years ago

Let me check something

  
  
Posted 3 years ago

So it should have detected 11.2...

  
  
Posted 3 years ago

I do not have a global cuda install on this machine. Everything except for the driver is installed via conda.

  
  
Posted 3 years ago

drwxr-xr-x 10 root root 4096 Jul 31  2020 .
drwxr-xr-x 14 root root 4096 Jul 31  2020 ..
drwxr-xr-x  2 root root 4096 Feb  4 13:52 bin
drwxr-xr-x  2 root root 4096 Jul 31  2020 etc
drwxr-xr-x  2 root root 4096 Jul 31  2020 games
drwxr-xr-x  2 root root 4096 Jul 31  2020 include
drwxr-xr-x  4 root root 4096 Feb  3 13:40 lib
lrwxrwxrwx  1 root root    9 Dez 10 14:29 man -> share/man
drwxr-xr-x  2 root root 4096 Jul 31  2020 sbin
drwxr-xr-x  7 root root 4096 Jul 31  2020 share
drwxr-xr-x  2 root root 4096 Jul 31  2020 src
  
  
Posted 3 years ago

Thu Mar 11 17:52:45 2021       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.56       Driver Version: 460.56       CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  GeForce RTX 3090    Off  | 00000000:01:00.0 Off |                  N/A |
| 61%   63C    P2   296W / 350W |   8318MiB / 24268MiB |     74%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   1  GeForce RTX 3090    Off  | 00000000:21:00.0 Off |                  N/A |
| 30%   29C    P8    20W / 350W |      1MiB / 24268MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A    133165    C+G   ...s-builds.1/3.7/bin/python     8314MiB |
+-----------------------------------------------------------------------------+
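
The "CUDA Version" in the header above is the highest CUDA version the driver supports, not an installed toolkit. As an illustration only, it can be pulled out of the text with a regular expression; `cuda_version_from_smi` is a hypothetical helper and is not how clearml-agent does its detection.

```python
import re
from typing import Optional, Tuple

# Matches the "CUDA Version: 11.2" field in the nvidia-smi header line.
_CUDA_FIELD = re.compile(r"CUDA Version:\s*(\d+)\.(\d+)")


def cuda_version_from_smi(text: str) -> Optional[Tuple[int, int]]:
    """Return (major, minor) parsed from nvidia-smi output, or None if absent."""
    match = _CUDA_FIELD.search(text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


if __name__ == "__main__":
    header = "| NVIDIA-SMI 460.56       Driver Version: 460.56       CUDA Version: 11.2     |"
    print(cuda_version_from_smi(header))
```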
  
  
Posted 3 years ago

@<1523701868901961728:profile|ReassuredTiger98> what are you getting with:

nvidia-smi

And here:

ls -la /usr/local/
  
  
Posted 3 years ago

Or there should be an early error for trying to run conda based tasks on pip agents

  
  
Posted 3 years ago

btw: why is agent.package_manager an agent attribute? Imo it does not make sense, because conda can install pip packages but pip cannot install conda packages, which can lead to install failures, right?

  
  
Posted 3 years ago

I get 110 but it should be 111

  
  
Posted 3 years ago

One more thing: The cuda_version that clearml finds automatically is wrong.

  
  
Posted 3 years ago

Thank you! 🙂

  
  
Posted 3 years ago

Yes that is exactly what I will make sure we change :)

  
  
Posted 3 years ago

Perfect, will try it. fyi: The conda_channels that I used are from clearml-agent init

  
  
Posted 3 years ago

Well, in that case, just change the order and it should solve it (I'll make sure we have that as the default):

conda_channels: ["pytorch", "conda-forge", "defaults", ]

It should solve the issue 🙂
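
To illustrate why the order can matter, here is a deliberately simplified model of strict channel priority, where the first channel in the list that carries a package wins. The real conda solver is far more involved, and the contents of `toy_index` are invented for the example; they are not a claim about what the actual channels ship.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical index: channel name -> {package name: build description}.
toy_index: Dict[str, Dict[str, str]] = {
    "defaults": {"pytorch": "1.8.0 (cpu build)"},
    "conda-forge": {},
    "pytorch": {"pytorch": "1.8.0 (cuda build)"},
}


def pick_build(
    package: str, channels: List[str], index: Dict[str, Dict[str, str]]
) -> Optional[Tuple[str, str]]:
    """Return (channel, build) from the first channel that has the package."""
    for channel in channels:
        build = index.get(channel, {}).get(package)
        if build is not None:
            return channel, build
    return None


if __name__ == "__main__":
    print(pick_build("pytorch", ["defaults", "conda-forge", "pytorch"], toy_index))
    print(pick_build("pytorch", ["pytorch", "conda-forge", "defaults"], toy_index))
```

In this toy model the first ordering picks the build from "defaults" and the second picks the one from "pytorch", which matches the symptom described in the thread.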

  
  
Posted 3 years ago

conda_channels: ["defaults", "conda-forge", "pytorch", ]
  
  
Posted 3 years ago

@<1523701868901961728:profile|ReassuredTiger98> what do you have in the clearml.conf under "conda_channels" ?
Is this it ?
None

  
  
Posted 3 years ago

Will do!

  
  
Posted 3 years ago

Can you ping me when it is updated in None so I can update my installation?

  
  
Posted 3 years ago

No problem! I profit so much from clearml 🙂

  
  
Posted 3 years ago

Great, thanks!

  
  
Posted 3 years ago

channels:
- pytorch
- conda-forge
- defaults
dependencies:
- cudatoolkit~=11.1.1
- pytorch~=1.8.0

Works fine
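
For reference, a small sketch of what `~=` means under PEP 440 "compatible release" semantics: `~=1.8.0` is equivalent to `>=1.8.0, ==1.8.*`. This assumes purely numeric versions and is not taken from clearml-agent or conda; `compatible_release` is a hypothetical name.

```python
from typing import Tuple


def _parse(version: str) -> Tuple[int, ...]:
    """Split a purely numeric version such as '1.8.0' into integers."""
    return tuple(int(part) for part in version.split("."))


def compatible_release(candidate: str, spec: str) -> bool:
    """Check `candidate ~= spec` for numeric versions (PEP 440 style)."""
    s = _parse(spec)
    if len(s) < 2:
        raise ValueError("~= needs at least two release segments")
    c = _parse(candidate)
    # Pad with zeros so that 1.8 and 1.8.0 compare as equal.
    width = max(len(c), len(s))
    c_pad = c + (0,) * (width - len(c))
    s_pad = s + (0,) * (width - len(s))
    # All segments of the spec except the last one must match exactly.
    prefix_len = len(s) - 1
    return c_pad >= s_pad and c_pad[:prefix_len] == s[:prefix_len]


if __name__ == "__main__":
    print(compatible_release("1.8.1", "1.8.0"))
    print(compatible_release("1.9.0", "1.8.0"))
```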

  
  
Posted 3 years ago

@<1523701868901961728:profile|ReassuredTiger98> thank you so much for testing it!

  
  
Posted 3 years ago

Damn, okay I'll make sure we fix the order.
Could you verify that ~= works as intended (if the order is correct)?

  
  
Posted 3 years ago

Just tested again. The ordering definitely matters.

  
  
Posted 3 years ago

okay, I'll make sure we order it correctly

  
  
Posted 3 years ago

I'll try it one more time just to make sure.

  
  
Posted 3 years ago

WTF?!

  
  
Posted 3 years ago