Reputation
Badges 1
611 × Eureka!The package is just subdir by the way. So it should not be in installed packages anyways, right?
Okay, it seems like it just takes some time to delete and to reflect in the WebUI. So when I try to delete again, actually a deletion process seems already to be running in the background.
You can add and remove clearml-agents to/from the clearml-server anytime.
btw: why is agent.package_manager and agent attribute. Imo it does not make sense because conda can install pip packages, but pip cannot install conda packages which can lead to install failures, right?
I will create a minimal example.
AgitatedDove14 Thank you, that explains it.
Thank you! I agree with CostlyOstrich36 that is why I meant false sense of security 🙂
Thank you SuccessfulKoala55 so actually only the file-server needs to be secured.
drwxr-xr-x 10 root root 4096 Jul 31 2020 .
drwxr-xr-x 14 root root 4096 Jul 31 2020 ..
drwxr-xr-x 2 root root 4096 Feb 4 13:52 bin
drwxr-xr-x 2 root root 4096 Jul 31 2020 etc
drwxr-xr-x 2 root root 4096 Jul 31 2020 games
drwxr-xr-x 2 root root 4096 Jul 31 2020 include
drwxr-xr-x 4 root root 4096 Feb 3 13:40 lib
lrwxrwxrwx 1 root root 9 Dez 10 14:29 man -> share/man
drwxr-xr-x 2 root root 4096 Jul 31 2020 sbin
drwxr-xr-x 7 root root 4096 Jul 31 2020 share
drwxr-xr-x ...
Quick question: Where again does clearml place the venv? I wanna take a look into it after the task has failed
One more thing: The cuda_version that clearml finds automatically is wrong.
My driver says "CUDA Version: 11.2" (I am not even sure this is correct, since I do not remember installing code in this machine, but idk) and there is no pytorch for 11.2, so maybe it fallbacks to cpu?
But I do not have anything linked correctly since I rely in conda installing cuda/cudnn for me
Perfect, will try it. fyi: The conda_channels that I used are from clearml-agent init
This my environment installed from env file. Training works just fine here:
From the logs when ran with --foreground I I do not see any conda create command.
So if understand correctly, something like this should work?
task = Task.init(...) task.connect_configuration( {"agent.package_manager.system_site_packages": False} ) task.execute_remotely(queue_name, clone=False, exit_process=True)
I do not have a global cuda install on this machine. Everything except for the driver is installed via conda.