Hi @<1523704157695905792:profile|VivaciousBadger56> , can you elaborate on this error please?
2023-02-14 13:06:44,336 - clearml.Task - WARNING - Failed auto-detecting task repository: [WinError 123] Die Syntax für den Dateinamen, Verzeichnisnamen oder die Datenträgerbezeichnung ist falsch: '[...]\\<input>'
Can you add such an attempt and the outputs please?
Hi @<1724960468822396928:profile|CumbersomeSealion22> , can you provide a log of such a run?
Aight. Thanks for the information. I'll take a look and see if it reproduces for me as well 🙂
Hi @<1535793988726951936:profile|YummyElephant76> , did you use Task.add_requirements ?
None
@<1719524641879363584:profile|ThankfulClams64> , if you set auto_connect_streams to false nothing will be reported from your frameworks. With what frameworks are you working, tensorboard?
Also, I would suggest trying pipelines from decorators, I think it would be much smoother for you
Hi @<1691983266761936896:profile|AstonishingOx62> , I'm not sure I understand what you're trying to do. You have some python code unrelated to ClearML. Does it run without issues? Did you afterwards add Task.init() to that code?
can you try reinstalling clearml-agent ?
You shouldn't lose credentials. How exactly are you deploying your server? All of the related data to the server should be saved in one of the /opt/ folders as explained in the installation steps
Don't think so. Consider that agent doesn't know who executed the pipeline or even on which machine.
Which mode? It is indeed vague 🙂
try with pip install -U clearml==1.7.2rc1
Try with sdk.development.default_output_uri as well
Hi @<1535069219354316800:profile|PerplexedRaccoon19> , not sure what you mean. Can you please share the full log, a screenshot of the two experiments and some snippet that re-creates this for you?
Hi @<1523701066867150848:profile|JitteryCoyote63> , you mean a global "env" variable that can be passed along the pipeline?
I think I found what you need 🙂
https://clear.ml/docs/latest/docs/references/sdk/task#get_parametersget_parameters(cast=True)
Hi ExasperatedCrocodile76 ,
When running in docker mode the agent should handle all the points you raised above and just work 🙂
Hi ObedientToad56 , you can simply delete all of them since it's only cache. It's safe to delete cache 🙂
Can you copy paste the error you got?
UnevenDolphin73 , that's an interesting case. I'll see if I can reproduce it as well. Also can you please clarify step 4 a bit? Also on step 5 - what is "holding" it from spinning down?
Hi @<1754676274460102656:profile|CurvedStarfish68> , it takes some time until metrics are updated. I suggest deleting a few more experiments and waiting.
Hi @<1772433273633378304:profile|VexedWoodpecker50> , these are the packages that were on the environment that ran the experiment. Please see here - None
No way to do that currently, you need to create an automation to remove everything
But I believe you can use your user/group if you prefer 🙂
Hi SmugTurtle78 , I'm not sure it's possible. Maybe SuccessfulKoala55 knows some workaround. Do you want to get rid of them in all scenarios or in just some specific use cases?
Hi @<1529271098653282304:profile|WorriedRabbit94> , I'll ask the guys to take a look at this and what is required for it.
I suggest running it in docker mode with a docker image that already has cuda installed
Hi @<1544853695869489152:profile|NonchalantOx99> , can you run that docker with the docker run command?