@<1556812486840160256:profile|SuccessfulRaven86> , I think this is because you don't have the proper permissions 🙂
Hi @<1585078763312386048:profile|ArrogantButterfly10> , I think you're having an issue with your docker... The error comes from docker not from ClearML
You can save it as a dataset and then fetch it during run time, or am i missing something?
Hi @<1676762887223250944:profile|FancyKangaroo34> , it would be possible for example if the docker image has a different python version from what you ran on previously
Aight. What OS are you on, also, what is the status of this deploy? Is this a clean install, version upgrade or it just stopped working after a restart? 🙂
What is the exact python version you're trying to run on?
CheerfulGorilla72 , can you please provide to me how your ~/clearml.conf has the following settings configured?
api.web_server
api.api_server
api.files_server
Did you copy the example to a different repository and ran it from there? I think you need to commit your code for new files
Hi @<1574931891478335488:profile|DizzyButterfly4> , not sure what you mean. Can you elaborate on what you see vs what you expect to see?
Hi LackadaisicalHedgehong78 . It seems that someone/something sent a command to delete a bunch of tasks. Do you have backups?
Hi @<1787653566505160704:profile|EnchantingOctopus35> , I don't think you can clean up parents without damaging the children since they rely on that. I would suggest taking any data you don't want scrubbed and then creating a new version with it. Then delete the unrelated older datasets. What do you think?
You mean you would like to delete an output model of a task if other models in the task surpass it?
Hi SuperiorPanda77 , I'm not sure I understand, can you elaborate on the specific use case?
Hi @<1698868530394435584:profile|QuizzicalFlamingo74> , Try compression=False
Can you please run ls -la /opt/clearml and send the output + your docker compose file
Hi @<1529271098653282304:profile|WorriedRabbit94> , I'll ask the guys to take a look at this and what is required for it.
128GB RAM, 32 cores and 2 GPUs.
WOW 😮 I'm so jealous
However, after a while my container will exit, but also the clearml-server stops responding correctly. WebUI will not show updates and only a few experiments are shown at all. After restarting the apiserver, the clearml-server works correctly again.
Do you get any errors on how/why the container exist? Which container is it?
What version of clearml are you using? Can you provide a code snippet that reproduces this?
Do you maybe have blocks on some ports locally?
I think this is covered in the enterprise version
@<1544853721739956224:profile|QuizzicalFox36> , yes 🙂
Hi RattyLouse61 ,
Do you have an example of the parameters you're trying to connect?
'True' should point to the files server
You have a small cog wheel on the right of the graphs. You can switch presentation to 'Wall Time' to see how much time it took 🙂
Hi @<1590514584836378624:profile|AmiableSeaturtle81> , what version is your server? Can you provide logs from the apiserver container?
This is the env variable you're looking for - CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL
Hi @<1643785593177509888:profile|FrustratingSeagull27> , do you have some sample code that recreates this behavior?