VexedCat68 , It appears to be a bug of sorts, we'll sort it out 🙂
Hi ReassuredArcticwolf33 , what are you trying to do and how is it being done via code?
And you made sure to run clearml-agent init
on the machine or to implement the clearml.conf
manually?
I think so, yes
Hi @<1670964687132430336:profile|SpicyFrog56> , can you please add the full log?
What actions did you take exactly to get to this state?
Hi @<1558986867771183104:profile|ShakyKangaroo32> , can you please elaborate more on what is happening? So you're taking an existing task that finished and forcing it to get 'started' again? Then you write some things to it sometimes and then later you 'revive' it again? And due to this it appears some artifacts are missing?
WittyOwl57 , It determines the user that created the object. What is the sign in method that you and your team are using?
Hi @<1539417873305309184:profile|DangerousMole43> , can you add an example of your usage + the errors you were getting?
Hi @<1772433273633378304:profile|VexedWoodpecker50> , these are the packages that were on the environment that ran the experiment. Please see here - None
DepressedFish57 , I'm not sure there is something for this. There is a cleanup service to kill old failed/aborted tasks but to clear caches you'd have to handle it manually 🙂
but without -d
I was referring to the SDK python package (ClearML) and ClearML-Agent
GiganticTurtle0 Hi 🙂
How about Task.get_task()
?
https://clear.ml/docs/latest/docs/references/sdk/task#taskget_task
You'd need to provide it the project name and task name
Did you raise a serving engine?
What version of clearml
and clearml-agent
are you using, what OS? Can you add the line you're running for the agent?
CrookedWalrus33 , Hi 🙂
Can you please provide which packages are missing?
ReassuredTiger98 , Hi 🙂
Which version of clearml-agent are you using? Do you get the same result if you manually kill the docker container by force?
Hi @<1549202366266347520:profile|GorgeousMonkey78> , at what point does it get stuck? What happens if you remove the Task.init
line from the script?
Hi @<1775332371021697024:profile|TenseSwan1> , I think what you're looking for is the output_uri
parameter in Task.init
I think this can give you more information:
https://stackoverflow.com/questions/51279711/what-does-1000-mean-in-chgrp-and-chown
This means it assigns the owner as the first linux user on that machine.
Hi @<1523703961872240640:profile|CrookedWalrus33> , I think by "mutable" it means that the object itself is mutable when connecting.
I'm curious, what is your use case that you want to change the values in the code itself? The intended usage is to connect the config object and then control it via the webUI / API
Hi GiganticMole91 ,
I see that the storage settings are also available through environment variables, but I'm worried that the environment variables have already been parsed at that time.
I'm not sure I understand. Can you elaborate? How do you run in remotely? Do you raise an instance each time or are your instances persistent?
I think you would have to re-register it
Hi @<1523701240951738368:profile|RoundMosquito25> , when a different user runs something with their own generated credentials from the UI will show as different user. Does that clear things up?
Also, please go into the UI - go to the experiment that was executed remotely. Open developer tools (F12) and see what is returned when you navigate to the plots page in the UI
So even if you abort it on the start of the experiment it will keep running and reporting logs?
Hi @<1643060818490691584:profile|MagnificentHedgehong41> , did you specify a project name? You can go into settings and enable showing hidden projects/experiments and then you will be able to see the pipeline steps in projects as well
Yes. Run all the pipelines examples and see how the parameters are added via code to the controller.
For example:
None
Yes you can, see the examples here 🙂
None