Hi @<1523701842515595264:profile|PleasantOwl46> , I think it's a Docker Hub limitation for non-paying accounts pulling too many images, and it's unrelated to ClearML. You can always host your own artifactory as well.
ClearML has a built-in model repository, so together I think they make a "feature store". Again, it really depends on your definition.
Hi MoodyCentipede68 ,
What version of ClearML / ClearML-Agent are you using? Is it a self hosted server or the SaaS?
Also, can you explain what step 7 was trying to do? Is it running locally or distributed?
What is the command you used to run the agent?
BoredPigeon26 , are images from previous iterations still showing?
Hi @<1882599179692281856:profile|FriendlyBluewhale89> , not sure what you mean by metadata displaying. Can you elaborate?
Hi @<1664079296102141952:profile|DangerousStarfish38> , yes you can 🙂
Hi @<1555362936292118528:profile|AdventurousElephant3> , if you clone/reset the task, you can change the logging level to 'debug'
Or when running something and uploading to S3?
Hi @<1523703397830627328:profile|CrookedMonkey33> , not sure I follow. Can you please elaborate more on the specific use case?
Currently you can add plots to the preview section of a dataset
You set up 2 agents - one running with docker and the other without. Each agent should be listening to a different queue. Makes sense?
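To make the two-agent setup concrete, here is a minimal sketch that only builds the two `clearml-agent daemon` command lines (it does not start anything). The queue names `docker_queue` and `venv_queue` and the helper `agent_command` are made up for illustration, so swap in your own queues.

```python
import shlex


def agent_command(queue, use_docker=False, docker_image=None):
    """Build the argv for one clearml-agent daemon listening on one queue."""
    cmd = ["clearml-agent", "daemon", "--queue", queue]
    if use_docker:
        cmd.append("--docker")
        if docker_image:
            # Optional default image for tasks that do not specify one.
            cmd.append(docker_image)
    return cmd


# Hypothetical queue names: one queue per agent.
docker_agent = agent_command("docker_queue", use_docker=True)
venv_agent = agent_command("venv_queue")

print(shlex.join(docker_agent))
print(shlex.join(venv_agent))
```

Tasks that need a container then go to the first queue, and everything else goes to the second.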
OK, then just try the docker image I suggested 🙂
Hi @<1772795696529805312:profile|LethalCoral80> , I think what you're looking for is the offline mode - None
GreasyLeopard35 , what happens if you run the command the agent is trying to run yourself?
It can be changed with this env var for the apiserver:
CLEARML__hosts__elastic__events__args__timeout=<new number>
Though a better fix would be to either increase the Elasticsearch capacity (memory and CPU) or decrease the load (send events in smaller batches).
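If you go the "smaller batches" route, the idea is roughly the following. This is only a sketch: `send` is a hypothetical callback standing in for whatever actually ships events to the server, and the batch size of 100 is an arbitrary example value.

```python
from itertools import islice


def batched(events, batch_size):
    """Yield consecutive lists of at most batch_size events."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    it = iter(events)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def send_in_batches(events, send, batch_size=100):
    """Call send() once per batch and return the number of calls made."""
    calls = 0
    for batch in batched(events, batch_size):
        send(batch)  # hypothetical sender, one smaller request per batch
        calls += 1
    return calls
```

Smaller batches mean more requests, but each one is cheaper for Elasticsearch to index, so it is less likely to hit the timeout.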
Hi DeterminedOwl36 , are you sure the file is there? Can you access it via console?
Can you add a full log of an experiment?
SubstantialElk6 , can you view the dataset in the UI? Can you please provide a screenshot so I can mark it up for you?
I think this is what you're looking for - None
Just make sure you make regular backups
Hi @<1567321746447536128:profile|EmaciatedCentipede72> , I would suggest checking which API calls the UI sends when you work with reports. You can see them in the network section of developer tools (F12). Filter by XHR to make them easier to read 🙂
Hi @<1523701260895653888:profile|QuaintJellyfish58> , yes it is. You can simply specify a branch 🙂
Looks like a network issue.
As a side note, I would suggest removing & revoking all credentials you've pasted here 🙂
Hi @<1607184400250834944:profile|MortifiedChimpanzee9> , yes 🙂
This is exactly how the autoscalers work. They scale from 0 to as many as needed and then back to 0.
Hi @<1573119955400921088:profile|CloudyPelican46> , you can certainly do this. You can find all the related API calls here - None
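As a toy illustration of the "0 to N and back to 0" behaviour, here is a sketch of the sizing decision. This is not the actual ClearML autoscaler code. The function and parameter names are made up, and a real autoscaler would typically also wait for some idle time before spinning instances down.

```python
def desired_instances(pending_tasks, busy_instances, max_instances):
    """Return how many instances should be up right now.

    One instance per pending or running task, capped at max_instances,
    and zero when there is no work at all.
    """
    if min(pending_tasks, busy_instances, max_instances) < 0:
        raise ValueError("counts must be non-negative")
    return min(pending_tasks + busy_instances, max_instances)
```

With an empty queue and nothing running it returns 0, which is the "back to 0" part.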
I suggest opening developer tools (F12) and seeing what is sent in the UI to fetch the various metrics you're looking for
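Once you've spotted a call in the network tab, replaying it from a script is mostly a matter of copying the endpoint and the JSON body. Below is a rough sketch that only builds the request and does not send it. The server address, the `tasks.get_all` endpoint, the payload and the bearer-token header are assumptions for illustration, so use whatever you actually see in developer tools.

```python
import json
import urllib.request


def build_api_request(api_server, endpoint, payload, token):
    """Build (but do not send) a POST request for one API endpoint."""
    url = f"{api_server.rstrip('/')}/{endpoint}"
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
    )


req = build_api_request(
    "http://localhost:8008",   # assumed API server address
    "tasks.get_all",           # replace with the endpoint seen in dev tools
    {"id": ["<task_id>"]},     # placeholder payload copied from the request
    token="<token>",           # placeholder token
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it, once the placeholders are replaced with real values.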
Hi @<1874989039501709312:profile|LividDragonfly0> , podman is supported out of the box and is transparent. You can just run the agent with the --docker flag.
By default, the worker checks the backend every 5 seconds for new tasks in the queue. While running a task, I think it basically sends the same API calls a regular local task sends.
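Conceptually, that polling behaviour looks something like the loop below. It is a simplified sketch, not the agent's real implementation: `get_next_task` and `run_task` are hypothetical callbacks, and `max_polls` only exists so the loop can be stopped in a demo.

```python
import time


def poll_queue(get_next_task, run_task, interval_sec=5.0,
               max_polls=None, sleep=time.sleep):
    """Poll for tasks at a fixed interval and run each one that is found.

    Returns the number of tasks that were run.
    """
    ran = 0
    polls = 0
    while max_polls is None or polls < max_polls:
        polls += 1
        task = get_next_task()      # hypothetical: returns None if queue is empty
        if task is not None:
            run_task(task)          # hypothetical: executes the task
            ran += 1
        else:
            sleep(interval_sec)     # nothing queued, wait before asking again
    return ran
```

The worker only sleeps when the queue is empty, so an idle agent costs the backend roughly one request per interval.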
ResponsiveHedgehong88 , do you have an option to log into the machine and see the state or if there were any errors? Is there any chance it's running out of memory? The agent also keeps a local log, can you take a look there to see if there is any discrepancy?