Hi @<1523701168822292480:profile|ExuberantBat52> , I think someone should review this soon enough 🙂
Hi @<1578555761724755968:profile|GrievingKoala83> , I don't think so.
Hi @<1523701868901961728:profile|ReassuredTiger98> , I think you can achieve a similar affect using add_external_files - None
DepressedChimpanzee34 , Hi 🙂
Let's break this one down:
In the 'queues & workers' window if you switch to 'queues' you can actually see all the workers assigned to a specific queue In the workers window, you can see which workers are active and which are not. Is this enough or do you think something else is needed? You can see the resources used by each worker in the workers window. Is that what you mean? You can already do that! Simply drag and drop experiments in the queue window
I'm...
Hi @<1529271085315395584:profile|AmusedCat74> , can you please provide the full log of the autoscaler?
TenseOstrich47 , you can specify a docker image withtask.set_base_docker(docker_image="<DOCKER_IMAGE>")You will of course need to login to ECR on that machine so it will be able to download the docker image.
Are you seeing any errors in the webserver container?
@<1523704157695905792:profile|VivaciousBadger56> , ClearML's model repository is exactly for that purpose. You can basically use InputModel and OutputModel for handling models in relation to tasks
ScaryBluewhale66 ,
If you want to re-run - you need the agent It's still a Task object so you can just use Task.close() I'm not sure if something exists at the moment but you could write it faily easily in code
Please try updating the ClearML server to the latest. Doesn't reproduce for me with newer versions of the backend - 1.13.0
You have a self hosted server, several agents that are running and two users developing on their own machines but the training is done elsewhere?
Hi ReassuredOwl55 , can you please elaborate on your use case or exactly what you're trying to achieve?
Hi @<1523702932069945344:profile|CheerfulGorilla72> , in Task.init specify output_uri= None
Great, thanks! Looking into it 🙂
I don't believe this is part of the open documentation. In the enterprise there is an admin panel, SSO integration and RBAC on top of all the user management system. All of this is managed via an API like everything else in the system.
May I ask why you need docs on this?
Hi!
What version of ClearML-Agent are you using?
Also from within the docker, what do you get when you run the following commands:which pythonwhich python3which python3.7which python3.8
Hi @<1523701553372860416:profile|DrabOwl94> , can you check if there are some errors in the Elastic container?
Check the environment variables, maybe test with export maybe there's some env var hiding there 🙂
you said you're on ubuntu... please describe exactly where/how the server + agent are set up
How do you run docker compose? If you run it with the -d in the end it should stay and be persistent even after restart, if I'm not mistaken
Hi @<1523701132025663488:profile|SlimyElephant79> , to answer your questions:
-
Does ClearML store any dataset outside of this S3 storage (and local storage) for preview or compression purposes?Some preview data might be stored inside mongodb (if it's a table for example). This of course can be disabled so no data/previews are exposed to the ClearML backend.
-
Are any data packets related to model or data versioning routed through the API server? (We are pretty confident it shouldn't, but j...
Hi OddShrimp85 , you mean bash script? I don't think there is something built in to run a script afterwards but I'm sure you could incorporate it in your python script.
I'm curious, what is the use case?
Hi @<1882599179692281856:profile|FriendlyBluewhale89> I think its support@clearml.ai , it should be on the website 🙂
What happens if you look at elastic container logs directly? I think it's something along the lines sudo docker logs clearml-elastic --follow . Don't catch me on the exact syntax naming tho 😛
What's the docker image that you're using?
Hi @<1544853695869489152:profile|NonchalantOx99> , can you please add the full log?
@<1673501397007470592:profile|RelievedDuck3> , I think you can solve this by going to /login instead of /dashboard in the URL