Hi @<1570583227918192640:profile|FloppySwallow46> . We've updated the rate limits. Can you please check if the issue is still occurring?
We limit the allowed calls per IP - to make sure the server is not blocked accidentally. We enabled over 1000 calls per minute.
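To illustrate what a per-IP call limit means in practice, here is a minimal sliding-window sketch. It is not the server's actual implementation; the class name, the default of 1000 calls, and the 60-second window are assumptions chosen only to mirror the numbers mentioned above.

```python
import time
from collections import defaultdict, deque


class PerIpRateLimiter:
    """Hypothetical sliding-window limiter: at most `max_calls` per window, per IP."""

    def __init__(self, max_calls=1000, window_seconds=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._calls = defaultdict(deque)  # ip -> timestamps of recent accepted calls

    def allow(self, ip):
        """Return True if a call from `ip` is accepted right now, else False."""
        now = self.clock()
        calls = self._calls[ip]
        # Forget calls that have fallen out of the window.
        while calls and now - calls[0] >= self.window_seconds:
            calls.popleft()
        if len(calls) >= self.max_calls:
            return False
        calls.append(now)
        return True
```

Each IP gets its own counter, so one busy client cannot use up the allowance of another.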
Hi DrainedHippopotamus42 - I just answered in Slack (didn't see the post here...). We found the issue and it will be resolved in the next SaaS update. For the time being I added more storage to your workspace. Once you run a task that reports metrics - the problem should disappear.
Please let me know if this works for you
DrainedHippopotamus42 - once you send more metrics, the calculation will be correct. The problem only occurs when you delete all the metrics.
Great! Thanks for the update
MotionlessMonkey27 - was this resolved?
Hi JitteryCoyote63 . Just to make sure we fully understand the scenario - you had a project with several experiments, some of which were archived. Then you went to archive and selected all of the experiments and deleted them using the bulk delete operation. And then you saw that the project was empty even when you were not in archive mode. Is that correct?
JitteryCoyote63 - thanks for the info. We are trying to reproduce this, and will update
AbruptWorm50 - the agents poll the queue, so any free agent can pull tasks. From the graph on the right, it looks like experiments were not waiting in the queue (max experiments is 1, and it was immediately pulled). Can you also check what happens if two experiments are enqueued at the same time?
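The polling behaviour described above can be pictured with a toy model. This is only a sketch of the idea (free agents pull from a shared FIFO queue), not the agent's real code; `poll_round` and the agent/experiment names are made up for illustration.

```python
from collections import deque


def poll_round(queue, free_agents):
    """One polling round: each free agent pulls at most one task, in FIFO order.

    `queue` is a deque of pending task names; `free_agents` is a list of idle
    agent names. Both are modified in place. Returns {agent: task}.
    """
    assignments = {}
    for agent in list(free_agents):  # iterate over a copy, since we remove items
        if not queue:
            break
        assignments[agent] = queue.popleft()
        free_agents.remove(agent)
    return assignments


# Two experiments enqueued at the same time, but only one free agent:
queue = deque(["exp_a", "exp_b"])
free = ["agent_1"]
print(poll_round(queue, free), "still waiting:", list(queue))
```

With a single free agent the second experiment stays in the queue until an agent becomes available; with two free agents both are pulled in the same round.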
AbruptWorm50 - just to make sure there is no misunderstanding - the last image you sent is on the "training" queue and not on the "services" queue. Are there free agents running on that queue?
AbruptWorm50 - does the issue still occur, or did you manage to resolve it?
SuperiorPanda77 - thanks for updating. So indeed these may be similar issues. I will re-check this and update
ShinyRabbit94 - The GitHub issue reported by SuperiorPanda77 is a display issue - meaning the files are deleted, and only the "undefined" display is incorrect. This should be fixed in the next release (1.4).
In your case the files are not deleted from the filesystem - so I assume this is a different issue. Can you provide more info so that I can try to reproduce:
How are the artifacts created? Are the links in the popup describing the un-deleted files correct? Can you add a screenshot?
@<1523701087100473344:profile|SuccessfulKoala55> - yes, plots are reported every iteration.
@<1564060248291938304:profile|AmusedParrot89> - the plot comparison indeed compares the latest iteration of the experiments. I will see if this can be better indicated somewhere
Looking at the 2nd image you sent, I see that in addition to the "services" queue, you also have queues called "training" and "training*_2" - and the experiment you circled is in the "training" queue. In that image - there are no experiments in the "services" queue.
If you press on the "services" queue (like you did in the first image) you can view the experiments in the queue and the workers. Can you check if there is a situation where there are tasks pending in that queue while one of the work...
NastySeahorse61 - can you check if this was updated? What does your usage page say?
NutritiousBear41 - I would like to ask you to try the following:
Refresh the web page (control+F5). Check the browser's local storage: F12 -> Application/Storage -> Local Storage - and copy the values of "currentVersion" and "dismissedVersion" (if they exist)
@<1574931903440490496:profile|CrookedBear44> - if you logged in with email - you can also log in with one of the other options available on the login page.
To reset your email's password please do the following:
- Go to the login page
- Press "Continue with Email"
- Go to the "Log In" tab
- Press "Don't remember your password"
- Fill in your email and press "SEND EMAIL"
Please make sure to check your spam folder
Let me know if you need any help.
Hi ThoughtfulGorilla90 - when did you perform the deletion? It might take up to 24 hours for the application to get the updated size
Hi ThoughtfulGorilla90 , how did you delete the experiments/models - was it from the web app or using the SDK?
The deletion of artifacts (debug samples, models, artifacts) is currently performed from the browser, after the task object is deleted from the backend. So it might take time for all the artifacts to be deleted - please make sure not to close your browser.
Note that only artifacts saved on the fileserver are deleted - external artifacts (in the cloud or on a local filesystem) are not deleted.
We are working on moving the deletion to the server side, to avoid these kinds of issues. Should be in ...
SmallBluewhale13 - the plot comparison shows only the last iteration, whereas the individual info screen displays the last 5 iterations. Could this explain the issue?
@<1526371986278715392:profile|VivaciousReindeer64> - Yes - please send me the docker-compose and the log of the webserver
Hi @<1526371986278715392:profile|VivaciousReindeer64> - please try this file:
you can re-direct it to a file or just copy with the clipboard
@<1564060248291938304:profile|AmusedParrot89> - I see the logic in displaying the last iteration per metric in the compare screen. We will need to think if this won't cause any other issues.
In the meantime - may I ask you to open a GitHub issue - so it will be easier to track?
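For anyone following the discussion, the difference between the two behaviours can be sketched as below. This is only an illustration under an assumed data layout (`{metric: {iteration: value}}`); the function names are hypothetical and this is not how the web app is implemented.

```python
def last_iteration_overall(scalars):
    """Keep only metrics that reported at the single latest iteration overall."""
    latest = max((it for points in scalars.values() for it in points), default=None)
    if latest is None:
        return {}
    return {m: pts[latest] for m, pts in scalars.items() if latest in pts}


def last_iteration_per_metric(scalars):
    """For every metric, take the value at that metric's own latest iteration."""
    return {m: pts[max(pts)] for m, pts in scalars.items() if pts}


# Hypothetical experiment: "val_acc" stopped reporting earlier than "loss".
scalars = {
    "loss": {0: 1.0, 10: 0.5},
    "val_acc": {0: 0.1, 5: 0.6},
}
print(last_iteration_overall(scalars))
print(last_iteration_per_metric(scalars))
```

In the first variant a metric that was not reported at the very last iteration drops out of the comparison; in the second it still appears, with its own most recent value.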
HandsomeGiraffe70 - please try to update to clearml 1.6.1 - this should solve the issue. Please update us if this worked
@<1523701790044852224:profile|FreshParrot56> - Indeed a bug. We will fix this in the upcoming version - due in a few weeks
@<1558986867771183104:profile|ShakyKangaroo32> - are you running the server using docker-compose?
If so - please add the following to the .env file in the same directory as the compose file:
CLEARML_FILES_HOST=http://<YOUR IP or HOSTNAME>:8081
Then restart the service by running:
sudo docker-compose up -d
Please update if this worked, or if you have any questions/issues
@<1558986867771183104:profile|ShakyKangaroo32> - if you check the api section in your client-side clearml.conf, the value for files_server there should be the same one that you set in the .env file on the server. Can you check that they are indeed the same?
If they are the same - can you please send me the output of the following command in the server:
sudo docker logs -n30 async_delete
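The comparison between the client-side files_server value and the server-side CLEARML_FILES_HOST can also be done with a small script. This is a rough sketch: it uses a simple regex rather than a real parser for clearml.conf, the helper names are made up, and the address in the usage example is a placeholder.

```python
import re


def read_env_value(env_text, key):
    """Return the value of `key` from .env-style text, or None if it is absent."""
    for line in env_text.splitlines():
        line = line.strip()
        if line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        if name.strip() == key:
            return value.strip().strip("'\"")
    return None


def read_files_server(conf_text):
    """Find the first `files_server` entry in clearml.conf text (regex, not a full parser)."""
    match = re.search(r'^\s*files_server\s*[:=]\s*"?([^"\s]+)"?', conf_text, re.MULTILINE)
    return match.group(1) if match else None


def same_files_host(env_text, conf_text):
    """True only if both values are present and equal (ignoring a trailing slash)."""
    server = read_env_value(env_text, "CLEARML_FILES_HOST")
    client = read_files_server(conf_text)
    if server is None or client is None:
        return False
    return server.rstrip("/") == client.rstrip("/")


# Placeholder contents; in practice read the real .env and clearml.conf files.
env_text = "CLEARML_FILES_HOST=http://192.0.2.10:8081\n"
conf_text = 'api {\n    files_server: "http://192.0.2.10:8081"\n}\n'
print(same_files_host(env_text, conf_text))
```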