Reputation
Badges 1
186 × Eureka!thanks! this bug and cloning problem seem to be fixed
just DMed you a screenshot where you can see a part of the token
yeah, I am aware of trains-agent, we are planning to start using it soon, but still, copying original training command would be useful
yeah, backups take much longer, and we had to increase our EC2 instance volume size twice because of these indices
got it, thanks, will try to delete older ones
thanks for the link advice, will do
I'll let you know if I managed to achieve my goals with StorageManager
parents and children. maybe tags, maybe separate tab or section, idk. I wonder if anyone else is interested in this functionality, for us this is a very common case
after the very first click, there is a popup with credentials request. nothing happens after that
1 - yes, of course =) but it would be awesome if you could customize the content - to include key metrics and hyperparameters, for example
3 - hooooooraaaay
not quite. for example, Iām not sure which info is stored in Elastic and which is in MongoDB
I don't think so because max value of each metric is calculated independently of other metrics
standalone-mode gives me "Could not freeze installed packages"
it will probably screw up my resource monitoring plots, but well, who cares š
docker mode. they do share the same folder with the training data mounted as a volume, but only for reading the data.
awesome news š
I assume, temporary fix is to switch to trains-server?
that's right
for example, there are tasks A, B, C
we run multiple experiments for A, finetune some of them in separate tasks, then choose one or more best checkpoints, run some experiments for task B, choose the best experiment, and finally run task C
so we get a chain of tasks: A - A-ft - B- C
ClearML pipeline doesn't quite work here because we would like to analyze results of each step before starting next task
but it would be great to see predecessors of each experiment in the chain
JIC - trains still works after that, it's just that the new user is not added and hence is not able to login