Reputation
Badges 1
40 × Eureka!if I don't have internet connection on the other machine, can I just copy the artifacts and transfer them to my local machine?
by communication that the artifacts are streamed from the machine running the experiments to the local server?
can it be done "offline" after the experiments run view them in my local server?
yes, I have limited access to the machine that is running the experiment. I can't setup a server there. but I want to collect the results and view them later
Hi AgitatedDove14 , path to the config file for trains manual execution
edit: tweaked it a little bit for my use-case:is_demo_server = ' http://demoapi.trains.allegro.ai ' in Session.get_api_server_host()is_server_available = requests.get(Session.get_api_server_host() + "/debug.ping").status_code == 200
AgitatedDove14 , I want multiple machines to access the synced state of the optimizer. which is part of the internals of the optimizer... and then report the results back to the optimizer such that the study object of the optimizer keeps track of the results and the next sample will be aware of all previous studies
I'm doing this instead
Thanks! I'll have a look and see if I have some useful ideas
Thanks AgitatedDove14 , well if a machine doesn't set the default_output_uri, the default behavior for model checkpoints for example is to just register without uploading. So in the case that the default_output_uri is not defined the offline task folder will not have the artifacts for uploading (not included in the zip file created by offline package).. or am I missing something?
thanks SuccessfulKoala55 , the question arose after trying to follow the instructions you attached. it seems that installing a docker on windows 10 Home is somewhat problematic
So I can avoid running unnecessary common heavy setup, for a light weight experiment
"does not support running with no server connection." this is what I was afraid of..I'll need to figure out if I can use trains at all 😞
and latest pre release hydra
the ok() call seem to crash
much appreciated, thanks!
by WebApp you mean the public online one? I might be confusing stuff
so it sounds like there is no known issue related to this
and I will also be happy to see if I can contribute maybe to this specific feature or maybe others
Hi AgitatedDove14 , regarding the slider feature, do you know when would it be released?
yes I will be happy to, its gonna be my first time
great, I'm going to give it a try
AgitatedDove14 a single experiment, that is being paused and resumed.
inconsistrncy in yhe reporting: when resuming the 10th epoch for example and doing an extra epoch clearml iteration count is wrong for debug images and monitored metrics.. somehow not for the scalar reporting
TypeError: 'bool' object is not callable
to put it a bit differently, I am looking for a way to manually sample and report from and to the optimizer
cool, AgitatedDove14 so just to confirm:
To get the desired behavior, uploading artifacts on import_offline_session
The needed action is: setting the development.default_output_uri in the offline machine run (and nothing else?)