Console output and also what you get on the ClearML task page under the console section
Just to make sure, did the logging to the clearml server work previously and stoped working at some point?
When the script is hung at the end the experiment says failed in ClearML
So I am only seeing values for the first epoch. It seems like it does not track all of them so maybe something is happening when it tries to log scalars.
I have seen it only log iterations but setting task.set_initial_iteration(0)
seemed to fix that so it now seems to be logging the correct epoch
Tensorboard is correct and works. I have never seen an issue in the tensorboard logs
Yes I see it in the terminal on the machine
What happens if you're running the reporting example from the ClearML github repository?
Yes it is logging to the console. The script does hang whenever it completes all the epochs when it is having the issue.
The console logging still works. Aborting the task was in the log but did not work and the process continued until I killed it.
Not sure why that is related to saving images