sorry
I solved the mistake. there was a mistake in my file path and then the training could not be started
Hi UnsightlySeagull42
How can I reproduce this behavior ?
Are you getting all the console logs ?
Is it only the Tensorboard that is missing ?
thanks.
i tried 1.0.4rc0 but get the same error.
Output from allegro:
2021-06-01 15:51:59.984367: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2)
2021-06-01 15:52:00.019168: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 3399905000 Hz
2021-06-01 15:52:00.683090: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublas.so.11
2021-06-01 15:52:01.099961: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libcublasLt.so.11
ClearML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start
I run it for 2 hours but it did not train
Hi UnsightlySeagull42
Could you test with the latest RCpip install clearml==1.0.4rc0
Also could you provide some logs?