Reputation
Badges 1
52 × Eureka!I just ran a dummy experiment logging images, plots, etc and I can see them in my server’s Web UI.
so it looks like the server is there (docker ps), I can see the artifacts (web ui), but not sure where things are as per documentation there is no /mnt/fileserver
(?)
I see this in the docker-compose.yml
file:
fileserver:
networks:
- backend
- frontend
command:
- fileserver
container_name: clearml-fileserver
image: allegroai/clearml:1.12.1-397
environment:
CLEARML__fileserver__delete__allow_batch: "true"
restart: unless-stopped
volumes:
- /opt/clearml/logs:/var/log/clearml
- /opt/clearml/data/fileserver:/mnt/fileserver
- /opt/clearml/config:/opt/clearml/config
ports:
- "8081:...
I am not a docker expert but am I correct to say that here the ‘/mnt/fileserver’ is the container path rather than the source path?
if that were the case it explains why I see /opt/clearml/data/fileserver
but no /mnt/fileserver
….
ok so the documentation is confusing here:
there under fileserver it should read /opt/clearml/data/fileserver
sorry I am a noob not sure how can do that but happy to help if I can
@<1523701205467926528:profile|AgitatedDove14> None
from this video tutorial None :
“…the name of the hyperparameter consist of the section is reported to followed by a slash then its name…”
So following that confuses me because I can’t see my Hydra parameters under Hyperparameters > Hydra
and this is why I thought, ok well, perhaps use OmegaConf/params.batch_size
Is this another opportunity to improve the documentation? Happy to help if so.
What I am referring to is this information about the Storage Configuration:
None
Hey @<1523701205467926528:profile|AgitatedDove14> in the WebUI the hydra configuration object is under CONFIGURATION OBJECTS > OmegaConf
So should this be OmegaConf/trainer.batch_size
?
so it’s not intuitive to me to try Hydra/params.batch_size
I will try it nonetheless as you suggested.
hmm….. probably simpler/cleaner if I do
hpo_params = {
'param1':cfg.param_1, ...
}
task.connect(hpo_params)
Thoughts?
Hi @<1523701087100473344:profile|SuccessfulKoala55> it’s failing again.. I haven’t rebooted the agent or changed anything and I am able to connect with ssh with ssh -vT
git@github.com on a different tmux sess.
This is the error I am seeing running the agent with the -debug
flag:
Using cached repository in "/home/ubuntu/.clearml/vcs-cache/clearml-tutorial.git.e1c2351b09f3d661b6f0dbf85e92be2e/clearml-tutorial.git"
git@github.com: Permission denied (pub...
3fdcf5db64d allegroai/clearml:1.12.1-397 “/opt/clearml/wrappe…” 10 days ago Up 9 minutes 8008/tcp, 8080/tcp, 0.0.0.0:8081->8081/tcp, :::8081->8081/tcp clearml-fileserver
I can’t see anything under /mnt
so no fileserver there (?)
Thanks @<1523701205467926528:profile|AgitatedDove14> reading …
This is what I see:
Responding to my own question, in case someone else has the same issue. You have to edit the security group and enable TCP 8080.
I haven’t figure out the missing fileserver? :man-shrugging:
Do you have any insights on the missing fileserver @<1523701205467926528:profile|AgitatedDove14> ?