Reputation
Badges 1
40 × Eureka!I tried to modify all docker_internal_mounts point but the mount point for clearm.conf file still remains the same. May be it is defined on server side ?
No particular information in console( no error), no network error too.
My files (fs) are deleted but i have the same issue as reported by SuperiorPanda77 , with some undefined value that is said not to be deleted. I guess that as my command deleteFileServerSources works but exit with some strange return value, other commands in the row addFaieldDeletedFiles and deleteProjectFromRoot are not executed (file src/app/webapp-common/shared/entity-page/entity-delete/base-delete-dialog.effects.ts )
I think i have my answer, this is hard coded in agent base_cmd += ( (['--name', name] if name else []) + ['-v', conf_file+':'+DOCKER_ROOT_CONF_FILE] + (['-v', host_ssh_cache+':'+mount_ssh] if host_ssh_cache else []) + ...
I had to set another docker volume because venvs_builds was not properly mounted to. I don't have write permission to the default one too
But after all these modifications, i succeeded in using the clearml-agent. 👍 Great job ! Thank to the clearml team.
The mount point for the clearml.conf, i.e. '-v', '/tmp/.clearml_agent.qy2xyt21.cfg:/root/clearml.conf' becauce the docker i use is run as user and doesn't have acces to \root. . May be this is defined on cleaml-server side ? I use my own server installed on another linux box using docker-compose.
Most of the time it is due to a bad parsing of the ip adress. You need to be sure the ip adress is correctly parsed and for this, i need to specify the port used for my minio server even it is a standard http (80) port. Then 'address:80' works but not "address" alone
In docker-compose, image was latest allegroai/clearml:latest when i pull docker images. When i launch it, after installation i have in WebApp following informations : "WebApp: 1.3.1-169 • Server: 1.3.1-169 • API: 2.17"
i tried so far but it was not so easy, because there is a python executable "update_from_env" that empties the configuration.json file. So i create a file in /mnt/external_files/configs and my configuration.json was read.
It seems that i should define this variable by the use of an environment variable in ConfigurationService.globalEnvironment.
Yes, i was expecting that it was already working like that. So far, i modified the code to set DOCKER_ROOT_CONF_FILE to what i want !!!
Files are stored on the same box where the docker is running. And there is a mounting point between file server docker and the host itself.
i use a proxy and the port is 80, i need to write it ?
I define the HOST ( ENDPOINT) like this but it doesn't change anything
By default, i put nothing in the task but when i use a ClearMLSaver like thisClearMLSaver(logger, output_uri=" ")where clearml is my bucket.
for clearml package i use the 1.4.1.
I was unable to define FILE_BASE_URL inside the docker container. I modify the HTTP constant in app.constants.ts with hard code values, compile the webapp again (npm) and replace it in my docker container and now it works....
I don't undestand why after specifying /root/clearml.conf , a copy is required to /root/default_clearml.conf . I modified in the code this copy by one that takes a user mounting point and copies it to home directory ~/clearml.conf
What i don't understant is the list of artifacts that were not deleted
I take a look a src/app/webapp-common/shared/entity-page/entity-delete/base-delete-dialog.effects.ts.
I see that an error is raised in the mergeMap at line 125, but i'm not familiar enough with Typescript to find why.
DeterminedCrab71 You right, if i understand correctly HTTP.FILE_BASE_URL is undefined, then file to delete is describe as "misc" instead of "fc" then i guess system is unable to launch the delete of the file
He ask me for credentials for the root server (minio.10.68.0.250.nip) and not for the bucket where it is stored (minio.10.68.0.250.nip/simclr) only this bucket has read/write permission.
No, no "delete" request 😞 Is there a way to put a breakpoint just before ?
Yes, i even got a "upload finished" message et the whole process goes to end.
sudo docker logs clearml-fileserver
This gives no info at all. May be i should increase the log level to debug. The only message i got is about "werkzeug" the default server module of flash that shouldn't be use as production deployement (by the way, why not use gunicorn as entrypoint in docker-compose ?)
My artifacts are now deleted but the directories where the artifacts are stored are not deleted.
My configuration.json is { "fileBaseUrl": " http://file.10.68.0.250.nip.io "}, but HTTP.FILE_BASE_URL still remains undefined . Something is missing ?
When i deleted the Experiment, i obtain a the following window :
I saw this part of the configuration file but i don't know exactly which key is used as mount point for configuration file ?