
Reputation
Badges 1
40 × Eureka!By default, i put nothing in the task but when i use a ClearMLSaver like thisClearMLSaver(logger, output_uri="
")
where clearml is my bucket.
Yes, so far i came back to the old adress 🙂
Hello. I think an Issue should at least be opened. Modifications in my code need to be generalized before creating a pull request.
Yes, i was expecting that it was already working like that. So far, i modified the code to set DOCKER_ROOT_CONF_FILE to what i want !!!
I define the HOST ( ENDPOINT) like this but it doesn't change anything
Ok, it works i need to specify :80 also in the output_uri from my python file !
The mount point for the clearml.conf, i.e. '-v', '/tmp/.clearml_agent.qy2xyt21.cfg:/root/clearml.conf'
becauce the docker i use is run as user
and doesn't have acces to \root.
. May be this is defined on cleaml-server side ? I use my own server installed on another linux box using docker-compose.
Yes, i even got a "upload finished" message et the whole process goes to end.
He ask me for credentials for the root server (minio.10.68.0.250.nip) and not for the bucket where it is stored (minio.10.68.0.250.nip/simclr) only this bucket has read/write permission.
I had to set another docker volume because venvs_builds was not properly mounted to. I don't have write permission to the default one too
i use a proxy and the port is 80, i need to write it ?
I don't undestand why after specifying /root/clearml.conf
, a copy is required to /root/default_clearml.conf
. I modified in the code this copy by one that takes a user mounting point and copies it to home directory ~/clearml.conf
only a "upload failed" and no data in my S3 bucket
To your point of view, it may be related with the sdk client that triggers the upload ? with urllib request ?
I take a look a src/app/webapp-common/shared/entity-page/entity-delete/base-delete-dialog.effects.ts.
I see that an error is raised in the mergeMap at line 125, but i'm not familiar enough with Typescript to find why.
I was unable to define FILE_BASE_URL inside the docker container. I modify the HTTP constant in app.constants.ts with hard code values, compile the webapp again (npm) and replace it in my docker container and now it works....
i tried so far but it was not so easy, because there is a python executable "update_from_env" that empties the configuration.json file. So i create a file in /mnt/external_files/configs and my configuration.json was read.
In docker-compose, image was latest allegroai/clearml:latest when i pull docker images. When i launch it, after installation i have in WebApp following informations : "WebApp: 1.3.1-169 • Server: 1.3.1-169 • API: 2.17"
My files (fs) are deleted but i have the same issue as reported by SuperiorPanda77 , with some undefined value that is said not to be deleted. I guess that as my command deleteFileServerSources
works but exit with some strange return value, other commands in the row addFaieldDeletedFiles
and deleteProjectFromRoot
are not executed (file src/app/webapp-common/shared/entity-page/entity-delete/base-delete-dialog.effects.ts
)
No particular information in console( no error), no network error too.
It seems that i should define this variable by the use of an environment variable in ConfigurationService.globalEnvironment.
DeterminedCrab71 You right, if i understand correctly HTTP.FILE_BASE_URL is undefined, then file to delete is describe as "misc" instead of "fc" then i guess system is unable to launch the delete of the file
No, no "delete" request 😞 Is there a way to put a breakpoint just before ?
What i don't understant is the list of artifacts that were not deleted
My artifacts are now deleted but the directories where the artifacts are stored are not deleted.
Files are stored on the same box where the docker is running. And there is a mounting point between file server docker and the host itself.
sudo docker logs clearml-fileserver
This gives no info at all. May be i should increase the log level to debug. The only message i got is about "werkzeug" the default server module of flash that shouldn't be use as production deployement (by the way, why not use gunicorn as entrypoint in docker-compose ?)
When i deleted the Experiment, i obtain a the following window :
The addresses seems strange, is this the hostname?
I use the nip services to have subdomains: clearml.domain api.domain and file.domain that points to the same host.
I think i have my answer, this is hard coded in agent base_cmd += ( (['--name', name] if name else []) + ['-v', conf_file+':'+DOCKER_ROOT_CONF_FILE] + (['-v', host_ssh_cache+':'+mount_ssh] if host_ssh_cache else []) + ...