Reputation
Badges 1
39 × Eureka!the parameter must be "imagenet". But when I print the parameter in my code it is imagenet without quotes. But tensorflow needs "imagenet"
I hope you understand what I mean
where can I change it?
when i right click on the cloned project then there is no option to change it.
I need to access to a tfrecord file
but with allegro i do not have access to the folder
Now I ran docker-compose down
But the allegro server is still available.
thank you
it works now
you really helped me
i try to run the agent without docker. Without docker mode the path is available. But i need docker for tensorflow and cuda
thank you for the feedback
TimelyPenguin76 SuccessfulKoala55
I used the line you wrote me. But at the first time I start the program with the command line.
I have still the problem with the demo server.
At the moment it has nothing to do with the clearml-agent.
my clearml.conf:
api_server: http://192.168.40.210:8008
web_server: http://192.168.40.210:8080
files_server: http://192.168.40.210:8081
assert os.path.exists("path")
with this line I get the error that the path does not exist
I removed the trains.conf
first line:
TRAINS Task: overwriting (reusing) task id=8ce7a396ae8c4a14b22186a48ade5d91
sdk {
# ClearML - default SDK configuration
storage {
cache {
# Defaults to system temp folder / cache
default_base_dir: "~/.clearml/cache"
size {
# max_used_bytes = -1
min_free_bytes = 10GB
# cleanup_margin_percent = 5%
}
}
direct_access: [
# Objects matching are considered to be available for direct access, i.e. they will not be downloaded
...
thanks for the info
I have time now
how can I use "volume mount" with allegro?
clearml-agent --config-file /home/chuber/clearml.conf daemon --detached --gpus 1
--queue KA_ML2_GPU1 --docker nvidia/cuda:10.1-cudnn7-devel
first i run it locally. This works. But then I use the clearML agent and then it does not work
thank you for the information.
I am using the same GUI on 2 servers.
On both servers the following path did not exist:/opt/trainsSo I could not stop allegro.
I run the commands on 1 server to upgrade it. But on the gui there is still the old version.
Does anyone know how I can proceed?
Thank you for the answer.
I have 2 different cuda versions.
I need tensorflow 2.2, 2.3, 2.4, 2.5.
For tensorflow 2.2 i need cuda 10.1
But for tensorflow 2.4 i need for example cuda 11.0
https://www.tensorflow.org/install/source#gpu .
For docker I use for example: --docker nvidia/cuda:10.1-cudnn7-runtime-ubuntu18.04
Then tensorflow 2.4 no longer works because tensorflow 2.4 requires cuda 11 and not cuda 10.1
Does anyone have any idea?
I can also pass 2 different docker images?
sorry
I solved the mistake. there was a mistake in my file path and then the training could not be started
AgitatedDove14
do I have to change a parameter in the config file to use .git-credentials?
i will use the 2. option
Does this mean that I need to create multiple ssh keys? 1 key for each user?
thanks for the answer.
I tried it but it did not work.
I have the same error:
fatal: could not read Username for ' http://rz-s-git ': terminal prompts disabled.
The git account have 2 users. I tried a run a different project from the other user and it worked.
The problem is to clone repository from different users.
does anyone know how I can best proceed?
is there a better way instead of creating multiple ssh keys?
ok
but how can I get the hyperparameters from the current task?