
AgitatedDove14 Is it possible to delete a specific worker? I mean, I have 10 workers and I want to delete just one of them.
Oops, you misunderstood me. I just want to remove a specific agent. For example, I have 3 agents on the same queue with different worker names. If I remove them using what you suggested in this thread, all of them will be removed. However, I want to remove only one of them.
I mean, you know the trains GitHub repo has examples, and when I deploy the server, these examples exist on the server with draft status. I want to add my own examples in the same way.
fatal: destination path '/home/dogukan/.trains/vcs-cache/pre-post-script-repo.git.35f82b395021c8e6afef186fafa662cc/pre-post-script-repo.git' already exists and is not an empty directory.
Actually, the error occurs because of this line. I first run my experiment in docker, which creates a folder in the vcs-cache that only root has permission to access. Then I run my experiment in a venv, which cannot access this folder and cannot create a new one because the name is already taken.
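A minimal sketch of how this situation could be detected, assuming the cache lives under ~/.trains/vcs-cache as in the error above. The helper name find_foreign_cache_entries is hypothetical and not part of trains-agent; it only lists cache entries that the current user does not own or cannot write to (POSIX only).

```python
import os
from pathlib import Path


def find_foreign_cache_entries(cache_root):
    """Return cache entries the current user does not own or cannot write to."""
    root = Path(cache_root).expanduser()
    if not root.is_dir():
        return []
    blocked = []
    for entry in sorted(root.iterdir()):
        owned = entry.stat().st_uid == os.getuid()  # POSIX-only ownership check
        writable = os.access(entry, os.W_OK)
        if not (owned and writable):
            blocked.append(entry)
    return blocked


if __name__ == "__main__":
    # Path taken from the error message in this thread.
    for path in find_foreign_cache_entries("~/.trains/vcs-cache"):
        print("not writable by current user:", path)
```

An entry created by a docker run as root would be expected to show up in this list when the script is run as a regular user.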
AgitatedDove14, sorry for my late response. I will try it; it might work. Thanks.
If I delete this folder from the ~/.trains/vcs-cache directory, the problem is fixed.
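The workaround above can be sketched as a small script, assuming the cache folder names start with the repository name (as in pre-post-script-repo.git.35f8... from the error). The function clear_repo_cache is a hypothetical helper, not a trains-agent command.

```python
import shutil
from pathlib import Path


def clear_repo_cache(repo_name, cache_root="~/.trains/vcs-cache"):
    """Delete cached clones whose folder name starts with repo_name."""
    root = Path(cache_root).expanduser()
    removed = []
    if not root.is_dir():
        return removed
    for entry in root.iterdir():
        if entry.is_dir() and entry.name.startswith(repo_name):
            # Raises PermissionError if the folder is owned by root;
            # in that case it has to be removed with elevated privileges.
            shutil.rmtree(entry)
            removed.append(entry.name)
    return sorted(removed)
```

Calling clear_repo_cache("pre-post-script-repo.git") would remove only the cached clones of that one repository and leave other cache entries alone.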
Actually, I want to retrieve metrics from code, and I thought these metrics might be stored somewhere in local folders so that I can access them programmatically. You know, the log file is created and stored in the /tmp folder.
Hi TimelyPenguin76, my version is 0.15.2rc0 and I am running it with this command: trains-agent daemon --detached --gpus 0 --queue default --docker nvidia/cuda --foreground
AttributeError: 'Task' object has no attribute 'hyperparams'
I will install from the repository and try again. My trains-agent version is 0.15.2rc0, but I could not find the trains version; it might be a version issue.
Yes, I mean trains-agent. I am actually using 0.15.2rc0, but from local files: I cloned the trains and trains-agent repos and installed them. Both are version 0.15.2rc0.
Yes, I mean removing an agent from the server.
Yes, it is fixed if I install trains directly from the GitHub repo without using my local version. But is there another way to avoid it? I might change the code for my personal use, which is why I want to install trains from my local copy.
fatal: destination path '/home/dogukan/.trains/vcs-cache/pre-post-script-repo.git.35f82b395021c8e6afef186fafa662cc/pre-post-script-repo.git' already exists and is not an empty directory.
Repository cloning failed: Command '['clone', 'git@192.168.1.230:dogukan/pre-post-script-repo.git', '/home/dogukan/.trains/vcs-cache/pre-post-script-repo.git.35f82b395021c8e6afef186fafa662cc/pre-post-script-repo.git', '--quiet', '--recursive']' returned non-zero exit status 128.
Cannot find output file
-e git+
torch == 1.5.1
torchvision == 0.6.1
trains == 0.15.2rc0
Actually, the package version is also listed. However, because of the git reference, trains-agent fails.
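A rough sketch of how the two kinds of lines could be told apart before handing a requirements list to the agent. This is not how trains-agent itself processes requirements; split_requirements is a hypothetical helper that only separates editable or VCS references (lines starting with "-e " or "git+") from regular pinned packages.

```python
def split_requirements(lines):
    """Separate editable/VCS references from regular requirement lines."""
    vcs_refs, regular = [], []
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("-e ") or line.startswith("git+"):
            vcs_refs.append(line)
        else:
            regular.append(line)
    return vcs_refs, regular
```

With the lines quoted above, the "-e git+" entry would land in the first list and the three pinned packages in the second, which makes it easy to see which entry is the one that needs attention.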
As you said, in order to see my code in the server, I need to run it. So what I am asking is: is there any way to see my code in the server without running it?
I think you can reproduce it by cloning the trains repository, then running pip install -e ~/trains, and then running one of the trains examples with python3 toy_base_task.py. Then you should see this odd bug.
I couldn't explain it well. Assume I have a huge GitHub repository with 100 ML projects, and I want to see all of them in the trains server. Should I add Task.init() and run them all in order to see them in the server, or is there another way to see all of them without running them?
TimelyPenguin76 It is 2.1, but 2.9 files also exist. How can I update it?
I was using APIClient inside of trains-agent and in trains-agent there is no v2.9. I think that's why I could not get hyperparams
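A small sketch of the version check implied here. The 2.9 threshold is an assumption taken from this thread, not from the server documentation, and both function names are hypothetical. Versions are compared as integer tuples because plain string comparison would rank "2.10" below "2.9".

```python
def parse_api_version(text):
    """Turn a version string such as '2.9' into a tuple of ints: (2, 9)."""
    return tuple(int(part) for part in text.strip().split("."))


def supports_hyperparams(api_version, minimum="2.9"):
    """Assumption from this thread: hyperparams needs API version 2.9 or newer."""
    return parse_api_version(api_version) >= parse_api_version(minimum)
```

Under that assumption, supports_hyperparams("2.1") is False, which matches the AttributeError seen with the older client.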
Okay, I got it now. Thanks for the help 🙂
print(task.data.hyperparams)
I tried this one
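A hedged sketch of guarding that attribute access so an older API version returns None instead of raising AttributeError. The objects used for illustration are plain SimpleNamespace stand-ins, not real trains Task objects, and read_hyperparams is a hypothetical helper.

```python
from types import SimpleNamespace


def read_hyperparams(task):
    """Return task.data.hyperparams, or None when the attribute is missing."""
    data = getattr(task, "data", None)
    return getattr(data, "hyperparams", None)


# Stand-ins: one shaped like an older response without hyperparams, one with it.
old_task = SimpleNamespace(data=SimpleNamespace())
new_task = SimpleNamespace(data=SimpleNamespace(hyperparams={"lr": 0.1}))
```

Here read_hyperparams(old_task) gives None while read_hyperparams(new_task) gives the stored dictionary, so the caller can decide what to do when the field is not available.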
Okay, now it is 2.9. I misunderstood you.