
Reputation
Badges 1
47 × Eureka!At the end, there is an error about "pip"
How can I change the apiserver from localhost to my machine's IP. I couldn't figure it out. Sorry.
chown: cannot access '/root/.cache/pip': No such file or directory
It gives this error
from trains_agent import APIClient client = APIClient() task = client.tasks.get_by_id(task="5b5cbdc5e7bc4cd5ba39081b5b4a0423") print(task.execution.parameters)
AttributeError: 'Task' object has no attribute 'hyperparams'
Yes, I mean removing agent from the server
AgitatedDove14 Is it possible to delete specified worker? I mean, I have 10 workers and I want to delete one of them?
Yes, I mean trains-agent. Actually I am using 0.15.2rc0. But, I am using local files, I mean I clone trains and trains-agent repos and install them. Their versions are 0.15.2rc0
If I delete this folder that is in ~/.trains/vcs-cache directory, it fixes the problem
AgitatedDove14 I might find something to fix the issue but I am not sure. In trains-agent worker.py script log it is written like that python3 -u -m trains_agent execute --disable-monitoring --id 9fe6d610a2b946379255b0fc25b5f9fd')
so at the end there is an extra " ' ". So when I run this script in my local environment by writing python3 -u -m trains_agent execute --disable-monitoring --id 9fe6d610a2b946379255b0fc25b5f9fd
it works and runs the code. However, if I write ` pytho...
fatal: destination path '/home/dogukan/.trains/vcs-cache/pre-post-script-repo.git.35f82b395021c8e6afef186fafa662cc/pre-post-script-repo.git' already exists and is not an empty directory. Repository cloning failed: Command '['clone', 'git@192.168.1.230:dogukan/pre-post-script-repo.git', '/home/dogukan/.trains/vcs-cache/pre-post-script-repo.git.35f82b395021c8e6afef186fafa662cc/pre-post-script-repo.git', '--quiet', '--recursive']' returned non-zero exit status 128. Cannot find output file
Okey, know it is 2.9, I misunderstood you.
I mean, you know in trains github, there are examples and when I deploy the server, these examples are exist in server with draft status. So, I want to add my examples in the same way.
Hi TimelyPenguin76 , My version is 0.15.2rc0 and I am running with this command; trains-agent daemon --detached --gpus 0 --queue default --docker nvidia/cuda --foreground
I will install from repository and try it again, my trains-agent version is 0.15.2rc0, but I could not find trains version, it might be about version
AgitatedDove14 /usr/bin/python3
/usr/local/bin/trains-agent
-e git+
torch == 1.5.1 torchvision == 0.6.1 trains == 0.15.2rc0
Actually, package version is also written. However, because of git ref, trains-agent fails.
I think you can reproduce it by cloning the trains repository, then pip install -e ~/trains
and then you need to run one of the examples from trains examples by writing python3 toy_base_task.py
. Then you should see this odd bug.
It worked when I changed python3 -m trains-agent --help
to trains-agent --help
Btw, we figure out that '
is belong the echo. So there is no problem with that one.
/usr/bin/python3: No module named trains-agent
Because of the error I thought, I run first command, but I run edited version. It gives this error (That's why, it takes time 😞 )
Thanks a lot for the last information. It worked 😄
Actually, I want to retrieve metrics from code and I thought that these metrics might be stored somewhere in the local folders so that I can access them via code. You know the log file is created and stored in /tmp folder.
It worked. Thanks a lot 🙂