Reputation
Badges 1
533 × Eureka!How do I get all children tasks given a parent?
Oh I get it, that also makes sense with the docs directing this at inference jobs and avoiding GPU - because of the 1-N thing
does the services mode have a separate configuration for base image?
the Task
object has a method called Task.execute_remotely
Look it up here:
https://allegro.ai/docs/task.html#trains.task.Task.execute_remotely
Yeah but I don't get what it is for - for now I have 2 agents, each listening to some queues. I actually ignore the "services" queue until now
I don't get the difference between how I'm using my agents now, just starting them on machines, and making them listen to queues, to using the "services" mode
I manually deleted the allegroai/trains:latest
image, that didn't help either
Wait, suddenly the UI changed to 0.16.1, seems like I was shown a cached page
let me try to docker-compose down --rmi all
I mean if I continue and build on the example in the docs, what will happen if the training
task is completed, and then I get it and log to it? Will it be defined as running again?
a machine that had previous installation, but I deleted the /opt/trains
directory beforehand
I was here, but I can't find info for the questions I mentioned
Can you lend a few a words about how the not-pip freeze mechanism of detecting packages work?
To be clearer - how to I refrain from using the built in file-server altogether - and use MINIO for any storage need?
why not use my user and group?
and in the UI configuration I didn't understand where does permission management came into play
Continuing on this discussion... What is the relationship between configuring files_server
and all the rest we just talked about and the the default_output_uri
?
Thx DangerousDragonfly8 💪
I get this
` [ec2-user@ip-10-0-0-95 ~]$ docker-compose down
WARNING: The TRAINS_HOST_IP variable is not set. Defaulting to a blank string.
WARNING: The TRAINS_AGENT_GIT_USER variable is not set. Defaulting to a blank string.
WARNING: The TRAINS_AGENT_GIT_PASS variable is not set. Defaulting to a blank string.
ERROR: Couldn't connect to Docker daemon at http+docker://localhost - is it running?
If it's at a non-standard location, specify the URL with the DOCKER_HOST environment variable. `
but I can't seem to run docker-compose down
The latest, I curl
ed the docker-compose like 10 minutes ago
CostlyOstrich36 so why 1000:1000? My user and group are not that and so do all the otehr files I have under /opt/clearml