Reputation
Badges 1
94 × Eureka!because when I run that normally it differentiates workers basing on gpu that it is using
Hi SuccessfulKoala55
I commented about temporary solution for #828
https://github.com/allegroai/clearml/issues/828
I'll let it up to your decision whether it should be closed
I host the code on my Github
The use case was that server with repo wasn't responding for a while and I was thinking how to solve that. Thanks for the answer!
Can I do this to specify which worker should execute that task?CLEARML_WORKER_NAME=<worker_name> clearml-agent execute --id <task_id>
What is interesting, it works when using virtual environment setup
But stucks at the same moment when using docker
Ubuntu 21.10 to be concrete
there is no such option
CostlyOstrich36 have you ever seen something like my case maybe?
So seems like this dictionary works with strings
So there is no way to use Agent without use of remote repo (just using local server not connected to Internet), am I right?
Because it has no coincidence with some specific actions
or at least I can't specify such
SuccessfulKoala55 So, we have two problems:
Probably minor one, but strange. We run some number of workers using given compose file, that is attached in .zip. We can do:docker compose -f docker-compose-worker.yaml build docker compose -f docker-compose-worker.yaml upand in theory there should be 10 agents running, but frequently, not 10 are shown in UI (for example on last run we got 3 of them). When we run htop , we can see 10 agents in our system. What is even more strange, those...
version 1.8.1
No, there are no error messages. The behaviour is just very strange (or even incorrect)
Suppose that this is a task that is cloned:
` base_task = replacement_task.create_function_task(
func=some_func, # type: Callable
func_name=f'func_id_run_me_remotely_nr', # type:Optional[str]
task_name=f'a func task', # type:Optional[str]
# everything below will be passed directly to our function as arguments
some_argument=message,
some_argument_2=message,
rand...
I haven't change any port mapping
WARNING: You are using pip version 20.1.1; however, version 21.3.1 is available.You should consider upgrading via the '/usr/bin/python3 -m pip install --upgrade pip'command.Retrying (Retry(total=239, connect=239, read=240, redirect=240, status=240)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at
` 0x7faf9da78400>: Failed to establish a ...
The problem is that we have a a complex configuration of pipeline. Configuration changes quite frequently and we would not like to run the pipeline every time configuration changes, but we would like to have it scheduled in some defined periods.
Do you have an idea of some workaround / alternative solution for that problem?
Agent works when I am running it from virtual environment but stucks in the same place all the time when I using Docker
clearml_agent: ERROR: Instance with the same WORKER_ID [our_machine:gpu0] is already running