Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi, I Am A Pro Account User (Just Upgraded Today). I Currently Have A Queue Name "Services" And 2 Workers Are Listening To This Queue, But Only One Is Pulling Tasks From The Queue. What Am I Missing?

Hi, I am a pro account user (just upgraded today). I currently have a queue name "services" and 2 workers are listening to this queue, but only one is pulling tasks from the queue. What am I missing?

  
  
Posted 2 years ago
Votes Newest

Answers 13


AbruptWorm50 - does the issue still occur, or did you manage to resolve it?

  
  
Posted 2 years ago

Looking at the 2nd image you sent, I see that in addition to "services" queue, you also have queues called "training" and "training*_2" - and the experiments you circled is in the "training" queue. In that image - there are no experiments in the services queue.
If you press on the "services" queue (like you did in the first image) you can view the experiments in the queue and the workers. Can you check if there is a situation where there are tasks pending in that queue while one of the workers is idle?

  
  
Posted 2 years ago

CumbersomeCormorant74 As you can see in the attached - there were 2 experiments at the same time, but only one agent pulled the task, even though the second agent was free and listening to the queue.

  
  
Posted 2 years ago

SuccessfulKoala55 I can't share the logs.

  
  
Posted 2 years ago

AbruptWorm50 - the agent poll the queue, so any free agent can pull tasks. From the graph on the right, it looks like experiments were not waiting in the queue (max experiments is 1, and it was immediately pulled). Can you also check what happens if two experiments are enqueued at the same time?

  
  
Posted 2 years ago

I will elaborate on the situation:
I have 2 agents - training and training_2. They are both listening to the services queue, but only 'training' pulls the tasks. At the beginning I had 2 tasks in the services queue. Then, agent 'training' pulled one and is currently executing it, but for some reason - it also pulled the 2nd task into it's queue, that is although this agent is not free and I have another agent which is: 'training_2'.

  
  
Posted 2 years ago

AbruptWorm50 - just to make sure there is no misunderstanding - the last image you sent is on the "training" queue and not on the "services" queue. Are there free agents running on that queue?

  
  
Posted 2 years ago

Can you check if there is a situation where there are tasks pending in that queue while one of the workers is idle?

In what queue ? In services there are no pending tasks because they were all pulled by 'training'.

just to maker sure, how do you start the agents? Are you using the

--services-mode

option?

I used clearml-agent daemon --detached --services-mode --queue services --docker ubuntu:18.04 --cpu-only

  
  
Posted 2 years ago

Wait, so you have two agents:
one running in normal mode and monitoring the training queue another running in the services mode, monitoring the services queue?

  
  
Posted 2 years ago

We have been trying to resolve the issue. I will comment here again if any more problems arise. Thanks!

  
  
Posted 2 years ago

AbruptWorm50 just to maker sure, how do you start the agents? Are you using the --services-mode option?

  
  
Posted 2 years ago

Hi AbruptWorm50 ,
Should be working - this is basically only up to the agents... Can you perhaps share the agents' logs?

  
  
Posted 2 years ago

Just to clarify again - when I start the agents I run :
clearml-agent daemon --detached --queue training
and then: clearml-agent daemon --detached --services-mode --queue services --docker ubuntu:18.04 --cpu-only
This is why there are 'training' and 'training_2' queues.

  
  
Posted 2 years ago
970 Views
13 Answers
2 years ago
one year ago
Tags