Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi, I Have Several Long Running Experiments Failing With

Hi, I have several long running experiments failing with Process failed, exit code -9 and no other error with clearml 1.0.4 and clearml-agent 1.0.0, what could be the reason?
I went to check the agent logs on the machine and it says: DONE: Running task 'a2e3442178b5ee0d72f7a17e6648f953', exit status -9

  
  
Posted 2 years ago
Votes Newest

Answers 3


AgitatedDove14 After investigation, another program on the machine consumed all the memory available, most likely making the OS killing the agent/task

  
  
Posted 2 years ago

That makes total sense, this is exactly an OS scenario for signal 9 🙂

  
  
Posted 2 years ago

Hi JitteryCoyote63
Signal 9 is killed signal, could it be someone killed the process ? Do you have other logs to share ? Is this reproducible ?

  
  
Posted 2 years ago
533 Views
3 Answers
2 years ago
one year ago
Tags