Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
How To Configure Clearml Agent To Keep Pods Around After They Finish/Fail? I Want To Debug A Pod That Crashes, But It Gets Deleted Quickly

How to configure ClearML agent to keep pods around after they finish/fail? I want to debug a pod that crashes, but it gets deleted quickly

  
  
Posted 20 days ago
Votes Newest

Answers 6


Hi WobblyFrog79 , you mean when running the agent over K8s?

  
  
Posted 19 days ago

Yes CostlyOstrich36

  
  
Posted 19 days ago

Hey CostlyOstrich36 , could you provide any suggestions here, please?

  
  
Posted 18 days ago

Hi WobblyFrog79 - Please try setting the environment variable CLEARML_K8S_GLUE_DEBUG=1 on the Agent

agentk8sglue:
  extraEnvs:
    - name: CLEARML_K8S_GLUE_DEBUG
      value: "1"

This will make the Agent Pod print the rendered Task Pod template in the logs, so you can see it 🙂

  
  
Posted 17 days ago

Awesome CooperativeKitten94 , will definitely add that. It would also be very helpful if there was a way to delay deleting "completed/failed" pods. This is useful when something fails unexpectedly and ClearML logs are not enough to debug the issue. Does that make sense to you? I could contribute to your codebase if you're interested.

  
  
Posted 17 days ago

Wonderful - We do not have such feature planned for now, feel free to contribute 🙂

  
  
Posted 17 days ago
113 Views
6 Answers
20 days ago
16 days ago
Tags