Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
If I Clone A Task, I Suppose All Artifacts Are Not Cloned With It, Even If They Are Registered, Right?

If I clone a task, I suppose all artifacts are not cloned with it, even if they are registered, right?

  
  
Posted 2 years ago
Votes Newest

Answers 17


Yes that's what I thought, thanks for confirming.

  
  
Posted 2 years ago

Yeah, and just thinking out loud what I like about the numpy/pandas documentation

  
  
Posted 2 years ago

Yes, thanks AgitatedDove14 ! It's just that the configuration object passed onwards was a bit confusing.
Is there a planned documentation overhaul? 🤔

  
  
Posted 2 years ago

Basically when running remotely, the first argument to any configuration (whether object or string, or whatever) is ignored, right?

  
  
Posted 2 years ago

Generally, really. I've struggled recently (and in the past), because the documentation seems:
Very complete wrt available SDK (though the formatting is sometimes off) Very lacking wrt to how things interact with one anotherA lot of what I need I actually find from pluging into the source code.
I think ClearML would benefit itself a lot if it adopted a documentation structure similar to numpy ecosystem (numpy, pandas, scipy, scikit-image, scikit-bio, scikit-learn, etc)

  
  
Posted 2 years ago

First bullet point - yes, exactly

Second bullet point - all of it, really. The SDK documentation and the examples.
For example, the Task object is heavily overloaded and its documentation would benefit from being separated into logical units of work. It would also make it easier for the ClearML team to spot any formatting issues.
Any linked example to github is welcome, but some visualization/inline code with explanation is also very much welcome.

  
  
Posted 2 years ago

e.g. a separate structured user guide with common tips, usability, best practices - https://pandas.pydata.org/pandas-docs/stable/user_guide/index.html
vs the doc, where each function is its own page, e.g.
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html

  
  
Posted 2 years ago

I'm not sure about the intended use of connect_configuration now.
I was under the assumption that in connect_configuration(configuration, name=None, description=None) , the configuration is only used in local execution.
But when I run config = task.connect_configuration({}, name='General') (in remote execution), the configuration is set to the empty dictionary

  
  
Posted 2 years ago

I'm not sure about the intended use of

connect_configuration

now.

Basically here is the rationale behind it:
I have a config file that I want to log on the Task, and I Also want to be able to change this configuration file externally when launching using an agent (i.e. edit the content) I have a nested dictionary that I do not want to flatten and push as hyper-parameters because it is not very readble, so I want to store it in a more human readable form and edit it as config file

...

Task.set_configuration_object

, etc.

Basically do not call set_configuration_object directly, instead use connect_configuration. The main difference is that "connect" as a concept is a two way conneciton, when runnign manually log on the Task/backend, when running via agent, get the values from the Task/backend into the code. set_xyz calls are always setting the Task's/backend value
Make sense ?

  
  
Posted 2 years ago

Very lacking wrt to how things interact with one another

If I'm reading it correctly, what you are saying is that some of the "big picture" / holistic approach on how different parts interact with one another is missing, is that correct?

I think ClearML would benefit itself a lot if it adopted a documentation structure similar to numpy ecosystem

Interesting thought, what exactly would you suggest we "borrow" in terms of approach?

  
  
Posted 2 years ago

Hi UnevenDolphin73 , yes this is correct. Artifacts are outputs so they are all removed during a clone or reset

  
  
Posted 2 years ago

For example, the

Task

object is heavily overloaded and its documentation would benefit from being separated into logical units of work. It would also make it easier for the ClearML team to spot any formatting issues.

This is a very good point (the current documentation is basically docstring, but we should create a structured one)

... but some visualization/inline code with explanation is also very much welcome.

I'm assuming this connected with the previous point?

  
  
Posted 2 years ago

Sorry, found it on my end!

  
  
Posted 2 years ago

Yes that makes total sense to me. How about a GitHub issue on the clearml-docs ?

  
  
Posted 2 years ago

There used to be a good example but it's now missing. I'm not sure what does Use only for automation (externally), otherwise use Task.connect_configuration mean when e.g. looking at Task.set_configuration_object , etc.

Could you clarify a bit, CostlyOstrich36 or AgitatedDove14 ?

  
  
Posted 2 years ago

Basically when running remotely, the first argument to any configuration (whether object or string, or whatever) is ignored, right?

Correct 🙂

Is there a planned documentation overhaul?

you mean specifically for the connect_configuration ? or in general on the connect approach rationale ?

  
  
Posted 2 years ago