HollowDeer18
Moderator
6 Questions, 19 Answers
  Active since 10 January 2023
  Last activity 19 days ago

0 Votes 23 Answers 571 Views
(a regular experiment did execute in the remote agent, I only get this with the pipe)
one year ago
0 Votes 1 Answers 571 Views
How can you clone an experiment, but update the commit so it runs with the latest version of the code?
one year ago
0 Votes 0 Answers 775 Views
File "/home/thermo/.clearml/venvs-builds/3.8/code/controller.py", line 20, in <module> from .job import LocalClearmlJob, RunningJob, BaseJob
one year ago
0 Votes 0 Answers 562 Views
Furthermore, if I try to execute a pipe in a remote agent, I get a python error from the clearml engine:
one year ago
0 Votes 0 Answers 556 Views
ImportError: attempted relative import with no known parent package
one year ago
0 Votes 0 Answers 627 Views
Hi,
one year ago
0 (A Regular Experiment Did Execute In The Remote Agent, I Only Get This With The Pipe)

My worker node is not a Docker container; it is a Linux machine running a conda environment.

one year ago

FYI: this is my pipeline script

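from clearml import PipelineController

# Controller task for the pipeline itself
pipe = PipelineController(name="My Pipe", project="Gridsquare-Training", version="0.0.5")
# Both steps are based on the existing task "remo2"; step 2 runs only after step 1
pipe.add_step(name="pipe step 1", base_task_project="Gridsquare-Training", base_task_name="remo2")
pipe.add_step(name="pipe step 2", base_task_project="Gridsquare-Training", base_task_name="remo2", parents=["pipe step 1"])

# Queue on which the pipeline steps are executed
pipe.set_default_execution_queue("myqueue")
# Queue on which the controller itself is executed
pipe.start("service")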

one year ago

In the case of the local Jupyter notebook, I create the pipeline and, when I start it, it all works without needing to add the notebook to git.

one year ago

But if I run exactly the same code from a Python script (which also calls start on the pipeline), the worker node tries to check out the script from git and run it (or fails if you haven't checked it into git yet).

one year ago

Initially, I had only one queue and one worker set up. If the pipeline's 'default execution queue' is the same as the queue used in pipe.start('the queue'), it ends up in a sort of deadlock and waits forever.

one year ago

When I set up two queues and two workers, set the default execution queue to one queue, and use the other queue for pipe.start, it all works.

one year ago
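
The single-queue deadlock described above can be illustrated with a toy model. This is a minimal sketch in plain Python, not ClearML code: the function pipeline_completes and its parameters are hypothetical, and it assumes that the controller keeps its worker busy for the whole pipeline run while each worker handles one task at a time, as the observations in this thread suggest.

```python
from collections import deque


def pipeline_completes(controller_queue, step_queue, worker_queues, max_ticks=50):
    """Toy model (not ClearML): return True if both steps finish, False if stuck.

    worker_queues holds one queue name per worker; each worker runs one task at a time.
    """
    steps = ["pipe step 1", "pipe step 2"]
    queues = {name: deque() for name in {controller_queue, step_queue, *worker_queues}}
    queues[controller_queue].append("controller")
    running = [None] * len(worker_queues)  # task currently held by each worker
    finished = []
    in_flight = None  # step that was enqueued but has not finished yet

    for _ in range(max_ticks):
        # Idle workers pull the next task from the queue they listen to.
        for i, queue_name in enumerate(worker_queues):
            if running[i] is None and queues[queue_name]:
                running[i] = queues[queue_name].popleft()

        for i, task in enumerate(running):
            if task is None:
                continue
            if task == "controller":
                # Assumption: the controller occupies its worker until all steps are done.
                if len(finished) == len(steps):
                    return True
                if in_flight is None:
                    in_flight = steps[len(finished)]
                    queues[step_queue].append(in_flight)
            else:
                # A step finishes within one tick and frees its worker.
                finished.append(task)
                in_flight = None
                running[i] = None
    return False


# One queue, one worker: the controller holds the only worker, so the steps never start.
print(pipeline_completes("myqueue", "myqueue", ["myqueue"]))
# Two queues, two workers: the controller and the steps run on different workers.
print(pipeline_completes("service", "myqueue", ["service", "myqueue"]))
```

In this model the first call never gets past the first step, because the step sits in a queue whose only worker is occupied by the controller; the second call completes.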

But the behavior differs depending on whether you kick it off from a (local) Jupyter notebook or from a Python script.

one year ago

The error occurs on the worker node when it tries to initialize the environment for the pipeline.

one year ago

Hi John, I've done more experiments and found that this only happens if you try to run the pipeline remotely directly from the Python interpreter.

one year ago

I created the pipeline on another machine via an interactive Python shell. The pipeline is picked up by ClearML, as I can see it in the web UI.

one year ago

If I look at the code of ClearML's controller.py, I see that it expects additional code in a relative folder.

one year ago

If I go to the folder mentioned in the error and then one level up, I see no other packages present.

one year ago
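
For reference, the error message itself can be reproduced outside ClearML. The sketch below is a generic Python illustration, not a diagnosis of the agent: demo_pkg and its files are hypothetical stand-ins. It shows that a file containing "from .job import ..." fails with this ImportError when executed as a standalone script, and works when imported as part of its package.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_both_ways():
    """Run the same file once as a bare script and once as a package module."""
    with tempfile.TemporaryDirectory() as tmp:
        pkg = Path(tmp) / "demo_pkg"  # hypothetical package, stands in for the real one
        pkg.mkdir()
        (pkg / "__init__.py").write_text("")
        (pkg / "job.py").write_text("class BaseJob:\n    pass\n")
        (pkg / "controller.py").write_text(
            "from .job import BaseJob\nprint('imported', BaseJob.__name__)\n"
        )

        # Executed as a script: the file has no parent package, so '.job' cannot resolve.
        as_script = subprocess.run(
            [sys.executable, str(pkg / "controller.py")],
            capture_output=True, text=True,
        )
        # Executed as a module inside its package: the relative import resolves.
        as_module = subprocess.run(
            [sys.executable, "-m", "demo_pkg.controller"],
            cwd=tmp, capture_output=True, text=True,
        )
        return as_script, as_module


as_script, as_module = run_both_ways()
print(as_script.stderr.strip().splitlines()[-1])
print(as_module.stdout.strip())
```

If the worker ends up with only a copy of controller.py in its code folder, without the sibling modules around it, running that file directly would fail in the same way. Whether that is what happens here is an assumption.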

The notebook behavior is indeed how I expect it to work; the behavior via the script is strange.

one year ago

Full console log of the worker:

No tasks in Queues, sleeping for 5.0 seconds
No tasks in queue b5fe1e72614247f7a77e5f6cdac35580
No tasks in Queues, sleeping for 5.0 seconds
task 30ad27a7a1244b6e8aa722d81cb6015c pulled from b5fe1e72614247f7a77e5f6cdac35580 by worker NLEIN-315GNH2:0
Running task '30ad27a7a1244b6e8aa722d81cb6015c'
Storing stdout and stderr log to '/tmp/.clearml_agent_out.sppvun4p.txt', '/tmp/.clearml_agent_out.sppvun4p.txt'
Current configuration (clearml_agent v1.4.1, location:...

one year ago