Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
AttractiveHawk17
Moderator
7 Questions, 36 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

21 × Eureka!
0 Votes
11 Answers
504 Views
0 Votes 11 Answers 504 Views
Hi everyone, I'm using the https://api.clear.ml/ server and ran a bunch of experiments using hydra multirun (sequential runs). Many of these experiments appe...
one year ago
0 Votes
13 Answers
476 Views
0 Votes 13 Answers 476 Views
anyone seeing these errors?
one year ago
0 Votes
2 Answers
508 Views
0 Votes 2 Answers 508 Views
Hi everyone, qq: I specified the output_uri of the Task instance to be an s3 bucket and any artifact logged explicitly is indeed uploaded there. However, all...
one year ago
0 Votes
3 Answers
511 Views
0 Votes 3 Answers 511 Views
Hi everyone, I've seen that when re-running a script it sometimes overwrites a previous task in the dashboard instead of creating a new task. How does clearm...
one year ago
0 Votes
4 Answers
560 Views
0 Votes 4 Answers 560 Views
Hi everyone, im trying to use the aws autoscaler service. Provided the pac but is not able to clone the repo. It is not using the pac (using gitlab)
one year ago
0 Votes
4 Answers
538 Views
0 Votes 4 Answers 538 Views
one year ago
0 Votes
4 Answers
557 Views
0 Votes 4 Answers 557 Views
hi everyone, two questions: 1- How does clearml figures out the environment? for example in my env I have this qiskit 0.37.2 qiskit-aer 0.10.4 qiskit-experim...
one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

Im using the latest version of clearml and clearml-agenst and im seeing the same error

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

actually I really need help with this, ive been struggling for 2 days to make the aws autoscaler work.
what I want:
do a multirun with hydra where each of the runs get executed remotely

my implementation (iterated over several using create_function_task
, etc:

` @hydra.main(config_path="configs", config_name="ou_cvae")
def main(config: DictConfig):
curr_dir = Path(file).parent
if config.clearml.enabled:
# Task.force_requirements_env_freeze(requirements_file=str(cur...

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

I find this error if I try to run any of the runs generated
clearml_agent: ERROR: Could not find task id=a270d2a56feb475181ef3c9c82111b7f (for host: some_secret_host) Exception: __init__() got an unexpected keyword argument 'types'

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

waiting now for the run...

but I still have the problem if I try to run locally for debugging purposes clearml-agent execute --id ...

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

my bad :man-facepalming: the hydra error is because the data config folder is not commited (gitignore)

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

multirun is not working as expected
when I run python run.py -m env=gpu clearml.task_name=demo_all_models "model=glob(*)"
it should run remotely one run per model
this is the output I see locally
` ╰─ python run.py -m env=gpu clearml.task_name=demo_all_models "model=glob(*)"
2022/09/13 20:49:31 WARNING mlflow.utils.autologging_utils: You are using an unsupported version of pytorch. If you encounter errors during autologging, try upgrading / downgrading pytorch to a supported version, or...

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

still the same result. What's strange is that the remote jobs, as soon as they are launched, if I compare their configs while in state pending, they have the right all different configs, but later, while running, they all revent to the same config by the end

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

Yes, so here you have the three task (here is a slight refactor using task_func instead of task but the result is the same)

1- all different (status pending)
2- two equal (those which started)
3- all equal (all running or completed)

one year ago
0 Hi Everyone, Two Questions: 1- How Does Clearml Figures Out The Environment? For Example In My Env I Have This

1.- The script im running uses qiskit.providers but as installed by when you install qiskit. If you try to install the submodules independently, it doesnt work. How do I use the full environment instead? cannot find this in the documentation. Also, I cannot configure the agents it seems because im using the aws autoscaler service so I dont spin them explicitly.

2.- My workflow would be that I usually, locally I run multiple sequential experiments using hydra multirun. What I want is th...

one year ago
0 Hi Everyone, I'M Using The

it doesnt happen with all the tasks of the multirun as you can see in the photo

one year ago
one year ago
0 Hi Everyone, I'M Using The

dont think will be reproducible with the hydra example. It was just that I launched like 50 jobs and some of them because of the parameters maybe failed (strangely with no error).
But is ok for now I guess, will debug wether those experiments that failed would failed if ran independently as well

one year ago
0 Anyone Seeing These Errors?

is an integer when it exists

one year ago
0 Anyone Seeing These Errors?

when adding a custom column to the table view from a param value. Maybe it happens because that param is not relevant for all the tasks in the table? it shouldnt through an error though, just show an empty value for the runs where is not relevant.

one year ago
0 Anyone Seeing These Errors?

is your server. Will check that example later

one year ago
0 Anyone Seeing These Errors?

it also happens with other configuration values like this one which is a boolean. I think it happens in general with configuration values that are passed in your run command as flags (using the override syntax of hydra)

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

I guess one solution would be to write a clearml https://hydra.cc/docs/advanced/plugins/overview/ for hydra, like the one with ray.
I leave it here though for now (end of POC)

one year ago
0 Hi Everyone, Just A Simple Question What Does A Clearml Task Count As An Iteration? Is It The Number Of Logs I’Ve Done?

if you are doing logs i imagine these are done using Logger.report_scalar if so. iteration is an argument of that method

one year ago
0 Hi Everyone, Two Questions: 1- How Does Clearml Figures Out The Environment? For Example In My Env I Have This

basically running_locally() ok, I think I have everything I need. Will give it a try.

one year ago
0 Hi Everyone, I'M Using The

im running them with python my_script.py -m my_parameter=value_1,value_2,value_3 (using hydra multirun)

one year ago
0 Hello! I Get The Following Error In Results->Console After A Task Is Sent For Remote Execution (Using Sdk):

` ─ python run.py -m env=gpu clearml.task_name=connect_test "model=glob(*)" trainer_params.max_epochs=5
2022/09/14 01:10:07 WARNING mlflow.utils.autologging_utils: You are using an unsupported version of pytorch. If you encounter errors during autologging, try upgrading / downgrading pytorch to a supported version, or try upgrading MLflow.
/Users/juan/mindfoundry/git_projects/cvae/run.py:38: UserWarning:
The version_base parameter is not specified.
Please specify a compatability version level...

one year ago
one year ago
0 Hello Everyone, How Do I Tell The Agent That It Needs To Install A Local Module Of The Repo? If I Put Git+<Repopath> In The Requirements It Will Install The Module Version In The Repo And Not Necessarily The Version That Launched The Task. I Basically Wan

ok, yes I mean the branch im working on. I can assume I;ve pushed it. So ill be using something like

def get_package_url() -> str: repo = Repo(Path(__file__).parent) branch_name = repo.active_branch.name remote_url = repo.remote().url return f"git+ssh://{remote_url.replace(':', '/')}@{branch_name}"and
Task.add_requirements("my_package", "@ {get_package_url()}")

one year ago
0 Hi Everyone, Im Trying To Use The Aws Autoscaler Service. Provided The Pac But Is Not Able To Clone The Repo. It Is Not Using The Pac (Using Gitlab)

im trying to use https://clear.ml/docs/latest/docs/webapp/applications/apps_aws_autoscaler .
In the setup, I have to provide a personal access token (PAC) from git.
The agents when setting up the env to run the tasks from the queue cannot clone the repo using the pac

cloning: git@gitlab.com:<redacted>.git Using user/pass credentials - replacing ssh url 'git@gitlab.com:<redacted>.git' with https url ' ` <redacted>.git'
Host key verification failed.
fatal: Could not read from remote repos...

one year ago
Show more results compactanswers