Yes, so here you have the three tasks (here is a slight refactor using task_func instead of task, but the result is the same)
1- all different (status pending)
2- two equal (those which started)
3- all equal (all running or completed)
I'm running them with python my_script.py -m my_parameter=value_1,value_2,value_3
(using hydra multirun)
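(for context, my_script.py is just a standard Hydra app; a minimal sketch of its shape, with illustrative config/field names:)

```python
# my_script.py -- minimal sketch of the Hydra app shape used with the
# multirun command above; config/field names here are illustrative.
import hydra
from omegaconf import DictConfig


@hydra.main(config_path="configs", config_name="config", version_base=None)
def main(config: DictConfig) -> None:
    # With `-m my_parameter=value_1,value_2,value_3` Hydra calls this once
    # per value, each time with a different resolved config.
    print(f"run with my_parameter={config.my_parameter}")


if __name__ == "__main__":
    main()
```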
each of those runs finished and produced 10 plots, but in clearml only 1, a few, or none got uploaded
multirun is not working as expected
when I run python run.py -m env=gpu clearml.task_name=demo_all_models "model=glob(*)"
it should run remotely one run per model
this is the output I see locally
` ╰─ python run.py -m env=gpu clearml.task_name=demo_all_models "model=glob(*)"
2022/09/13 20:49:31 WARNING mlflow.utils.autologging_utils: You are using an unsupported version of pytorch. If you encounter errors during autologging, try upgrading / downgrading pytorch to a supported version, or...
it doesn't happen with all the tasks of the multirun, as you can see in the photo
I don't think it will be reproducible with the hydra example. It was just that I launched like 50 jobs and some of them, maybe because of the parameters, failed (strangely with no error).
But it's ok for now I guess, I'll debug whether those experiments that failed would fail if run independently as well
indeed, I'm looking at their corresponding multirun output folders and the logs terminate early without error, and the only plots saved are those in clearml. So as you say, it seems hydra kills these
that did it! 🙌 thank you!
still the same result. What's strange is that for the remote jobs, as soon as they are launched, if I compare their configs while in state pending, they have the right (all different) configs, but later, while running, they all revert to the same config by the end
` ─ python run.py -m env=gpu clearml.task_name=connect_test "model=glob(*)" trainer_params.max_epochs=5
2022/09/14 01:10:07 WARNING mlflow.utils.autologging_utils: You are using an unsupported version of pytorch. If you encounter errors during autologging, try upgrading / downgrading pytorch to a supported version, or try upgrading MLflow.
/Users/juan/mindfoundry/git_projects/cvae/run.py:38: UserWarning:
The version_base parameter is not specified.
Please specify a compatability version level...
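(side note: that version_base warning goes away if the decorator states it explicitly; this is the kind of change I mean, sketched with my config names:)

```python
import hydra
from omegaconf import DictConfig


# Passing version_base explicitly (None keeps the pre-1.2 behaviour) silences
# the "version_base parameter is not specified" warning shown above.
@hydra.main(version_base=None, config_path="configs", config_name="ou_cvae")
def main(config: DictConfig) -> None:
    ...
```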
actually I really need help with this, I've been struggling for 2 days to make the AWS autoscaler work.
what I want:
do a multirun with hydra where each of the runs gets executed remotely
my implementation (I iterated over several approaches, using create_function_task, etc.):
` @hydra.main(config_path="configs", config_name="ou_cvae")
def main(config: DictConfig):
    curr_dir = Path(__file__).parent
    if config.clearml.enabled:
        # Task.force_requirements_env_freeze(requirements_file=str(cur...
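to make the "each run executed remotely" part concrete, the pattern I'm going for is roughly this (just a sketch; the queue name and the clearml.project_name field are placeholders, not my actual config):

```python
# Sketch: one ClearML task per Hydra run, handed off to an agent queue.
# Queue name and config field names here are placeholders.
import hydra
from clearml import Task
from omegaconf import DictConfig, OmegaConf


@hydra.main(config_path="configs", config_name="ou_cvae", version_base=None)
def main(config: DictConfig) -> None:
    if config.clearml.enabled:
        task = Task.init(
            project_name=config.clearml.project_name,
            task_name=config.clearml.task_name,
        )
        # Log the resolved Hydra config so each run keeps its own overrides.
        task.connect_configuration(OmegaConf.to_container(config, resolve=True))
        # Stop executing locally and enqueue this task for an agent instead.
        task.execute_remotely(queue_name="default", exit_process=True)
    # ... training / plotting code goes here ...


if __name__ == "__main__":
    main()
```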
using 1.3.0
I'm using the latest version of clearml and clearml-agent and I'm seeing the same error
I guess one solution would be to write a clearml launcher plugin for hydra (https://hydra.cc/docs/advanced/plugins/overview/), like the one for ray.
I leave it here though for now (end of POC)
it also happens with other configuration values, like this one, which is a boolean. I think it happens in general with configuration values that are passed in the run command as flags (using hydra's override syntax)
waiting now for the run...
but I still have the problem if I try to run clearml-agent execute --id ... locally for debugging purposes
yes, the remote task is working 🙂
I find this error if I try to run any of the runs generated:
clearml_agent: ERROR: Could not find task id=a270d2a56feb475181ef3c9c82111b7f (for host: some_secret_host) Exception: __init__() got an unexpected keyword argument 'types'
my bad :man-facepalming: the hydra error is because the data config folder is not committed (it's gitignored)
ok, yes, but this will install the package of the branch specified there.
So if I'm working on my own branch and want to run an experiment, I would have to manually put my current branch name in the git path. I guess I can add some logic to get the current branch from the env. Thank you
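(something like this is the logic I mean for picking up the current branch; just a sketch, it assumes the script runs from inside the repo checkout:)

```python
# Sketch: ask git for the branch the working copy is currently on, so the
# branch name doesn't have to be hard-coded in the requirements git path.
import subprocess


def current_git_branch() -> str:
    return subprocess.check_output(
        ["git", "rev-parse", "--abbrev-ref", "HEAD"],
        text=True,
    ).strip()
```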
found the env freeze. For the second workflow, all I would need then, I guess, would be an env variable that tells me whether this is currently being run by an agent or not
basically running_locally()
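i.e. a check along these lines (sketch; the project/task names are placeholders):

```python
# Sketch: branch on whether this process is the local dev run or is being
# executed by a clearml-agent.
from clearml import Task

task = Task.init(project_name="cvae", task_name="demo")  # placeholder names

if Task.running_locally():
    # local run: e.g. freeze requirements, then hand off to the queue
    ...
else:
    # running under a clearml-agent
    ...
```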
ok, I think I have everything I need. Will give it a try.
I'm trying to use https://clear.ml/docs/latest/docs/webapp/applications/apps_aws_autoscaler .
In the setup, I have to provide a personal access token (PAT) from git.
The agents, when setting up the environment to run the tasks from the queue, cannot clone the repo using the PAT
cloning: git@gitlab.com:<redacted>.git
Using user/pass credentials - replacing ssh url 'git@gitlab.com:<redacted>.git' with https url '<redacted>.git'
Host key verification failed.
fatal: Could not read from remote repos...