I can scroll sideways, but if I open any of the comparison items, I can pretty much only see one experiment's values
I guess it's mixed. If #340 is resolved, then this initializer task will be a no-op: detach, and init-close new tasks as needed.
The instance that took a while to terminate (or has taken a while to disappear from the idle workers)
SuccessfulKoala55 could this be related to the monkey patching of the logging module? We have our own logging handlers that we use in this case
What do you mean? 😄 Using logging.config.dictConfig(...)
What's new in 1.1.6rc0?
If that's the case, wouldn't it apply across the board? This happens in a single task within Ray; the other tasks (I have many in a single run) are fine
Or do you mean the contents of the configuration? Probably 🤦 ... one moment
I'll try upgrading to 1.1.5, one moment
I thought so too - so I added flush calls just in case, but nothing's changed.
This is somewhat weird since it always happens in the above scenario (Ray + ClearML), and always in the last task/job from Ray
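For reference, the flush calls I added are roughly this kind of thing at the end of the task (a minimal sketch; where exactly it runs is up to the task code):

```python
import logging
import sys

def flush_all_logs():
    # Explicitly flush every handler attached to the root logger,
    # then stdout/stderr, before the Ray task returns.
    for handler in logging.getLogger().handlers:
        handler.flush()
    sys.stdout.flush()
    sys.stderr.flush()
```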
We just inherit from logging.Handler and use that in our logging.config.dictConfig; weird thing is that it still logs most of the tasks, just not the last one?
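Roughly like this, if that helps (a minimal sketch; the handler class and config are simplified stand-ins for what we actually use):

```python
import logging
import logging.config

class OurHandler(logging.Handler):
    # Stand-in for our real handler; it just prints the formatted record.
    def emit(self, record):
        print(self.format(record))

logging.config.dictConfig({
    "version": 1,
    "disable_existing_loggers": False,
    "formatters": {"plain": {"format": "%(levelname)s %(name)s: %(message)s"}},
    "handlers": {
        "ours": {"()": OurHandler, "formatter": "plain", "level": "INFO"},
    },
    "root": {"handlers": ["ours"], "level": "INFO"},
})

logging.getLogger(__name__).info("hello")
```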
I'll try with 1.1.5 first, then 1.1.6rc0
Another side effect btw is that some of our log files (we add a file handler to the logger) end up at 0 bytes. This specifically happens with Ray and ClearML and does not reproduce locally
TimelyPenguin76 I added pip install --upgrade clearml-agent to the extra_vm_bash_script for the autoscaler; that should at least guarantee the latest clearml-agent is used on the instance, right?
Odd; switching to virtual environment results in fatal: could not read Username for ' ': terminal prompts disabled, even though it does earlier show that agent.git_user = xxx
That's enabled; I was asking if there are flags to add to the pip install CLI, such as --no-use-pep517
I'm guessing that's not on pypi yet?
I'm trying, let's see; our infra person is away on holidays :X Thanks! Uh, which configuration exactly would you like to see? We're running using the Helm charts on K8s, so I don't think I have direct access to the agent configuration or can update it separately?
Nope, no .netrc defined anywhere, really (+ I've abandoned the use of Docker for the autoscaler as it complicates things, at least for now)
Sounds like a nice idea 😁
Follow-up: any ideas how to avoid PEP 517 with the autoscaler? 🤔 It takes a long time to build the wheels
That was a good idea; unfortunately it did not help too much, but I think I may have found a workaround, thanks!
I'm using some old agent I fear, since our infra person decided to use chart 3.3.0 😕
I'll try with the env var too. Do you personally recommend docker over the simple AMI + virtual environment?
More complete log does not add much information:
Cloning into '/root/.clearml/venvs-builds/3.10/task_repository/xxx/xxx'...
fatal: could not read Username for ' ': terminal prompts disabled
fatal: clone of ' ' into submodule path '/root/.clearml/venvs-builds/3.10/task_repository/...
Then the username and password would be visible in the autoscaler task 😕
But it should work out of the box; it works like that out of the box regardless of ClearML, too. The user and personal access token are used as-is, and they propagate down to the submodules, since those are simply other git repositories.
I've run further checks on a different machine and it works there as well 🤔
We have a read-only user with a personal access token for these things; it works seamlessly throughout and on our current on-premise servers... So perhaps something is missing in the autoscaler definitions?
Hurrah! Added git config --system credential.helper 'store --file /root/.git-credentials' to the extra_vm_bash_script and now it works (it writes the given git credentials to the store file, which can then be used immediately for the recursive submodule clones)
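For anyone hitting the same thing: the store credential helper keeps credentials as plain-text URLs, one per line, in the file you point it at (here /root/.git-credentials), e.g. a line like https://<git_user>:<personal_access_token>@github.com (the host here is only an example), and git reuses that entry for the subsequent submodule clones.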
A different AMI image / installing older Python versions that don't enforce this...
For future reference though, the environment variable should be PIP_USE_PEP517=false
I just set the git credentials in the clearml.conf and it works out of the box
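In case it helps someone else, the relevant bit of clearml.conf looks roughly like this (placeholder values, not our real ones):

```
agent {
    # read-only git user + personal access token used for cloning (incl. submodules)
    git_user: "readonly-user"
    git_pass: "personal-access-token"
}
```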