Unfortunately not, each task defines and constructs its own dataset. I want the cloned task to save that link 🤔
I created a new task with the project name `internal tests`, and no task name (so it's derived by ClearML). The task was a simple print out.
The project does not appear in the project space and does not turn up in searches (the task does)
Basically, when there are occasionally extreme values (i.e. most values fall in the [0, 50] range, and one value suddenly falls in the 50e+12 range), the plotting library (matplotlib or ClearML, unsure) hangs for a really long time
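A minimal workaround sketch (not from the thread; the array name and the robust cap are assumptions): clip the series at a robust threshold before handing it to the plotting layer, so a single 50e+12 value cannot blow up the axis range.

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)
values = rng.uniform(0, 50, size=200)  # typical data in [0, 50]
values[100] = 50e12                    # one extreme outlier, as described above

# Cap at a robust threshold (median + 10 * IQR) before plotting/reporting,
# so the outlier cannot stretch the axis range
q1, med, q3 = np.percentile(values, [25, 50, 75])
cap = med + 10 * (q3 - q1)
plt.plot(np.minimum(values, cap))
plt.savefig("metric.png")
```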
I'll have a look; at least it seems to only use `from clearml import Task`, so unless mlflow changed their SDK, it might still work!
Sure! It looks like this.
My suspicion is that this relates to https://clearml.slack.com/archives/CTK20V944/p1643277475287779 , where the config file is loaded prematurely (upon `import`), so our `dotenv.load_dotenv()` call has not yet registered.
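If that suspicion is right, a minimal sketch of the workaround (the project name is made up): make sure `load_dotenv()` runs before anything from `clearml` is imported, since the SDK reads its configuration at import time.

```python
# Load .env *before* importing clearml - the SDK reads its config at import time
from dotenv import load_dotenv
load_dotenv()

from clearml import Task  # deliberately imported after load_dotenv()

task = Task.init(project_name="internal tests")
```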
Ah. Apparently getting a task ID while it’s running can cause this behaviour 🤔
We have a more complicated case but I'll work around it 😄
Follow-up though - can configuration objects refer to one another internally in ClearML?
BTW AgitatedDove14, following this discussion I ended up doing the regex way myself to sync these, so our code has something like the following. We abuse the object description here to store the desired file path.
```python
config_path = task.connect_configuration(configuration=config_path, name=config_fname)
included_files = find_included_files_in_source(config_path)
while included_files:
    file_to_include = included_files.pop()
    sub_config = task.connect_configuration(
        configurat...
```
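For completeness, a hedged sketch of how the truncated loop above might continue - `find_included_files_in_source` is the helper from the snippet, and everything past `configurat...` is my reconstruction, not the original code:

```python
config_path = task.connect_configuration(configuration=config_path, name=config_fname)
included_files = find_included_files_in_source(config_path)
while included_files:
    file_to_include = included_files.pop()
    sub_config = task.connect_configuration(
        configuration=file_to_include,
        name=file_to_include.name,
        # the "abuse": store the desired file path in the object description
        description=str(file_to_include),
    )
    # recurse into the included file's own includes
    included_files.extend(find_included_files_in_source(sub_config))
```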
And last but not least, for dictionaries for example, it would be really cool if one could do:

```python
my_config = task.connect_configuration(my_config, name=name)
my_other_config = task.connect_configuration(my_other_config, name=other_name)
my_other_config['bar'] = my_config  # Creates the link automatically between the dictionaries
```
And `task = Task.init(project_name=conf.get("project_name"), ...)` is basically a no-op in remote execution, so it does not matter if `conf` is empty, right?
After the task was initialized? 🤔
I'm not sure why ClearML internally tries to initialize a task when `get_task` is called...
It does not 🙂
We started discussing it here - https://clearml.slack.com/archives/CTK20V944/p1640955599257500?thread_ts=1640867211.238900&cid=CTK20V944
You suggested this solution - https://clearml.slack.com/archives/CTK20V944/p1640973263261400?thread_ts=1640867211.238900&cid=CTK20V944
And I eventually found this solution to work - https://clearml.slack.com/archives/CTK20V944/p1641034236266500?thread_ts=1640867211.238900&cid=CTK20V944
Because setting env vars and ensuring they exist on the remote machine during execution, etc., is more complicated 😁
There are always ways around, I was just wondering what is the expected flow 🙂
JitteryCoyote63, please do not get used to it :D There's an open ticket/feature request to either revert this or let the user/server choose the most comfortable way
For now this is okay - no data lost, really - but I'd like to make sure we're not missing any steps in the next upgrade
It's a small snippet that ensures identically named projects are still made unique with a running number.
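A minimal sketch of that idea (`existing` stands in for however the current project names are fetched, e.g. through the ClearML API):

```python
def uniquify(name: str, existing: set[str]) -> str:
    """Append a running number until the project name is unique."""
    if name not in existing:
        return name
    i = 2
    while f"{name} ({i})" in existing:
        i += 1
    return f"{name} ({i})"

print(uniquify("internal tests", {"internal tests", "internal tests (2)"}))
# -> internal tests (3)
```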
Added the following line under `volumes` for `apiserver`, `fileserver`, and `agent-services`:
`- /data/clearml:/data/clearml`
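For reference, this is roughly how that looks in `docker-compose.yml` (only the relevant keys shown; the rest of each service definition is omitted):

```yaml
services:
  apiserver:
    volumes:
      - /data/clearml:/data/clearml
  fileserver:
    volumes:
      - /data/clearml:/data/clearml
  agent-services:
    volumes:
      - /data/clearml:/data/clearml
```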
Does it make sense to you to run several such glue instances, to manage multiple resource requirements?
Perfect, thanks for the answers Valeriano. These small details are missing from the documentation, but I now feel much more confident in setting this up.
Yes, I’ve found that too (as mentioned, I’m familiar with the repository). My issue is still that there is no documentation as to what this actually offers.
Is this simply a helm chart to run an agent on a single pod? Does it scale in any way? Basically - is it a simple agent (similar to on-premise agents, running in the background, but here on K8s), or is it a more advanced one that offers scaling features? What is it intended for, and how does it work?
The official documentation is very sparse...
Maybe @<1523701827080556544:profile|JuicyFox94> can answer some questions then…
For example, what’s the difference between `agentk8sglue.nodeSelector` and `agentk8sglue.basePodTemplate.nodeSelector`?
Am I correct in understanding that the former decides the node type that runs the “scaler” (listening to the given `agentk8sglue.queue`), and the latter applies to any newly booted instance/pod that will actually run the agent and the task?
Read: the former can be kept lightweight, as it does no...
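If that reading is right, the split in the chart’s `values.yaml` would look something like this (the node labels are made up):

```yaml
agentk8sglue:
  # schedules the glue/"scaler" pod itself - can stay on a lightweight node
  nodeSelector:
    pool: system
  basePodTemplate:
    # applied to every pod the glue spawns to actually run an agent + task
    nodeSelector:
      pool: gpu-workers
```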
We’re using `karpenter` (more magic keywords for me), so my understanding is that that will manage the scaling part.
But... which queue does it listen to, and which type of instances will it use, etc.?