UnevenDolphin73

106 Questions, 749 Answers

Active since 10 January 2023

Last activity 9 months ago

Reputation

Badges 1

662 × Eureka!

Questions 106
Answers 749

0 Votes

18 Answers

1K Views

0 Votes 18 Answers 1K Views

!! In Remote Execution, Do Agents Inherit The Config From The Queue From Which They Pull The Task?

!! In remote execution, do agents inherit the config from the queue from which they pull the task?

clearml

2 years ago

0 Votes

9 Answers

1K Views

0 Votes 9 Answers 1K Views

How Do I Stop A Zombie Agent?

How do I stop a zombie agent? 🤔 I see it in clearml-agent list , but I do not see it in ps aux and when I try clearml-agent daemon --stop , I get Could not ...

clearml

2 years ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

Is There Any Testing Suite That Ships With Clearml? If We'D Like To Make Some Unit Tests For Our Code?

Is there any testing suite that ships with ClearML? If we'd like to make some unit tests for our code?

clearml

2 years ago

0 Votes

3 Answers

983 Views

0 Votes 3 Answers 983 Views

+ Side Question - Any Plans To Include Native Support For

side question - any plans to include native support for lgbm ?

clearml

3 years ago

0 Votes

4 Answers

1K Views

0 Votes 4 Answers 1K Views

Can I Shutdown Specific Workers Somehow? Running

Can I shutdown specific workers somehow? Running clearml-agent .... --stop just iterates over all the local workers and shuts them down one by one

clearml

2 years ago

0 Votes

16 Answers

1K Views

0 Votes 16 Answers 1K Views

How Would I Go About Adding Multiple Credentials In The Autoscaler? (I.E. Specify Multiple

How would I go about adding multiple credentials in the autoscaler? (i.e. specify multiple sdk.aws.s3.credentials ) It's a bit odd that one would also have t...

mlops

2 years ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

For Some Reason The Agent Is Now Trying To Use Python 2.7 All Of A Sudden, Any Idea Why?

For some reason the agent is now trying to use Python 2.7 all of a sudden, any idea why? 🤔 I see the following in the task's console output: New python exec...

mlops

2 years ago

0 Votes

6 Answers

948 Views

0 Votes 6 Answers 948 Views

Is There Some Automated Migration For Existing Tasks From Other Mlops Frameworks To Clearml? (Specifically, Interested In Migrating From Mlflow)

Is there some automated migration for existing tasks from other MLOps frameworks to ClearML? (Specifically, interested in migrating from mlflow)

clearml

2 years ago

0 Votes

11 Answers

1K Views

0 Votes 11 Answers 1K Views

Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

Pipelines suddenly no longer appear in the Pipelines tab, what could/should I look into? EDIT: Using latest ClearML (agent, server, SDK), and creating the pi...

clearml

one year ago

0 Votes

13 Answers

1K Views

0 Votes 13 Answers 1K Views

We Use Environment Variables In Our

We use environment variables in our clearml.conf , but those might only be available later down the line (after dotenv.load_dotenv is called). When those env...

clearml

2 years ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

I'M Trying To Set Up Some Initial Experiments Within Our Stack, But When I Use The

I'm trying to set up some initial experiments within our stack, but when I use the execute_remotely task, I get this error: clearml_agent: ERROR: Failed gett...

clearml

3 years ago

0 Votes

0 Answers

1K Views

0 Votes 0 Answers 1K Views

Any Easy Way To Ensure The Pipline (Tasks + Controller) Are Not Hidden In The Projects Ui? Edit: Apparently Not; For The Time Being I Created This

Any easy way to ensure the pipline (tasks + controller) are not hidden in the projects UI? EDIT: Apparently not; for the time being I created this https://gi...

clearml

one year ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

What Happens To File That Are Downloaded To A Remote_Execution Via Storagemanager? Are They Removed At The End Of The Run, Or Does It Continuously Increases Disk Space?

What happens to file that are downloaded to a remote_execution via StorageManager? Are they removed at the end of the run, or does it continuously increases ...

clearml

2 years ago

0 Votes

18 Answers

1K Views

0 Votes 18 Answers 1K Views

What Would Be The Best Way To Approach This Flow?

What would be the best way to approach this flow? We have a configuration file that defines e.g. the project name to use in ClearML, alongside other experime...

clearml

2 years ago

0 Votes

18 Answers

1K Views

0 Votes 18 Answers 1K Views

Since Clearml 1.6.3, A Dataset Attached To A Task Now Renames That Task By Adding A

Since ClearML 1.6.3, a dataset attached to a Task now renames that task by adding a .datasets in the path, making it hidden. Can we control/cancel this? We c...

clearml

2 years ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

Is It Possible To Avoid The Clearml-Agent For Local Installations, And Have The File Server Automatically Use An S3 Bucket? I'Ve Found

Is it possible to avoid the clearml-agent for local installations, and have the file server automatically use an S3 bucket? I've found https://clearml.slack....

clearml

3 years ago

0 Votes

2 Answers

1K Views

0 Votes 2 Answers 1K Views

Is There A Way To Force Clearml To First Parse The Config File, Before Running

Is there a way to force ClearML to first parse the config file, before running Task.init ? We’re relying in some code that the credentials in clearml.conf ar...

clearml

one year ago

0 Votes

6 Answers

1K Views

0 Votes 6 Answers 1K Views

How/Where Can I Change The Session Timeout Time For The Webui? It'S Timing Out Way Too Fast After The Latest Updates

How/where can I change the session timeout time for the WebUI? It's timing out WAY too fast after the latest updates

clearml

2 years ago

0 Votes

2 Answers

1K Views

0 Votes 2 Answers 1K Views

When I Upgrade The Server, Do I Need To Do Anything With The Agents/Queues (Redefine Them, Etc)?

When I upgrade the server, do I need to do anything with the agents/queues (redefine them, etc)?

clearml

2 years ago

0 Votes

11 Answers

1K Views

0 Votes 11 Answers 1K Views

Clearml 1.3.2 I'M Running

ClearML 1.3.2 I'm running Task.get_tasks(project_name=...) and suddenly I get 2022-04-07 10:28:43,617 - clearml.Task - ERROR - Failed reloading task c2d514f3...

clearml

2 years ago

0 Votes

7 Answers

1K Views

0 Votes 7 Answers 1K Views

Can We Report A Pandas Table With Styling To Be Retained In The Webui? It Would Be Nice To Report E.G.

Can we report a pandas table with styling to be retained in the WebUI? It would be nice to report e.g. df.style.background_gradient(cmap='coolwarm', axis=Non...

clearml

2 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Can We Use Dynamodb With Clearml Helm Charts Instead Of Mongodb? We'D Like To Move All Stateful Storage To Aws As A Separate Service And That Would Be A Nice Alternative

Can we use dynamodb with ClearML helm charts instead of mongodb? We'd like to move all stateful storage to AWS as a separate service and that would be a nice...

clearml

2 years ago

0 Votes

6 Answers

1K Views

0 Votes 6 Answers 1K Views

Sometimes, On Shorter Tasks, My Task Would Hang For The 5 Minute Timeout With E.G.

Sometimes, on shorter tasks, my Task would hang for the 5 minute timeout with e.g. 2022-01-04 08:45:25,181 - clearml.Task - INFO - Waiting for repository det...

clearml

2 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

We'Re Running Into Errors Such As This:

We're running into errors such as this: Action failed <500/0: tasks.add_or_update_artifacts/v2.10 (Update failed (BSONObj size: 18564134 (0x11B4426) is inval...

clearml

2 years ago

0 Votes

7 Answers

949 Views

0 Votes 7 Answers 949 Views

Is There A Way To Generate Usage Stats And Reports For Queues? For Example, How Often Is A Queue Used, How Much Cpu Does

Is there a way to generate usage stats and reports for queues? For example, how often is a queue used, how much CPU does the queue consume when it's used, et...

clearml

2 years ago

0 Votes

4 Answers

1K Views

0 Votes 4 Answers 1K Views

Follow Up On Execute_Remotely, I See One Can Limit The Available Gpu Resources In A Worker Daemon; Could One Also Limit The Number Of Cpu Cores Available?

Follow up on execute_remotely, I see one can limit the available GPU resources in a worker daemon; could one also limit the number of CPU cores available?

clearml

2 years ago

0 Votes

7 Answers

996 Views

0 Votes 7 Answers 996 Views

In The Pipeline Examples, Components Have The Following Note:

In the pipeline examples, components have the following note: > # notice all package imports inside the function will be automatically logged as # required p...

clearml

one year ago

0 Votes

7 Answers

1K Views

0 Votes 7 Answers 1K Views

If I Create A Dataset With

If I create a dataset with Dataset.create(..., use_current_task=True) , that task holds the dataset. Can I then refer/copy/attach the same dataset to other t...

clearml

2 years ago

0 Votes

6 Answers

1K Views

0 Votes 6 Answers 1K Views

Can I Run An Autoscaler Listening To A Regular Queue (I.E. Combine Autoscaling With On Premise Machines)? Or Does It Run The Danger Of Creating An Ec2 Instance While An On-Premise Machine Takes The Job?

Can I run an autoscaler listening to a regular queue (i.e. combine autoscaling with on premise machines)? Or does it run the danger of creating an EC2 instan...

mlops

2 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

How Does Clearml Store Some Of The More Intrinsic Data (Scalars, Plots, Etc)? We Recently Recovered Mongo From A Backup And While The Projects And Tasks Are There, The Console, Info, Etc Of All Of Them Is Missing

How does ClearML store some of the more intrinsic data (scalars, plots, etc)? We recently recovered Mongo from a backup and while the projects and tasks are ...

clearml

one year ago

Show more results

0 How Can I Ensure Tasks In A Pipeline Have The Same Environment As The Pipeline Itself? It Seems A Bit Counter-Intuitive That The Pipeline (Executed Remotely) Captures The Local Environment, But The Tasks (Executed Remotely) Do Not Use That Same Environmen

Yes. Though again, just highlighting the naming of foo-mod is arbitrary. The actual module simply has a folder structured with an implicit namespace:

foo/
  mod/
    __init__.py
    # stuff

FWIW, for the time being I’m just setting the packages to all the packages the pipeline tasks sees with:

    packages = get_installed_pkgs_detail()
    packages = [f"{name}=={version}" if version else name for name, version in packages.values()]
    packages = task.data.script.require...

one year ago

There's no decorator, just e.g.

def helper(foo: Optional[Any] = None):
    return foo

def step_one(...):
    # stuff

Then the type hints are not removed from helper and the code immediately crashes when being run

one year ago

There's code that strips the type hints from the component function, just think it should be applied to the helper functions too :)

one year ago

0 We'Re Using Ray And Clearml Together, And Suddenly We'Re Seeing Some Hanging Threads, And Finally We Got An Error Message:

We just inherit from logging.Handler and use that in our logging.config.dictConfig ; weird thing is that it still logs most of the tasks, just not the last one?

2 years ago

0 We'Re Using Ray And Clearml Together, And Suddenly We'Re Seeing Some Hanging Threads, And Finally We Got An Error Message:

I'll try with 1.1.5 first, then 1.1.6rc0

2 years ago

0 If I Clone A Task, I Suppose All Artifacts Are Not Cloned With It, Even If They Are Registered, Right?

e.g. a separate structured user guide with common tips, usability, best practices - https://pandas.pydata.org/pandas-docs/stable/user_guide/index.html
vs the doc, where each function is its own page, e.g.
https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.read_csv.html

2 years ago

0 I'D Like The Console In A Clearml Run To Show Only The Stdout/Stderr As It Does Now, But I'D Also Like Clearml To Capture Debug Level Logs. Is There An Easy Around This? It Would Be Nice If One Could E.G. Set

I... did not, ashamed to admit. The documentation says only boolean values.

2 years ago

0 What Would Be The Best Way To Approach This Flow?

And last but not least, for dictionary for example, it would be really cool if one could do:
my_config = task.connect_configuration(my_config, name=name) my_other_config = task.connect_configuration(my_other_config, name=other_name) my_other_config['bar'] = my_config # Creates the link automatically between the dictionaries

2 years ago

0 Can One Compare Experiments/Tasks From Different Projects? Edit: I Mean, I Can Manually Navigate To Some

I can navigate through the projects, but selecting one task in one project, then navigating to another project and selecting a different task -> there is no suggestion to compare the tasks.

In the projects page if I show all - I just see the projects. If I search for a task of similar name, I get results, but I can't compare them via the UI.

The only way I managed so far was to create a pseudo-comparison between unrelated tasks in the same project, then remove one task from comparion, and u...

2 years ago

0 Is There Any Testing Suite That Ships With Clearml? If We'D Like To Make Some Unit Tests For Our Code?

Another example - trying to validate dataset interactions ends with

` else:
self._created_task = True
dataset_project, parent_project = self._build_hidden_project_name(dataset_project, dataset_name)
task = Task.create(
project_name=dataset_project, task_name=dataset_name, task_type=Task.TaskTypes.data_processing)
if bool(Session.check_min_api_server_version(Dataset.__min_api_version)):
get_or_create_proje...

2 years ago

0 I Guess One Experiment Is Running Backwards In Time

I have seen this quite frequently as well tbh!

2 years ago

0 Is There Any Testing Suite That Ships With Clearml? If We'D Like To Make Some Unit Tests For Our Code?

I'll try it out, but I would not like to rewrite that code myself maintain it, that's my point 😅

Or are you suggesting I Task.import_offline_session ?

2 years ago

0 Is There A Way To Generate Usage Stats And Reports For Queues? For Example, How Often Is A Queue Used, How Much Cpu Does

CostlyOstrich36 That looks promising, but I don't see any documentation on the returned schema (i.e. workers.worker_stats is not specified anywhere?)

2 years ago

We have an internal mono-repo and some of the packages are required - they’re all available correctly for the controller, only some are required for the individual tasks, but the “magic” doesn’t happen 😞
That is, the controller does not identify them as a requirement, so they’re not installed in the tasks environment.

one year ago

It’s just that for the packages argument, ClearML says:

If not provided, packages are automatically added based on the imports used inside the wrapped function.

So… 🤔

one year ago

0 Can I Somehow Change A Project Settings Via The Python Sdk?

I can also do this via Mongo directly, but I was hoping to skip the K8S interaction there.

one year ago

0 For Some Reason The Agent Is Now Trying To Use Python 2.7 All Of A Sudden, Any Idea Why?

Any follow up thoughts SuccessfulKoala55 or CostlyOstrich36 ?

2 years ago

0 Clearml Version 1.8.1 Had "Fix" For The Deferred Init Which Introduces A Bug Btw, I'Ve Opened

The deferred_init input argument to Task.init is bool by default, so checking type(deferred_init) == int makes no sense to begin with, and is altering the flow.

2 years ago

0 Is There Any Testing Suite That Ships With Clearml? If We'D Like To Make Some Unit Tests For Our Code?

Last but not least - can I cancel the offline zip creation if I'm not interested in it 🤔
EDIT: I see not, guess one has to patch ZipFile ...

2 years ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

FWIW running clearml ==1.9.1 with WebApp: 1.9.2-317 • Server: 1.9.2-317 • API: 2.23

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

Happens with the latest version indeed.
I can’t share our code, but the gist of it is:

pipe = PipelineController(name=..., project=..., version=...)

pipe.add_function_step(...)  # Many calls

pipe.set_default_execution_queue(...)
pipe.start(queue=..., wait=True)

one year ago

0 Pipelines Suddenly No Longer Appear In The Pipelines Tab, What Could/Should I Look Into? Edit: Using Latest Clearml (Agent, Server, Sdk), And Creating The Pipelines Via The Pipelinecontroller Sdk (

So the pipeline runs successfully, I can find all the different tasks, but I cannot see them in the Pipelines tab…

one year ago

0 Can We Use Dynamodb With Clearml Helm Charts Instead Of Mongodb? We'D Like To Move All Stateful Storage To Aws As A Separate Service And That Would Be A Nice Alternative

Thanks SuccessfulKoala55 and AgitatedDove14 ! We'll go through the hoops of setting up mongo on AWS then.
We're working to decouple the data from the helm chart, seems like a dangerous idea to store long term data on k8s in case of failure 😅

2 years ago

0 Bug Report? We Noticed That The Aws Autoscaler Will Lose Track Of Instances Crashing Due To No Space Left On Device, And The Ec2 Instance Will Remain Running Indefinitely.

We're using self hosted account

one year ago

0 How Would I Go About Adding Multiple Credentials In The Autoscaler? (I.E. Specify Multiple

I am indeed

2 years ago

0 We Have Configured The Aws Credentials In The Remote Worker'S

nevermind! Found and answered (solution in the issue linked above)

2 years ago

0 For Some Reason The Agent Is Now Trying To Use Python 2.7 All Of A Sudden, Any Idea Why?

Yes; I tried running it both outside venv and inside a venv. No idea why it uses 2.7?

2 years ago

I’ve tracked it down further, it seems the pigar utility does not apply any smart logic there.
The case we have is the following -

We have a monorepo, but all modules/libs share a common namespace foo ; so e.g. working on module mod , we use from foo.mod import …
This then looks for a module called foo , even though it’s just a namespace
In the dist-info requirement, it seems any hyphen, dot, etc are swapped for an underscore, so our site-packages represents this as `foo_m...

one year ago

0 Hi All, I'M Looking For A Hint On How To Configure A Local Network Drive (Nas) As A File Server For Clearml, That Will Be Cross Platform (We Have Windows, Linux And Macos Users...) The Easy Part Is To Modify The Conf File

minio was a tiny bit of headache to configure, but I'd be happy to help if you want CrookedWalrus33 , I just went through this process yesterday and today (see a few threads up...)

2 years ago

0 Is There A Way To Interface With Clearml Agent (Cli?) To Handle Model Repositories And Data Versioning (But So, Not Experimentation, Tight Integration, Pipelining, Etc)?

If everything is managed with a git repo, does this also mean PRs will have a messy metadata file attached to them?

3 years ago

Show more results