I guess it does not do so for all settings, but only those that come from Session()
Right, but that's as defined in the services agent, which is not immediately transparent
Let me know if you do; would be nice to have control over that 😁
The idea is that the features would be copied/accessed by the server, so we can transition slowly and not use the available storage manager for data monitoring
It's missing the repository information of course, but the 'configuration/Args' were logged. So something weird is happening in identifying the repository
Thanks SuccessfulKoala55 and AgitatedDove14 ! We'll go through the hoops of setting up mongo on AWS then.
We're working to decouple the data from the Helm chart; it seems like a dangerous idea to store long-term data on k8s in case of failure 😅
Or some users that update their `poetry.lock`, and some that update manually as they prefer to resolve on their own.
Well, you can install the binary in the additional startup commands.
Matter of fact, you can just include the ECR login in the "startup steps" offered by the scaler, so no need for this repository. I was thinking these were local instances.
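Roughly what I mean (the `extra_vm_bash_script` field name is from the open-source aws_autoscaler example; the account id and region are placeholders):
` # sketch: embed the ECR login in the autoscaler's startup bash script
extra_vm_bash_script = """
aws ecr get-login-password --region us-east-1 | docker login --username AWS --password-stdin 123456789012.dkr.ecr.us-east-1.amazonaws.com
"""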
Thanks CostlyOstrich36 !
And can I make sure the same budget applies to two different queues?
So that, for example, an autoscaler would have a resource budget of 6 instances, and it would listen to the `aws` and `default` queues as needed?
Kinda, yes, and this has changed with 1.8.1.
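For reference, if I remember the open-source aws_autoscaler example correctly, the budget sits on each queue entry as [resource, max_instances] pairs, so "sharing" one budget means repeating it per queue (shape and names are my assumption):
` # each queue maps to [resource_name, max_instances] pairs
queues = {
    "aws":     [["gpu_resource", 6]],
    "default": [["gpu_resource", 6]],
}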
The thing is that AFAIK ClearML currently does not officially support a remotely executed task spawning more tasks, so we also have a small hack that marks the remote "master process" as a local task before anything else runs.
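The hack is roughly this (a sketch only; it assumes the SDK decides "remote vs. local" from the task-id env vars the agent sets, and the project/queue names are hypothetical):
` import os
from clearml import Task

# grab the master task the agent started for us
master = Task.init(project_name="proj", task_name="master")

# assumption: dropping the agent-set task-id variables makes the SDK treat
# everything that follows as if it were running locally
os.environ.pop("CLEARML_TASK_ID", None)
os.environ.pop("TRAINS_TASK_ID", None)  # legacy name

# now spawning child tasks works as it would on a workstation
child = Task.create(project_name="proj", task_name="child")
Task.enqueue(child, queue_name="default")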
Coming back to this; ClearML prints a lot of error messages in local tests, supposedly because the output streams are not directly available:
` --- Logging error ---
Traceback (most recent call last):
File "/usr/lib/python3.10/logging/init.py", line 1103, in emit
stream.write(msg + self.terminator)
ValueError: I/O operation on closed file.
Call stack:
File "/home/idan/CC/git/ds-platform/.venv/lib/python3.10/site-packages/clearml/task.py", line 3504, in _at_exit
self.__shutdown...
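We might try closing the task explicitly while the captured streams are still open; something like this (project/task names hypothetical):
` from clearml import Task

task = Task.init(project_name="proj", task_name="local-test")
try:
    ...  # actual test body
finally:
    task.close()  # flush and shut ClearML down before stdout/stderr are closed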
i.e. ERROR Fetching experiments failed. Reason: Backend timeout (600s)
ERROR Fetching experiments failed. Reason: Invalid project ID
I don't think there's an issue or PR for that yet; at least I haven't created one.
I could have a look at this and maybe make a PR.
Not sure what would the recommended flow be like though 🤔
Thanks, that's what I thought - so I'm missing something else in the installation. I'll dig further 🙂
Another example - trying to validate dataset interactions ends with
` else:
    self._created_task = True
    dataset_project, parent_project = self._build_hidden_project_name(dataset_project, dataset_name)
    task = Task.create(
        project_name=dataset_project, task_name=dataset_name, task_type=Task.TaskTypes.data_processing)
    if bool(Session.check_min_api_server_version(Dataset.__min_api_version)):
        get_or_create_proje...
I'm not sure; the setup is not unique to Mac.
Each user has their own `.env` file which is given to the code entry point, and at some point will be loaded with `dotenv.load_dotenv()`.
The environment variables are not set in code anywhere, but the `clearml.conf` uses them directly.
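So the entry point effectively does this before anything ClearML-related runs (a minimal sketch; the conf line assumes HOCON-style env substitution and a hypothetical variable name):
` from dotenv import load_dotenv

load_dotenv()  # pulls the user's .env into the environment

from clearml import Task  # imported only after the env is populated

# clearml.conf then references the variables directly, e.g.:
#   api.credentials.access_key = ${CLEARML_ACCESS_KEY}
task = Task.init(project_name="proj", task_name="run")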
No it does not show up. The instance spins up and then does nothing.
Maybe it's the missing `.bashrc` file, actually. I'll look into it.
Anything specific we should look into TimelyPenguin76 ?
We just redeployed to use the 1.1.4 version as Jake suggested, so the logs are gone 😞
... and any way to define the VPC is missing too 🤔
We do not CostlyFox64 , but this is useful for the future 🙂 Thanks!
TimelyPenguin76 I'll have a look, one moment.
SweetBadger76 TimelyPenguin76
We're finally tackling this (since it has kept us back at 1.3.2 even though 1.6.2 is out...), and noticed that now the bucket name is also part of the folder?
So, following up from David's latest example: `StorageManager.download_folder(remote_url='s3://****-bucket/david/', local_folder='./')` actually creates a new folder `./****-bucket/david/` and puts its contents there.
EDIT: This is with us using internal MinIO, so I believe ClearML parses that end...
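In code terms, what we observe (bucket name masked, same call shape as David's example):
` from clearml import StorageManager

StorageManager.download_folder(remote_url="s3://****-bucket/david/", local_folder="./")
# expected: the contents of david/ directly under ./
# observed: a new ./****-bucket/david/ folder holding the contents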
Sorry, I misspoke; yes, of course, the agent's config file, not the queues.
Seems like you're missing an image definition (AMI or otherwise)
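For comparison, a resource entry in the autoscaler config usually carries the image, roughly like this (field names as in the open-source aws_autoscaler example; values are placeholders):
` resource_configurations = {
    "default": {
        "instance_type": "g4dn.xlarge",
        "ami_id": "ami-0123456789abcdef0",  # the image definition
        "availability_zone": "us-east-1b",
        "ebs_device_name": "/dev/sda1",
        "ebs_volume_size": 100,
        "ebs_volume_type": "gp3",
    },
}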
I cannot, the instance is long gone... But it's no different from any other scaled instance; it seems it just took a while to register in ClearML
Hey FrothyDog40 ! Thanks for clarifying - guess we'll have to wait for that as a feature 😁
Should I create a new issue or just add to this one? https://github.com/allegroai/clearml/issues/529