AmusedCat74

11 Questions, 53 Answers

Active since 26 January 2023

Last activity one year ago

Reputation

Badges 1

31 × Eureka!

0 Votes

1 Answers

1K Views

Gcp Autoscaler Feature Request: Currently, One Has To Provide The Exact Machine Image To Use In The 'Machine Image' Field. However, I'd Like To Use The Latest Image From An Image Family. In

GCP Autoscaler Feature Request: Currently, one has to provide the exact machine image to use in the 'Machine Image' field. However, I'd like to use the lates...

clearml

one year ago

0 Votes

4 Answers

912 Views

Hi, I'm Trying To Understand A Bit More About

Hi, I'm trying to understand a bit more about clearml-agent. When a task is run through clearml-agent, why is clearml-agent itself installed in the python en...

clearml

one year ago

0 Votes

2 Answers

2K Views

There Is A

There is a requests vulnerability. Can someone please action None

clearml

2 years ago

0 Votes

7 Answers

1K Views

Gcp Autoscaler Limits Not Working Correctly?

GCP AutoScaler limits not working correctly? Hi there, I have encountered some unexpected behaviour with the GCP Autoscaler. The AutoScaler does not appear t...

clearml

one year ago

0 Votes

10 Answers

2K Views

Hi All, I Am Having An Issue With Clearml Scheduler Where It Doesn't Reuse The Task As I Would Expected. I Have Raised

Hi all, I am having an issue with ClearML Scheduler where it doesn't reuse the task as I would expect. I have raised this issue. Has anyone else experienc...

clearml

2 years ago

0 Votes

12 Answers

2K Views

Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

Hello everyone, I am having issues with the GCP Autoscaler. This is in the output logs: 2023-11-17 11:18:19,156 - clearml.Auto-Scaler - ERROR - Found invalid...

clearml

2 years ago

0 Votes

12 Answers

3K Views

Been Looking All Over And Can't Find The Solution To This One. If I Have A User Id Hash (The 32 Alphanumeric), How Do I Find The User's Name Which Is Associated With This Id Using Clearml Python Sdk? Cheers.

Been looking all over and can't find the solution to this one. If I have a user id hash (the 32 alphanumeric), how do I find the user's name which is associa...

clearml

2 years ago

0 Votes

1 Answers

2K Views

Hey All, Does Anyone Know How To Query Hidden Aws Autoscaler Tasks Using The Python Sdk? I've Put Such A Task Below

Hey all, does anyone know how to query hidden aws autoscaler tasks using the python sdk? I've put such a task below Task.query_tasks(task_filter={"system_tag...

aws mlops

2 years ago

0 Votes

3 Answers

2K Views

Hi All, I'm Getting Set Up With Gcp Autoscaler And I'm Wondering What Image People Typically Use For Running Docker Jobs. The Image That I Was Using

Hi all, I'm getting set up with GCP Autoscaler and I'm wondering what image people typically use for running docker jobs. The image that I was using projects...

clearml

2 years ago

0 Votes

11 Answers

2K Views

Hi All, I Am Seeing This In My Logs:

Hi all, I am seeing this in my logs: 2023-02-08 15:17:25,538 - clearml - WARNING - Failed decoding debug image [553, 509, 3] 2023-02-08 15:17:25,539 - clearm...

clearml

2 years ago

0 Votes

1 Answers

2K Views

Hi All, I Have An Ongoing Issue With Queues.

Hi all, I have an ongoing issue with queues. - I send a task to my_queue which does not have any listening agents. This is done using task = Task.init(...) a...

clearml

2 years ago

0 Been Looking All Over And Can't Find The Solution To This One. If I Have A User Id Hash (The 32 Alphanumeric), How Do I Find The User's Name Which Is Associated With This Id Using Clearml Python Sdk? Cheers.

Furthermore, when using APIClient(), users is not a valid endpoint at all.

class APIClient(object):

    auth = None  # type: Any
    queues = None  # type: Any
    tasks = None  # type: Any
    workers = None  # type: Any
    events = None  # type: Any
    models = None  # type: Any
    projects = None  # type: Any

This is taken from clearml/backend_api/session/client/client.py
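If the client class really does not expose a users service, one possible fallback is to call the REST API directly. The sketch below only builds the request and parses a reply, so it runs without a server. It assumes the server offers a users.get_by_id endpoint that takes a "user" field and returns the record under data.user; the host, token, and both helper names are hypothetical placeholders, not part of the SDK.

```python
import json
import urllib.request


def build_user_lookup_request(api_host, token, user_id):
    """Build (but do not send) a POST request for the assumed users.get_by_id endpoint."""
    url = api_host.rstrip("/") + "/users.get_by_id"
    body = json.dumps({"user": user_id}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Authorization": "Bearer " + token,  # same header style as a curl -H call
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_user_name(response_text):
    """Return the user's name from a response body, or None if it is absent.

    Assumes the reply nests the record under data.user (an assumption).
    """
    payload = json.loads(response_text)
    user = (payload.get("data") or {}).get("user") or {}
    return user.get("name")
```

To actually send it, something like urllib.request.urlopen(build_user_lookup_request(host, token, user_id)) should work, with the decoded body passed to extract_user_name.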

2 years ago

0 Good Day, We Have Been Using Clearml For Project Monitoring And Task Management. Recently, We Decided To Try The Google Cloud Platform Autoscaler To Automate Our Existing Gcp Vm Creation Pipelines, Task Queuing, And Processing. However, We've Encountere

Given that nvidia-smi is working, you may have already done that. In this case, depending on your Ubuntu version, you may have another problem. Ubuntu 22+ has this issue, which has a workaround. This also caught me out...

6 months ago

0 Hi All, I Am Seeing This In My Logs:

Yep, that seems to be working fine.

2 years ago

0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

@SuccessfulKoala55 Just following up, as I figured out what was happening here and it could be useful in the future.

The prefilled value for Number of GPUs in the GCP Autoscaler is 1.

When one ticks Run in CPU mode (no gpus), it hides the GPU Type and Number of GPUs fields. However, the values that were in these fields are still submitted in the API request (I'm guessing here) when the Autoscaler is launched.

Hence, to get past this, you need to...

one year ago

0 Hi All, I Am Having An Issue With Clearml Scheduler Where It Doesn't Reuse The Task As I Would Expected. I Have Raised

If a Task is in the 'Completed' state, I think the only option is to 'Reset' it (see image). You do clear the previous run's execution, but I think for a repetitive task this is fine.
Maybe this should only be the case if it is in a 'Completed' state rather than 'Failed'. I can see that in the 'Failed' case you would not want to clear the execution, because you would want to see why it failed. Thoughts?

2 years ago

0 Hi All, I Am Having An Issue With Clearml Scheduler Where It Doesn't Reuse The Task As I Would Expected. I Have Raised

This is not working. Please see None which details the problem

2 years ago

0 Gcp Autoscaler Limits Not Working Correctly?

Let me know if you need additional information.

one year ago

0 Hi All, I Am Seeing This In My Logs:

Thank you 👍

2 years ago

0 Hi All, We Are Running Clearml-Session With A Ubuntu Docker Container, Our Host System Is Ubuntu 22.04. From Time To Time We Have Driver Failures. Just Nvidia Driver Stops To Work Inside Clearml-Session, While At Host Machine Everything Is Fine. May Be So

Hi, we encountered this a while ago. In our case, there was an issue with running Docker containers with GPUs on Ubuntu 22.04.

See this issue for more info:

https://github.com/NVIDIA/nvidia-docker/issues/1730

11 months ago

Is there a way I can do this with the python APIClient or even with the requests library?

2 years ago

0 Hi, I Am Trying To Figure Out How To Get Cloud Storage Access Working In The Agent. I Am Running The Agent Locally In Docker Mode. I Set Up Gcp Storage In The Clearml.Json But It Seems Not To Get Passed To The Agent. Also Tried To Add Agent.Extra_Docker_A

@BraveToad81

one year ago

0 Hi All, I Am Having An Issue With Clearml Scheduler Where It Doesn't Reuse The Task As I Would Expected. I Have Raised

Yep 👍

2 years ago

0 Gcp Autoscaler Limits Not Working Correctly?

Apologies for the delay.

I have obfuscated the private information with XXX. Let me know if you think any of it is relevant.

{"gcp_project_id":"XXX","gcp_zone":"XXX","subnetwork":"XXX","gcp_credentials":"{\n  \"type\": \"service_account\",\n  \"project_id\": \"XXX\",\n  \"private_key_id\": \"XXX\",\n  \"private_key\": \"XXX\",\n  \"client_id\": \"XXX\",\n  \"auth_uri\": \"XXX\",\n  \"token_uri\": \"XXX\",\n  \"auth_provider_x509_cert_url\": \"XXX\",\n  \"client_x509_cert_url\": \"...

one year ago

Is there documentation for this? Unfortunately, I was not able to figure it out.

2 years ago

0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

@EnthusiasticShrimp49 How do I specify not to attach a GPU? I thought ticking 'Run in CPU Mode' would be sufficient. Is there something else I'm missing?

2 years ago

0 Is It Possible To Merge

Cheers 👍

2 years ago

0 Trying To Run Get_Tasks On Aws Lambda And Getting The Following Error. Any Suggestions On How To Overcome This? { "Errormessage": "[Errno 38] Function Not Implemented", "Errortype": "Oserror", "Requestid": "", "Stacktrace": [ " File \"/Var/L

I don't think there's really a way around this because AWS Lambda doesn't allow for multiprocessing.

Instead, I've resorted to using a ClearML Scheduler, which runs on a t3.micro instance, for jobs that I want to run on a cron schedule.

2 years ago

0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

Thanks Jake. Do you know how I set the GPU count to 0?

2 years ago

Ah ok

2 years ago

0 Hi Today I'm Suddenly Getting This

I am having the same error since yesterday on Ubuntu. Works fine on Mac.

I cannot ping api.clear.ml

2 years ago

0 Hi All, I Am Seeing This In My Logs:

I am using ClearML version 1.9.1. In code, I am creating a plot using matplotlib. I am able to see this in TensorBoard, but it is not available in ClearML Plots.

2 years ago

0 Hi Today I'm Suddenly Getting This

It started working fine this morning again for me

2 years ago

$ curl -H "Authorization: Bearer <TOKEN>" -X GET



{"meta":{"id":"ed6c52d030f240a89f001b447ee64a6b","trx":"ed6c52d030f240a89f001b447ee64a6b","endpoint":{"name":"debug.ping","requested_version":"2.26","actual_version":"1.0"},"result_code":200,"result_subcode":0,"result_msg":"OK","error_stack":null,"error_data":{},"alarms":{}},"data":{"msg":"Hello World"}}

$ curl -H "Authoriz...
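For anyone scripting against replies like the debug.ping one above, here is a minimal sketch of unpacking that envelope in Python. It relies only on the meta.result_code, meta.result_msg, and data fields visible in the pasted response; the function name unwrap_response is hypothetical, and treating anything other than 200 as a failure is an assumption.

```python
import json


def unwrap_response(raw):
    """Return the "data" part of a reply, raising if meta.result_code is not 200."""
    envelope = json.loads(raw)
    meta = envelope.get("meta", {})
    code = meta.get("result_code")
    if code != 200:
        # Assumption: any non-200 result_code is treated as an error here.
        raise RuntimeError("API call failed: {} {}".format(code, meta.get("result_msg")))
    return envelope.get("data", {})


# Shortened copy of the debug.ping reply shown above.
sample = (
    '{"meta":{"endpoint":{"name":"debug.ping"},"result_code":200,'
    '"result_subcode":0,"result_msg":"OK"},"data":{"msg":"Hello World"}}'
)
print(unwrap_response(sample)["msg"])
```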

2 years ago
