Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
AmusedCat74
Moderator
11 Questions, 50 Answers
  Active since 26 January 2023
  Last activity one month ago

Reputation

0

Badges 1

31 × Eureka!
0 Votes
12 Answers
901 Views
0 Votes 12 Answers 901 Views
Hello everyone, I am having issues with the GCP Autoscaler. This is in the output logs: 2023-11-17 11:18:19,156 - clearml.Auto-Scaler - ERROR - Found invalid...
one year ago
0 Votes
1 Answers
970 Views
0 Votes 1 Answers 970 Views
Hi all, I have an ongoing issue with queues. - I send a task to my_queue which does not have any listening agents. This is done using task = Task.init(...) a...
one year ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
Hey all, does anyone know how to query hidden aws autoscaler tasks using the python sdk? I've put such a task below Task.query_tasks(task_filter={"system_tag...
one year ago
0 Votes
2 Answers
942 Views
0 Votes 2 Answers 942 Views
There is a requests vulnerability. Can someone please action None
one year ago
0 Votes
4 Answers
152 Views
0 Votes 4 Answers 152 Views
Hi, I'm trying to understand a bit more about clearml-agent . When a task is run through clearml-agent why is clearml-agent itself installed in the python en...
2 months ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
Hi all, I am seeing this in my logs: 2023-02-08 15:17:25,538 - clearml - WARNING - Failed decoding debug image [553, 509, 3] 2023-02-08 15:17:25,539 - clearm...
one year ago
0 Votes
1 Answers
190 Views
0 Votes 1 Answers 190 Views
GCP Autoscaler Feature Request: Currently, one has to provide the exact machine image to use in the 'Machine Image' field. However, I'd like to use the lates...
2 months ago
0 Votes
12 Answers
1K Views
0 Votes 12 Answers 1K Views
Been looking all over and can't find the solution to this one. If I have a user id hash (the 32 alphanumeric), how do I find the user's name which is associa...
one year ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
Hi all, I am having an issue with ClearML Scheduler where it doesn't reuse the task as I would expected. I have raised this issue . Has anyone else experienc...
one year ago
0 Votes
3 Answers
861 Views
0 Votes 3 Answers 861 Views
Hi all, I'm getting set up with GCP Autoscaler and I'm wondering what image people typically use for running docker jobs. The image that I was using projects...
one year ago
0 Votes
7 Answers
698 Views
0 Votes 7 Answers 698 Views
GCP AutoScaler limits not working correctly? Hi there, I have encountered some unexpected behaviour with the GCP Autoscaler. The AutoScaler does not appear t...
one year ago
0 Hi Today I'M Suddenly Getting This

I am having the same error since yesterday on Ubuntu. Works fine on Mac.

I cannot ping api.clear.ml

one year ago
0 Hi Today I'M Suddenly Getting This

It started working fine this morning again for me

one year ago
0 There Is A

Thank you!

one year ago
0 Is It Possible To Merge

I ran again without the debug mode option and got this error:

> 
> Starting Task Execution:
> 
> 
> Traceback (most recent call last):
>   File "/root/.clearml/venvs-builds/3.6/code/interactive_session.py", line 377, in <module>
>     from tcp_proxy import TcpProxy
> ModuleNotFoundError: No module named 'tcp_proxy'
> 
> Process failed, exit code 1
one year ago
0 Is It Possible To Merge

I did not run clearml-session from within the git repo.

one year ago
0 Gcp Autoscaler Limits Not Working Correctly?

Apologies for the delay.

I have obfuscated the private information with XXX . Let me know if you think any of it is relevant.

{"gcp_project_id":"XXX","gcp_zone":"XXX","subnetwork":"XXX","gcp_credentials":"{\n  \"type\": \"service_account\",\n  \"project_id\": \"XXX\",\n  \"private_key_id\": \"XXX\",\n  \"private_key\": \"XXX\",\n  \"client_id\": \"XXX\",\n  \"auth_uri\": \"XXX\",\n  \"token_uri\": \"XXX\",\n  \"auth_provider_x509_cert_url\": \"XXX\",\n  \"client_x509_cert_url\": \"...
one year ago
0 Gcp Autoscaler Limits Not Working Correctly?

Let me know if you need additional information.
image

one year ago
0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

This is something you can do in the GCP console, one would imagine it can be done using their python library.

I think the limitation is that you can only pass a relative subnet path in the GCP Autoscaler console. Then, by the looks of the error message, the ClearML Autoscaler constructs the full path under the hood /project/<project_id>/subnet/<subnet_id> .

I'd like the option to specify the full path myself in the Autoscaler which would then allow me to use a shared subnet.

one year ago
0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

@<1537605940121964544:profile|EnthusiasticShrimp49> How do I specify to not attach a gpu? I thought ticking 'Run in CPU Mode' would be sufficient. Is there something else I'm missing?

one year ago
0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

@<1523701087100473344:profile|SuccessfulKoala55> Just following up as I figured out what was happening here and could be useful for the future.

The prefilled value for Number of GPUs in the GCP Autoscaler is 1 .

When one ticks Run in CPU mode (no gpus) it hides the GPU Type and Number of GPUs fields. However, the value which was these fields are still submitted in the API Request (I'm guessing here) when the Autoscaler is launched.

Hence, to get past this, you need to...

one year ago
0 Hi, I'M Trying To Understand A Bit More About

Thanks. I am trying to completely minimise the start up time. Given I am using a docker image which has clearml-agent and pip installed, is there a way I can skip the installation of this when a task starts up using the daemon?

2 months ago
0 Hi, I'M Trying To Understand A Bit More About

@<1523701087100473344:profile|SuccessfulKoala55> Thanks for getting back to me. My image contains clearml-agent==1.9.1 . There is a recent release to 1.9.2 and now on every run the agent installs this newer version thanks to the -U flag which is being passed. From the docs it looks like there may be a way to prevent this upgrade but it's not clear to me exactly how to do this. Is it possible?

one month ago
0 Is It Possible To Merge

No particular reason. This was our first time trying it and it seemed the quickest way to get off the ground. When I try without I have a similar error trying to connect although that could be due to the instance.

one year ago
0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

Thanks Jake. Do you know how I set the GPU count to 0?

one year ago
0 Hi All, I'M Getting Set Up With Gcp Autoscaler And I'M Wondering What Image People Typically Use For Running Docker Jobs. The Image That I Was Using

@<1523701070390366208:profile|CostlyOstrich36> Thank you. Which docker image do you use with this machine image?

one year ago
0 Hi, Is Anyone Can Access

I'm also having issues.

one year ago
0 Hi All, I Have An Ongoing Issue With Queues.

Hi,

I've managed to fix it.

Basically, I had a tracker running on our queues to ensure that none of them were lagging. This was using get_next_task from APIClient().queues .

If you call get_next_task it removes the task from the queue but does not put it into another state. I think because typically get_next_task is immediately followed by something to make the task run in the daemon or delete it.

Hence you end up in this weird state were the task thinks its queued bec...

one year ago
0 Hello Everyone, I Am Having Issues With The Gcp Autoscaler. This Is In The Output Logs:

👍 Thanks for getting back to me.

Another issue I found was that I could only use vpc subnets from the google project I am launching the VMs in.

I cannot use shared vpc subnets from another project. This would be a useful feature to implement as GCP recommends segmenting the cloud estate so that the vpc and VMs are in different projects.

one year ago
0 Hi All, I Am Seeing This In My Logs:

Yep, that seems to be working fine.

one year ago
0 Hi All, I Am Seeing This In My Logs:

Could you give me some insight into why this is please?

one year ago
one year ago
0 Hi All, I Am Seeing This In My Logs:

Further to this, I have inspected further. This is working as expected for ClearML 1.8.3 but not for ClearML 1.9.0.

I looked at the commits and found that a change had been made to the _decode_image method:

None

This aligns with the error message I'm seeing:

2023-02-08 15:17:25,539 - clearml - WARNING - Error: I/O operation on closed file.

Can this be actioned for the next release plea...

one year ago
0 Hi All, I Am Seeing This In My Logs:

The code is quite nested by I've tried to extract out the important parts ( summmary_writer is a tensorboard logger).

self.figure, (ax1, ax2, axc) = plt.subplots(1, 3, figsize=(total_width, total_height), facecolor="white")

self.summary_writer = self.tb_logger.experiment

self.summary_writer.add_figure(Partition.TRAINING.value, train_plot.figure, global_step=self.current_epoch + 1) 

The train_plot.figure is a matplotlib figure created using seaborn.

Let me know if this...

one year ago
0 Hi All, I Am Seeing This In My Logs:

I am using ClearML version 1.9.1. In code, I am creating a plot using matplotlib. I am able to see this in Tensorboard but it is not available in ClearML Plots

one year ago
0 Been Looking All Over And Can'T Find The Solution To This One. If I Have A User Id Hash (The 32 Alphanumeric), How Do I Find The User'S Name Which Is Associated With This Id Using Clearml Python Sdk? Cheers.

Furthermore, when using APIClient() , users is not a valid endpoint at all.

class APIClient(object):

    auth = None  # type: Any
    queues = None  # type: Any
    tasks = None  # type: Any
    workers = None  # type: Any
    events = None  # type: Any
    models = None  # type: Any
    projects = None  # type: Any

This is taken from clearml/backend_api/session/client/client.py

one year ago
Show more results compactanswers