SarcasticSquirrel56
Moderator
16 Questions, 144 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

137 × Eureka!
0 Votes
13 Answers
2K Views
Hi folks, I have a question related to the storage of artifacts, as it is not entirely clear to me where to configure it. If I read the documentation https:/...
3 years ago
0 Votes
11 Answers
2K Views
Good morning folks, I am setting up ClearML on a (self-hosted) K8s cluster using the https://github.com/allegroai/clearml-helm-charts/blob/main/charts/clearm...
3 years ago
0 Votes
14 Answers
2K Views
Hi folks, I have installed ClearML on Kubernetes using the helm chart, but I had to specify three different domains for the ui, apiserver and fileserver. Is t...
3 years ago
0 Votes
6 Answers
2K Views
Hi folks, good morning 🙂 In our setup we have a set of queues that do not use any GPU resources. Yet, when I run an experiment in such queues, we see a Warn...
3 years ago
0 Votes
15 Answers
2K Views
3 years ago
0 Votes
6 Answers
2K Views
3 years ago
0 Votes
7 Answers
2K Views
3 years ago
0 Votes
17 Answers
2K Views
3 years ago
0 Votes
31 Answers
124K Views
3 years ago
0 Votes
4 Answers
2K Views
I do have one question about using the helm chart: is there any way to specify the users in the values.yaml?
3 years ago
0 Votes
31 Answers
128K Views
3 years ago
0 Votes
31 Answers
109K Views
Hi folks, I just deployed a ClearML agent using the Helm chart. I have a few doubts: after the deployment, I see a new queue called k8s_scheduler, which I di...
3 years ago
0 Votes
19 Answers
2K Views
Hi folks, one question: I have a script that looks like: import clearml as cml import numpy as np from sklearn.linear_model import LogisticRegression from sk...
3 years ago
0 Votes
6 Answers
2K Views
Hi folks, I have a question on something that is not clear to me from reading the documentation at https://clear.ml/docs/latest/docs/clearml_agent/ From what I ...
3 years ago
0 Votes
8 Answers
2K Views
3 years ago
0 Votes
31 Answers
121K Views
3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

Hi Jake, sorry I left the office yesterday. On my laptop I have clearml==1.6.4

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

actually there are some network issues right now, I'll share the output as soon as I manage to run it

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

sure, give me a couple of minutes to make the changes

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

Thanks Martin. I'll add this and check whether it fixes the issue, but I don't quite understand it. The local code doesn't need to import pandas, because the get method returns a DataFrame object that has a .loc method.
I was expecting the remote experiment to behave the same way, so why do I need to import pandas there?
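
One possible explanation, stated here as an assumption rather than something confirmed in this thread: if the remote environment is built from the imports detected in the script, a package that is only used indirectly (through an object returned by another call) would never be listed. The sketch below uses the standard `ast` module to show which top-level packages a script visibly imports; the `SCRIPT` text and the `top_level_imports` helper are hypothetical and are not part of ClearML.

```python
import ast


def top_level_imports(source):
    """Return the set of top-level package names explicitly imported by `source`."""
    found = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            found.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
            found.add(node.module.split(".")[0])
    return found


# Hypothetical script: it calls .loc on a DataFrame but never imports pandas.
SCRIPT = '''
import clearml as cml
import numpy as np

def use(artifact):
    df = artifact.get()
    return df.loc[0]
'''

detected = top_level_imports(SCRIPT)
print(sorted(detected))  # ['clearml', 'numpy']
```

Under that assumption pandas is absent from the detected set, which would explain why an explicit `import pandas` makes a difference remotely even though the local run works without it.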

3 years ago
0 Hi Folks, Occasionally When I Clone A Job And Enqueue It, Instead Of Being Processed By The Expected Queue, A New Queue (With Some Id That Looks Like An Hash) Is Created Instead, And The Experiment Hangs In A "Pending" State. When This Happens, If I Abor

If I now reset the experiment and enqueue it to the gpu queue (while the user-properties configuration for k8s-glue in the experiment is still set to cpu), the experiment is left in a Pending state, and in the K8sGlue Agent for the gpu queue I can see an error similar to the one in the cpu agent:

` No tasks in Queues, sleeping for 5.0 seconds
No tasks in queue 75174e0e7ac047f195ab4dce6e9f03f7
No tasks in Queues, sleeping for 5.0 seconds
FATAL ERROR:
Traceback (most recent call...

3 years ago
0 Hi Folks, Good Morning

Hi folks, I think I found the issue: the documentation mentions setting NVIDIA_VISIBLE_DEVICES to "", when in reality it should be "none" according to the code:

if Session.get_nvidia_visible_env() == 'none':
    # NVIDIA_VISIBLE_DEVICES set to none, marks cpu_only flag
    # active_gpus == False means no GPU reporting
    self._active_gpus = False
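
The check quoted above can be reproduced outside ClearML with a small stand-alone sketch. The helper `is_cpu_only` is hypothetical and only mirrors the string comparison in the snippet; it is not ClearML's implementation and makes no claim about any normalization the real `Session.get_nvidia_visible_env()` may perform.

```python
import os


def is_cpu_only(environ=None):
    """Hypothetical helper mirroring the quoted check: only the literal
    value 'none' marks the process as CPU-only."""
    environ = os.environ if environ is None else environ
    return environ.get("NVIDIA_VISIBLE_DEVICES") == "none"


# An empty string does not satisfy the comparison, "none" does.
print(is_cpu_only({"NVIDIA_VISIBLE_DEVICES": ""}))      # False
print(is_cpu_only({"NVIDIA_VISIBLE_DEVICES": "none"}))  # True
```

With a comparison of this shape, setting the variable to an empty string would leave GPU reporting active, which matches the warning described in the question.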

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

the same that is available in the agent: - clearml==1.6.4

3 years ago
0 Hi Folks I Have A Problem I Can't Understand. Plots Are Not Shown When Experiments Are Executed From The Ui. For Example, If I Run The Code On My Laptop, And I Go To The Experiment Page I Can See Correctly The Plots: But If I Then Clone The Task, And Ex

OK, it wasn't the clearml.conf settings...

In the deployment I was referring to the fileserver, apiserver, etc. with the internal kubernetes dns names.
I changed them to the one exposed to the users (the same I have in my local clearml.conf) and things work.

But I can't really figure out why that would be the case...
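
A plausible reason, offered as an assumption rather than something confirmed in this thread: if the task records plot and file links using whatever fileserver address it was configured with, then links built from an internal Kubernetes service name cannot be resolved by a browser outside the cluster. The heuristic below flags such hosts; the function name and the example URLs are hypothetical, and `.svc.cluster.local` assumes the default cluster domain.

```python
from urllib.parse import urlparse


def looks_cluster_internal(url):
    """Heuristic: True if the host looks like an in-cluster Kubernetes service name.

    Bare names (no dot) and names ending in .svc or .svc.cluster.local are
    treated as internal. This is an illustration, not an exhaustive rule.
    """
    host = urlparse(url).hostname or ""
    return "." not in host or host.endswith((".svc", ".svc.cluster.local"))


# Hypothetical URLs for illustration only.
print(looks_cluster_internal("http://clearml-fileserver:8081/project/plot.png"))  # True
print(looks_cluster_internal("https://files.example.com/project/plot.png"))       # False
```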

3 years ago
0 Hi Folks, Occasionally When I Clone A Job And Enqueue It, Instead Of Being Processed By The Expected Queue, A New Queue (With Some Id That Looks Like An Hash) Is Created Instead, And The Experiment Hangs In A "Pending" State. When This Happens, If I Abor

and in the logs of the K8s Glue I see an exception occurred:

` No tasks in queue 54d3edb05a89462faaf51e1c878cf2c7
No tasks in Queues, sleeping for 5.0 seconds
No tasks in queue 54d3edb05a89462faaf51e1c878cf2c7
No tasks in Queues, sleeping for 5.0 seconds
FATAL ERROR:
Traceback (most recent call last):
File "/usr/local/lib/python3.6/dist-packages/urllib3/connectionpool.py", line 710, in urlopen
chunked=chunked,
File "/usr/local/lib/python3.6/dist-packages/urllib3/connectionpool.py", l...

3 years ago
0 Good Morning Folks, I Am Setting Up Clearml On A (Self-Hosted) K8S Cluster Using The

Right now I see the default agent that comes with the helm chart...

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

About .get_local_copy... would that then work in the agent though?
Because I understand that there might not be a local copy in the Agent?

3 years ago
0 Hi Folks, One Question: I Have A Script That Looks Like:

And if instead I want to force "get()" to return me the path (e.g. I want to read the csv with a library that is not pandas) do we have an option for that?
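
A minimal sketch of the path-based approach discussed in this thread: it assumes the artifact object exposes `get_local_copy()` returning a file path, and reads the file with the standard `csv` module instead of pandas. `FakeArtifact` is a stand-in used only to make the example runnable without a server; it is not a ClearML class, and the file contents are made up.

```python
import csv
import os
import tempfile


def read_csv_artifact(artifact):
    """Read a CSV artifact into a list of row dicts using only the stdlib.

    Assumes `artifact.get_local_copy()` returns the path of a local file.
    """
    path = artifact.get_local_copy()
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle))


class FakeArtifact:
    """Stand-in for illustration only; not a ClearML class."""

    def __init__(self, path):
        self._path = path

    def get_local_copy(self):
        return self._path


with tempfile.TemporaryDirectory() as tmp:
    csv_path = os.path.join(tmp, "data.csv")
    with open(csv_path, "w", newline="") as handle:
        handle.write("a,b\n1,2\n")
    rows = read_csv_artifact(FakeArtifact(csv_path))

print(rows)
```

With the real SDK the same helper would be called on an entry of a task's `artifacts` mapping, for example `read_csv_artifact(task.artifacts["my_csv"])`, where the artifact name `my_csv` is a placeholder.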

3 years ago
0 Hi Folks, I Have Installed Clearml On Kubernetes Using The Helm Chart, But I Had To Specify Three Different Domains For The Ui, Apiserver And Fileserver. Is There Any Way To Let Clearml Know That The Apiserver Is At

OK I could connect with the SDK, so everything is working, I'd just like to get the right hosts shown in the UI when a new token is created

3 years ago
0 Hi Folks, I Did A Deployment Of Clearml Using The K8S Helm Chart, And I Set The Agent Using K8S Glue. I Run A Task Locally, And I Went To The Ui Cloned The Experiment And Scheduled It In The Default Queue. After Doing This, I See That The Experiment Is Q

Hi Martin, I'll try to get the logs on Monday, though the K8s configuration doesn't "scare" me, I can solve that with my colleagues.
But I'll share it if it helps debug the issue

3 years ago
0 Hi Folks I Have A Problem I Can't Understand. Plots Are Not Shown When Experiments Are Executed From The Ui. For Example, If I Run The Code On My Laptop, And I Go To The Experiment Page I Can See Correctly The Plots: But If I Then Clone The Task, And Ex

And this is the list of variables defined in the K8SGlue pod:
` CLEARML_REDIS_MASTER_PORT_6379_TCP_PROTO
CLEARML_REDIS_MASTER_SERVICE_HOST
CLEARML_REDIS_MASTER_PORT
CLEARML_MONGODB_PORT_27017_TCP
CLEARML_ELASTIC_MASTER_PORT_9300_TCP_PROTO
CLEARML_WEBSERVER_SERVICE_HOST
K8S_GLUE_EXTRA_ARGS
CLEARML_ELASTIC_MASTER_PORT_9300_TCP_PORT
CLEARML_FILESERVER_PORT_8081_TCP_PROTO
HOSTNAME
CLEARML_MONGODB_PORT_27017_TCP_PORT
CLEARML_MONGODB_PORT
CLEARML_ELASTIC_MASTER_SERVICE_PORT
CLEARML_FILESERVER_PORT_...

3 years ago
0 Hi Folks, I Did A Deployment Of Clearml Using The K8S Helm Chart, And I Set The Agent Using K8S Glue. I Run A Task Locally, And I Went To The Ui Cloned The Experiment And Scheduled It In The Default Queue. After Doing This, I See That The Experiment Is Q

By the way, after fixing the agentservice issue, and having the pod configured correctly, now I see an error in the agentgroup-cpu pod, because it says that the token is not the correct one:

http://:8081 http://:8080 `

WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ConnectTimeoutError(<pip._vendor.urllib3.connection.HTTPSConnection object at 0x7fa4b00438d0>, 'Connection to pypi.org timed out. (connect timeout=15)')':...

3 years ago