SubstantialElk6
Moderator
115 Questions, 310 Answers
  Active since 10 January 2023
  Last activity 11 months ago

Reputation

0

Badges 1

282 × Eureka!
0 Votes
2 Answers
557 Views
Hi, we have been using ClearML in our development environment to train our models and benchmarking them. I was wondering what is ClearML's role in transition...
3 years ago
0 Votes
2 Answers
695 Views
one year ago
0 Votes
5 Answers
622 Views
Hi, i have a question about clearml-data. Clearml-Data probably does well on Data Versioning, but when it comes to actual loading of data, are there examples...
2 years ago
0 Votes
7 Answers
581 Views
Hi, I would like to understand more about how ClearML deals with code. I noticed that I am able to read the source code of the Python script that I have used a...
3 years ago
0 Votes
20 Answers
625 Views
3 years ago
0 Votes
3 Answers
618 Views
Hi, I have multiple tasks set up in a complex pipeline. How can I: define, prior to running the pipeline, which tasks run on which remote queue using w...
2 years ago
0 Votes
30 Answers
601 Views
Hi, i have been getting the following for a while. Is there a more detailed log i can look into? This happens on both https and http. 2021-05-27 08:47:02,539...
2 years ago
0 Votes
22 Answers
655 Views
Hi, ClearML console leaks credentials passed in as Env Vars. The issue remains with clearml version==1.1.1.135 - 1.1.1 - 2.1.4 (As listed on the profile page...
2 years ago
0 Votes
5 Answers
675 Views
Hi, i had a task successfully completed. Then i cloned it and enqueued it again without any changes. But the task ends up with an error. Here's the logs, not...
2 years ago
0 Votes
0 Answers
138 Views
Hi, is there a way to export ClearML experiments into a file package and import them on another ClearML instance?
2 months ago
0 Votes
1 Answers
557 Views
Hi, your latest changelog lists a new function, Task.launch_multi_node(), for distributed experiment execution. In the context of using with K8S glue, wi...
11 months ago
0 Votes
7 Answers
738 Views
Hi, i was using the K8S Glue and it worked fine on one project but didn't work on another. At the point just before a git clone was executed, i get the error...
3 years ago
0 Votes
3 Answers
790 Views
Hi, i was adding data using clearml-data and get the following consistent errors. Retrying (Retry(total=237, connect=237, read=240, redirect=240, status=240)...
one year ago
0 Votes
4 Answers
634 Views
Hi recently upgraded all the clearml, clearml-server, clearml-agent. Now running k8s glue with clearml-agent=1.0.1rc1. python3 k8s_glue_example.py --queue 1b...
2 years ago
0 Votes
8 Answers
584 Views
I'm just getting this in my agent-run task. I would appreciate it if someone could advise where externalrequirement is pointing. RequirementsManager handler rais...
2 years ago
0 Votes
5 Answers
718 Views
2 years ago
0 Votes
4 Answers
631 Views
Hi, I noticed that all other users can see my experiments. Does ClearML have a way to allow only certain groups of people to see each other's work?
3 years ago
0 Votes
1 Answers
596 Views
Hi, what would happen if you have different clearml-agents of different versions running? Would it have any adverse effects?
2 years ago
0 Votes
4 Answers
626 Views
Hi, i'm working on a post deployment data and model monitoring using ClearML. The idea is this. Use ClearML to serve my model out to Triton. Data MonitoringC...
2 years ago
0 Votes
1 Answers
610 Views
Hi, i was running my agent and had a few scripts for agent.extra_docker_shell_script. but when I looked through the logs, they were not executed. Any idea wh...
2 years ago
0 Votes
22 Answers
693 Views
3 years ago
0 Votes
4 Answers
659 Views
Hi, I am trying to understand clearml-data and only found this article explaining it. https://github.com/allegroai/clearml/blob/master/docs/datasets...
3 years ago
0 Votes
4 Answers
696 Views
3 years ago
0 Votes
11 Answers
615 Views
Hi, i shifted my clearml setup to an on-premise disconnected env, which has a pip repo setup. I noted this warning, Trying pip install: /root/.clearml/venvs-...
3 years ago
0 Votes
1 Answers
572 Views
Hi, I'm running ClearML jobs using K8SGlue. When the job is running, the scalar for monitor:machine seems to be reporting Node statistics instead of the Pod ...
one year ago
0 Votes
3 Answers
562 Views
Hi, can I get ClearML to not print anything other than the prints from my code? The reason is that clearml is printing the username and passwords I passe...
2 years ago
0 Votes
1 Answers
599 Views
Hi, is there a pdf version of your documentation at https://clear.ml/docs/latest/docs ? We work mostly in an offline environment and would benefit from this ...
2 years ago
0 Votes
29 Answers
718 Views
Hi, I started my agent using clearml-agent daemon --gpus 0 --queue gpu --docker --foreground, with the following parameters in clearml.conf. default_docker:...
3 years ago
0 Votes
3 Answers
649 Views
Hi, is there a way I can supply credentials to clearml-data (CLI and Python) without going through clearml.conf?
2 years ago
0 Votes
8 Answers
665 Views
Hi, if i've ClearML agents installed on several servers, each with a single GPU. How can I train a gpt2 model that would require multiple GPUs?
one year ago
0 Hi, We Recently Upgraded Clearml To 1.1.1-135 . 1.1.1 . 2.14. The Task Init Is

Hi,
I'm running on Dell ECS storage appliance, which offers S3 compatibility.
yes http://ECS.ai is the DNS name of the server.
ClearML-models is the bucket.
Let me try with ip:port.

2 years ago
0 Hi, I Would Like To Understand More On How Clearml Deal With Codes.

Hi thanks.
So I suppose ClearML makes use of the information in the .git folder at the root of the script folder to gather that info.

I have yet to go through ClearML agent thoroughly. TimelyPenguin76, if I run a training with uncommitted changes and don't commit/push afterwards, then when I clone the task, won't ClearML agent be unable to pull that script from the git repo?

3 years ago
0 Hi, Several Changes Occurred Recently And I Would Like To Know If There's A Way To Verbose Catch All The Printout That Happening Within A K8S Glue Spawned Pod. We Have An Issue Where All Of Our New Remote_Execution Tasks Are Stuck In The 'Pending' Stage.

Some breakthrough. The problem is that we switched the web, api and files servers to use https (ssl) endpoints instead. I switched back to http endpoints to test this theory.

Although it's not printing the error, I suspect it's not able to connect due to the lack of the self-signed cert. Previously this wasn't an issue; not sure what changed in clearml_agent=1.1.0.

There's a secondary issue resulting from this; I will put it in a new thread.

2 years ago
0 Hi, I Am Trying To Use Clearml-Data To Upload My Data To S3, Which Is Password Protected. How Should I Indicate The Credentials After I Set --Storage S3://.... ?

I see, so it's a path. Another question: as far as I can tell, clearml-data will download entire datasets before starting training. This isn't ideal when we are dealing with billions of datasets (e.g. we might want to download a subset at a time, send it to the GPU for training, and then use the CPU to concurrently pull another subset). Any comments on this?
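
To make the "pull the next subset while training on the current one" idea concrete, here is a minimal, library-agnostic sketch. It does not use any clearml-data API: `fetch_subset` and `fake_fetch` are hypothetical placeholders for whatever actually downloads one subset, and the single background thread is just one possible design.

```python
import queue
import threading
from typing import Callable, Iterator, List, Sequence


def prefetch_subsets(
    subset_ids: Sequence[int],
    fetch_subset: Callable[[int], List[str]],
    max_prefetch: int = 1,
) -> Iterator[List[str]]:
    """Yield subsets in order while a background thread fetches the next ones."""
    buffer: "queue.Queue" = queue.Queue(maxsize=max_prefetch)
    sentinel = object()

    def worker() -> None:
        try:
            for subset_id in subset_ids:
                # put() blocks while the buffer is full, which bounds local storage use.
                buffer.put(fetch_subset(subset_id))
        finally:
            # Always signal the end so the consumer never waits forever.
            buffer.put(sentinel)

    threading.Thread(target=worker, daemon=True).start()
    while True:
        item = buffer.get()
        if item is sentinel:
            return
        yield item


def fake_fetch(subset_id: int) -> List[str]:
    # Hypothetical stand-in for a real download of one subset.
    return [f"subset{subset_id}/file{i}.jpg" for i in range(3)]


# The training loop would consume one subset while the next is being fetched.
batches = list(prefetch_subsets([0, 1, 2], fake_fetch))
```

With `max_prefetch=1`, at most one fetched subset waits in the buffer while the worker prepares the next, so only a few subsets are held locally at any time.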

3 years ago
0 Hi, I Am Trying To Use Clearml-Data To Upload My Data To S3, Which Is Password Protected. How Should I Indicate The Credentials After I Set --Storage S3://.... ?

like create multiple datasets?
create parent (all) - upload to S3
create child1 (first 100k)
create child2 (second 100k)...blah blah

Then only pull indices from the children. Technically workable, but not sure if it's the best approach since different people have different batch sizes in mind.
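
As a rough illustration of the parent/child split described above, the index ranges for each child could be computed as below. This is only a sketch of the bookkeeping: `child_ranges` is a hypothetical helper, the item count is made up, and no clearml-data calls are involved.

```python
from typing import List, Tuple


def child_ranges(total_items: int, chunk_size: int) -> List[Tuple[int, int]]:
    """Return half-open (start, stop) index ranges, one per child dataset."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [
        (start, min(start + chunk_size, total_items))
        for start in range(0, total_items, chunk_size)
    ]


# Hypothetical example: 250k items split into children of 100k each.
ranges = child_ranges(total_items=250_000, chunk_size=100_000)
```

The last child is simply shorter when the total is not a multiple of the chunk size, which is one reason a fixed split may not match everyone's batch size.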

3 years ago
0 Hi, I Am Trying To Use Clearml-Data To Upload My Data To S3, Which Is Password Protected. How Should I Indicate The Credentials After I Set --Storage S3://.... ?

Got that, thanks. Just to better understand: when clearml-data uploads my recursive folder of image data, it converts it into a compressed form with a different folder structure than the original datasets.

When my software pulls the data, I get back a str. How would we manipulate the data from there?
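
Assuming, as a later reply in this thread suggests, that the returned str is a path to a local folder holding the dataset files, one way to work with it is plain `pathlib`. The sketch below is an assumption-laden example, not ClearML's own API: `group_images_by_folder` and the demo folder names are hypothetical.

```python
import tempfile
from pathlib import Path
from typing import Dict, List

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def group_images_by_folder(local_path: str) -> Dict[str, List[str]]:
    """Map each sub-folder (relative to local_path) to the image files inside it."""
    root = Path(local_path)
    grouped: Dict[str, List[str]] = {}
    for file in sorted(root.rglob("*")):
        if file.is_file() and file.suffix.lower() in IMAGE_SUFFIXES:
            rel = file.relative_to(root)
            grouped.setdefault(rel.parent.as_posix(), []).append(rel.name)
    return grouped


# Demo on a throwaway folder standing in for the path returned by the dataset call.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "cats").mkdir()
    (Path(tmp) / "cats" / "a.jpg").touch()
    (Path(tmp) / "dogs").mkdir()
    (Path(tmp) / "dogs" / "b.png").touch()
    (Path(tmp) / "notes.txt").touch()  # non-image files are ignored
    demo = group_images_by_folder(tmp)
```

From there the file lists can be handed to whatever image loader the training code already uses.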

3 years ago
0 Hi, Several Changes Occurred Recently And I Would Like To Know If There's A Way To Verbose Catch All The Printout That Happening Within A K8S Glue Spawned Pod. We Have An Issue Where All Of Our New Remote_Execution Tasks Are Stuck In The 'Pending' Stage.

Hi, I don't think clearml agent actually ran at that point in time. All I can see in the pod is:
apt install of libpthread-stubs, libx11, libxau and libxcb1 packages.
pip install of clearml-agent.
After the above succeed, the pod just hangs there.

2 years ago
0 Hi, Several Changes Occurred Recently And I Would Like To Know If There's A Way To Verbose Catch All The Printout That Happening Within A K8S Glue Spawned Pod. We Have An Issue Where All Of Our New Remote_Execution Tasks Are Stuck In The 'Pending' Stage.

Ok. That brings me back to the spawned pod. At this point, clearml-agent and its config would be a contributing factor. Is the absence of /tmp/.clearml_agent.xxxxxx.cfg an issue?

2 years ago
0 I Am Facing This Error While Trying To Run My Code

I'm having the same problem. Are you using the latest clearml-agent? Does your docker image run as a root user by default?

one year ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

My assumption is that the agent will have pulled that off the client's clearml.conf.

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

I see. Can I take it that when the client uses
task.execute_remotely(queue_name="1gpu", exit_process=True)
then none of the content in its clearml.conf will be used, except for the API part, and ClearML simply uses whatever is on the agent side?
api {
    # Notice: 'host' is the api server (default port 8008), not the web server.
    api_server:
    web_server:
    files_server:
    # Credentials are generated using the webapp, `
    # Override with os environment: ...

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Setting the credentials on the agent machine means the users cannot use their own credentials, since a k8s glue agent serves multiple users.

Referencing your suggestion, we can configure output_uri on task.set_base_docker() but how should we do this for the credentials?

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Yes, it's on purpose; each user would have their own AWS credentials for default_output_uri.

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Hi SuccessfulKoala55, just to add, my clearml.conf (client) and clearml.agent.conf (agent) can have differing values. I'm not sure which one takes precedence and whether this could be the cause.

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Going back to the open source, I think that adding the credentials as part of the source code might allow to have "credentials" auto populate as part of the remote execution, wdyt?

Not sure how this will work when I can't supply the credentials to ClearML programmatically.

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Do you have more info on vault?
Actually, it only makes sense if the entire department or organisation is saving its models in a common repo. In our case this is not possible due to client security (e.g. training data from clients can potentially be 'reverse engineered' from trained models in future). So each department, and even each project, will need its own repo.

2 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

It would make sense on a very large resource cluster. Unfortunately, we have fewer than 50 GPUs to share. A multi-tenant SaaS would cut the resources into even smaller clusters and would not help with efficiency. Or would you have a suggestion?

2 years ago
0 Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

Hi AgitatedDove14, I changed everything to cuda 10.1 and tried again, with the same error. The section is as follows. I made sure torch==1.6.0+cu101 and torchvision==0.8.2+cu101 are in the pypi repo, but the same error still came up.
` # Python 3.6.9 (default, Oct 8 2020, 12:12:24) [GCC 8.4.0]
boto3 == 1.14.56
clearml == 0.17.4
numpy == 1.19.1
torch == 1.6.0
torchvision == 0.7.0

Detailed import analysis

**************************

IMPORT PACKAGE boto3

clearml.storage: 0

IMPORT PACKAG...

3 years ago
0 Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

Hi AgitatedDove14, what version should I change it to? I'm currently on v0.17.2rc3.

3 years ago
0 Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

I can't seem to find the fix to this. Ended up using an image that comes with torch installed.

3 years ago