SubstantialElk6

117 Questions, 310 Answers

Active since 10 January 2023

Last activity 2 years ago

Reputation

Badges 1

282 × Eureka!

Questions 117
Answers 310

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, There'S Something I Don'T Find Too Logical When Using Clearml And Its Agents. I Will Need To Run My Code Once On My Client Computer This Is Without Gpus. And Then I Will Need To Run It Via The Ui On Clearml Server That Has Gpus. Why Can'T I Configure

Hi, there's something i don't find too logical when using ClearML and its agents. I will need to run my code once on my client computer this is without GPUs....

clearml

4 years ago

0 Votes

1 Answers

2K Views

0 Votes 1 Answers 2K Views

Hi, How Is The Priority Of The Configuration Like? Which One Takes Precedence? For Example, Output_Uri

Hi, how is the priority of the configuration like? Which one takes precedence? For example, output_uri default_output_uri in clearml.conf on client files_ser...

dataset

3 years ago

0 Votes

6 Answers

2K Views

0 Votes 6 Answers 2K Views

Hi, We Are Planning To Move On To Openshift. Can I Ask If K8S-Glue Supports Openshift?

Hi, we are planning to move on to openshift. Can I ask if k8s-glue supports openshift?

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, Can I Default The Clearml Fileserver To A S3 Path?

Hi, can I default the ClearML fileserver to a s3 path?

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

I Got An Interesting Question From My Devs. If They Wish To Do Distributed Training, Is Clearml K8S Glue Suitable For It? Local Multiple Gpu: Just A Matter Of Assigning More Than One Gpu In The Yaml File Sent To The K8S Glue. Question Is How To Make This

I got an interesting question from my Devs. If they wish to do distributed training, is clearml k8s glue suitable for it? Local multiple GPU: just a matter o...

clearml

4 years ago

0 Votes

10 Answers

2K Views

0 Votes 10 Answers 2K Views

Hi, I Would Like To Check What Would Be The Recommended Hardware Specs For The Server Host Clearml Server. I Had One Configured With 32 Cpu Cores, 64Gb Ram And I Noticed That If We Have A Surge In Remote Task Creation, The Following Delays Occurs.

Hi, i would like to check what would be the recommended hardware specs for the server host ClearML server. I had one configured with 32 CPU cores, 64GB ram a...

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, Can I Get Clearml To Not Print Anything Other Than The Prints From My Codes? The Reason Is Because Clearml Is Printing The Username And Passwords I Passed To The Container Via Env Vars.

Hi, can i get ClearML to not print anything other than the prints from my codes? The reason is because clearml is printing the username and passwords i passe...

clearml

4 years ago

0 Votes

5 Answers

2K Views

0 Votes 5 Answers 2K Views

Hi, I Had A Task Successfully Completed. Then I Cloned It And Enqueued It Again Without Any Changes. But The Task Ends Up With An Error. Here'S The Logs, Not Sure What Went Wrong.

Hi, i had a task successfully completed. Then i cloned it and enqueued it again without any changes. But the task ends up with an error. Here's the logs, not...

clearml

4 years ago

0 Votes

4 Answers

2K Views

0 Votes 4 Answers 2K Views

Hi, I Am Trying To Understand Clearml-Data And Only Found This Piece Of Article Explaining It.

Hi, i am trying to understand clearml-data and only found this piece of article explaining it. https://github.com/allegroai/clearml/blob/master/docs/datasets...

dataset

4 years ago

0 Votes

4 Answers

2K Views

0 Votes 4 Answers 2K Views

Hi, I Noticed That All Other Users Can See My Experiments. Does Clearml Has The Feasibility To Only Allow Certain Groups Of People To See Each Other'S Work?

Hi, i noticed that all other users can see my experiments. Does ClearML has the feasibility to only allow certain groups of people to see each other's work?

clearml

4 years ago

0 Votes

1 Answers

2K Views

0 Votes 1 Answers 2K Views

Hi, Can Clearml-Server Support Replicaset In K8S?

Hi, can Clearml-Server support ReplicaSet in K8S?

clearml

3 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

I'M Getting This When Running With Keras Framework. Clearml.Storage - Error - Failed Uploading: [Errno 21] Is A Directory: 'Model.Savedmodel'.

I'm getting this when running with Keras framework. clearml.storage - ERROR - Failed uploading: [Errno 21] Is a directory: 'model.savedmodel'.

clearml

4 years ago

0 Votes

22 Answers

2K Views

0 Votes 22 Answers 2K Views

Hi, I Would Like To Pass In Some Pip Arguments That Clearml-Agent Would Include When Setting Up The Venv On The Containers. How Should I Specify This? The Argument In Question Are --Trusted-Host And --Find-Links . I Need Them As I'Ve Installed A Pypi Repo

Hi, I would like to pass in some pip arguments that clearml-agent would include when setting up the venv on the containers. How should I specify this? The ar...

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, I'Ve Multiple Tasks Setup In A Complex Pipeline. How Can I;

Hi, I've multiple tasks setup in a complex pipeline. How can I; Define prior to running the pipeline, which tasks to be running on which remote queue using w...

clearml

3 years ago

0 Votes

1 Answers

2K Views

0 Votes 1 Answers 2K Views

Hi, Would You Have A Working Example On This?

Hi, would you have a working example on this?

clearml

3 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, I Noted That If I Run My Codes On My Laptop With Remote_Execute Off A Python3.8 Venv, And When The Remote Task Starts Executing But The Image Is Installed With A Different Version Of Python, Say Python3.8, We Would Encounter Errors With Venv. At This

Hi, i noted that if I run my codes on my laptop with remote_execute off a python3.8 venv, and when the remote task starts executing but the image is installe...

clearml

3 years ago

0 Votes

30 Answers

2K Views

0 Votes 30 Answers 2K Views

Hi, I'M Getting This Long Error When Running

Hi, i'm getting this long error when running task.execute_remotely(queue_name="1gpu", exit_process=True) . I also notices an error Failed to fetching activit...

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, I'M Wondering If Clearml Did A Comparison Of Their Clearml Pipelines With Other Solutions Such As Apache Beam? Or If Clearml Supports Integration With Such Third Party Solutions?

Hi, i'm wondering if ClearML did a comparison of their ClearML Pipelines with other solutions such as Apache Beam? Or if ClearML supports integration with su...

clearml

3 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, Can I Default A Docker Image When Running A Pipeline? I Currently Set It As

Hi, can i default a docker image when running a pipeline? I currently set it as pipe = PipelineController(...) pipe.task.setbase_docker("ubuntu:20:04") pipe....

clearml

3 years ago

0 Votes

5 Answers

2K Views

0 Votes 5 Answers 2K Views

Hi, How Do I Switch Clearml Server To Run On Https With A Self Signed Cert?

Hi, how do I switch clearml server to run on https with a self signed cert?

clearml

4 years ago

0 Votes

9 Answers

2K Views

0 Votes 9 Answers 2K Views

Hi, Just To Check. Does The K8S Glue Install Torch By Default? I'M Getting

Hi, just to check. Does the k8s glue install torch by default? I'm getting Warning: could not resolve python wheel replacement for torch==1.8.0 even though i...

tensorflow

4 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

Hi, Is There Any Code Examples Of How Dataops Is Being Established?

Hi, is there any code examples of how DataOps is being established? https://clear.ml/products/clearml-dataops/

clearml

4 years ago

0 Votes

7 Answers

2K Views

0 Votes 7 Answers 2K Views

Hi, I Was Using The K8S Glue And It Worked Fine On One Project But Didn'T Work On Another. At The Point Just Before A Git Clone Was Executed, I Get The Error

Hi, i was using the K8S Glue and it worked fine on one project but didn't work on another. At the point just before a git clone was executed, i get the error...

clearml

4 years ago

0 Votes

10 Answers

2K Views

0 Votes 10 Answers 2K Views

Hi, We Noted That Using K8S Glue, There Are Some Situations Where The Task Cannot Be Registered As Error And Will Be Stuck At Pending. An Example Of One Situation Is When The Task Is Pulling A Docker Image That Doesn'T Exist. Is There A Way To Catch Such

Hi, we noted that using K8S glue, there are some situations where the task cannot be registered as error and will be stuck at pending. An example of one situ...

clearml

4 years ago

0 Votes

1 Answers

3K Views

0 Votes 1 Answers 3K Views

[Distributed Training] Hi, I Have A Clearml Setup With K8Sglue That Spins Up Pods Of 4 Gpus When Picking Tasks Off The Clearml Queue. We Would Now Want To Proceed With Multi-Node Training, And Some Of The Examples We Are Trying Are Here.

[Distributed Training] Hi, i have a ClearML setup with K8SGlue that spins up pods of 4 GPUs when picking tasks off the clearml queue. We would now want to pr...

clearml

2 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi, Is There A Way I Can Supply Credentials To Clearml-Data (Cli And Python) Without Going Thru The Clearml.Conf?

Hi, is there a way I can supply credentials to clearml-data (cli and python) without going thru the ClearML.conf?

dataset

4 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

I Just Getting This In My Agent Run Task. Would Appreciate If Someone Can Advise Where I Externalrequirement Is Pointing At.

I just getting this in my agent run task. Would appreciate if someone can advise where i externalrequirement is pointing at. RequirementsManager handler rais...

mlops

4 years ago

0 Votes

11 Answers

2K Views

0 Votes 11 Answers 2K Views

Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

Hi, i shifted my clearml setup to an on-premise disconnected env, which has a pip repo setup. I noted this warning, Trying pip install: /root/.clearml/venvs-...

pytorch

4 years ago

0 Votes

1 Answers

2K Views

0 Votes 1 Answers 2K Views

Hi, What Would Happen If You Have Different Clearml-Agents Of Different Versions Running? Would It Have Any Adverse Effects?

Hi, what would happen if you have different clearml-agents of different versions running? Would it have any adverse effects?

clearml

4 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

Hi, If I'Ve Clearml Agents Installed On Several Servers, Each With A Single Gpu. How Can I Train A Gpt2 Model That Would Require Multiple Gpus?

Hi, if i've ClearML agents installed on several servers, each with a single GPU. How can I train a gpt2 model that would require multiple GPUs?

clearml

2 years ago

Show more results

0 Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

AgitatedDove14 , would you elaborate on this resolution process?

4 years ago

0 Hi, I Would Like To Start Logging How Often My Users Uses Clearml. How Might I Query This Kind Of Information?

Ok sure. Thanks.

3 years ago

0 Hi, How Do I Switch Clearml Server To Run On Https With A Self Signed Cert?

Ok let me try that.

4 years ago

0 Hi, We Noted That Using K8S Glue, There Are Some Situations Where The Task Cannot Be Registered As Error And Will Be Stuck At Pending. An Example Of One Situation Is When The Task Is Pulling A Docker Image That Doesn'T Exist. Is There A Way To Catch Such

Hi SuccessfulKoala55 , is there a channel here that posts version updates?

4 years ago

0 Hi, I'M Running The Following And Encountering Some Ssl Errors.

ok that's the latest? Will do.

4 years ago

0 Hi. Try To Use Clearml On Work. I'M Have Problem With Clearml-Agent, Because On Work We Dont Have Internet Acceses. For Install Packages We Used Mirror Pypi (Not All Packages) And Manualy Add Package On Disk With Line In Pip.Conf --Follow-Link=~/Pypi. It

I used nvcr pytorch image and instruct clearml to inherit global dependencies. No need to install torch and work well.

4 years ago

0 Hi Community, I’Ve Just Posted My First Blog Post About Mlops. I Am Open To Any Suggestions.

Here's my two cents worth.
I thought its really nice to start off the topic highlighting 'pipelines', its unfortunately one of the most missed component when ppl start off with ML work. Your article mentioned about drfits and how MLOps process covered it. I thought there are 2 more components that was important and deserves some mention.Retraining pipelines. ML engineers tend not to give much thought to how they want to transit a training pipeline in development to a automated retraining pipe...

3 years ago

0 Hi, I'M Running The Following And Encountering Some Ssl Errors.

clearml=1.0.3
python=3.8.10
clearml-data upload --id 12314jhg42342j4j --storagehttp://ecs.ai is an on-prem DELL EMC ECS that serves as our S3 storage configured with s self signed cert.

4 years ago

0 Hi, I Had A Task Successfully Completed. Then I Cloned It And Enqueued It Again Without Any Changes. But The Task Ends Up With An Error. Here'S The Logs, Not Sure What Went Wrong.

Space is way above nominal. What created this folder that it's trying to process? What processing is this?
Processing /tmp/build/80754af9/attrs_1604765588209/workIs there any paths in the agent machine that i can clear out to remove any possible issues from previous versions?

4 years ago

0 Hi, If I'Ve Clearml Agents Installed On Several Servers, Each With A Single Gpu. How Can I Train A Gpt2 Model That Would Require Multiple Gpus?

From ClearML perspective, how would we enable this, considering we don't have direct control or even IP of the agents

2 years ago

0 Hi, Is There Any Code Examples Of How Dataops Is Being Established?

Create immutable and differentiable versions on-prem or in the cloud with our data agnostic solution.

4 years ago

0 Hi, I Would Like To Understand More On How Clearml Deal With Codes.

I see i understand better now. Thanks.

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

Thank. Gonna try that out. But i hit another snag. Strangely, the Agent is not creating the right venv. This is what the Agent created.
` pip:

asn1crypto==0.24.0
attrs==20.3.0
certifi==2020.12.5
chardet==4.0.0
cryptography==2.1.4
Cython==0.29.22
furl==2.1.0
future==0.18.2
humanfriendly==9.1
idna==2.6
importlib-metadata==3.7.0
jsonschema==3.2.0
keyring==10.6.0
keyrings.alt==3.0
orderedmultidict==1.0.1
pathlib2==2.3.5
psutil==5.8.0
pycrypto==2.6.1
pygobject...

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

That didn't work as well...

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

Hi, the problem is the same.

I noticed that its not checking out the latest version in gitlab. This latest version would contain the requirements.txt.
Using cached repository in "/root/.clearml/vcs-cache/pytorchmnist.f220373e7227ec760b28c7f4cd99b534/pytorchmnist" warning: redirecting to Note: checking out 'cfb833bcc70f3e10d3b6a96cfad3225ed682382b'.But i'm guessing this block below applied the diff..does it include the requirements.txt though?
` HEAD is now at cfb833b Upload New Fil...

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

Yes, as listed in the snippet. The torch library is torchvision.

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

Thanks. That's easy to miss as its not quite apparent in the main docs. How should i pass in env variables with Task?

4 years ago

0 Hi, If I'Ve Clearml Agents Installed On Several Servers, Each With A Single Gpu. How Can I Train A Gpt2 Model That Would Require Multiple Gpus?

Yeah.. issue is ClearML unable to talk to the nodes cos pytorch distributed needs to know their IP. There is some sort of integration missing that would enable this.

2 years ago

0 Hi, Is There Any Code Examples Of How Dataops Is Being Established?

The first is probably done using pipeline controllers, the second using Datasets or HyperDatasets. Its not very clear how the last one is achieved, especially on the searchable data catalogs.

4 years ago

0 Hi, We Are Using Gitlab And It Is A Security Requirement To Use Ssh Keys To Access The Repos For Each Individual. We Are Also Using K8S Glue. Is There Any Provisions To Do This Seamlessly?

what feature on this paid roadmap are you referring to? I am indeed communicating with Noem on paid features.

4 years ago

0 Hi, I'M Working On A Post Deployment Data And Model Monitoring Using Clearml. The Idea Is This.

Hi SuccessfulKoala55 , just wondering how i can follow up on this.

3 years ago

0 There Is An Annoying Bug In Docker Mode Which I Have Noticed Several Months Ago But Put Up With It For Some Reason, Despite I Kept Running Into It All The Time. Sometimes Agents Don'T Pull Layers For Updated Docker Images If The Image With The Same Name

Try set docker_force_pull: true under agent section of your agent's clearml.conf.

4 years ago

0 Hi, I Notice A New Behavuour With Clearml-Agent=1.1.0. When It Is Installing The Packages I Nrequirements.Txt, It Failed With.

Its hard to tell, but the agent change was a significant one. Unless python versions has something to do with it.

4 years ago

0 Hi

Hi i missed this. Is there a recording?

4 years ago

0 Hi, We Are Using Gitlab And It Is A Security Requirement To Use Ssh Keys To Access The Repos For Each Individual. We Are Also Using K8S Glue. Is There Any Provisions To Do This Seamlessly?

Thanks SuccessfulKoala55 . Just pm'ed him.

4 years ago

0 Hi, My Devsecops Team Has Raised Some Issues Of Us Deploying Clearml For Use. In Particular, They Are Not Happy With Docker.Sock Configuration As It Would Potentially Expose The Entire Cluster To Unauthorised View. Can We Do Without It?

Hi thanks. How about Agent, does its docker mode or k8s mode require docker.sock to be exposed?

4 years ago

0 Hi, I Started My Agent Using. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground, With The Following Parameters In Clearml.Conf.

Hi,
It did, nvidia/cuda:10.1-runtime-ubuntu18.04.

So if i need to set this every time, what is the following config for? And how do i pass in new env parameters?
` default_docker: {
# default docker image to use when running in docker mode
image: "dockerrepo/mydocker:custom"

    # optional arguments to pass to docker image
    # arguments: ["--ipc=host", ]
    arguments: ["--env GIT_SSL_NO_VERIFY=true",]
} `

4 years ago

This would be solved if --env GIT_SSL_NO_VERIFY=true is passed to the k8s pod that's spawned to run the job. Currently its not.

4 years ago

Thanks 👍 . Should i create an issue on Github?

4 years ago

It has always been there.

4 years ago

Show more results