Sorry, I don't quite understand this. The task itself was submitted as I ran the code on the client. I suppose the dependency requirements would be copied over as the experiment is cloned?
It's hard to tell, but the agent change was a significant one. Unless Python versions have something to do with it.
Yes, as listed in the snippet. The torch library is torchvision.
Hi,
It did, nvidia/cuda:10.1-runtime-ubuntu18.04.
So if I need to set this every time, what is the following config for? And how do I pass in new env parameters?
` default_docker: {
    # default docker image to use when running in docker mode
    image: "dockerrepo/mydocker:custom"
    # optional arguments to pass to docker image
    # arguments: ["--ipc=host", ]
    arguments: ["--env GIT_SSL_NO_VERIFY=true",]
} `
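For what it's worth, if additional env vars work the same way as the GIT_SSL_NO_VERIFY example above, I'd expect each one to go in as its own --env entry in the arguments list (the second variable name below is just a placeholder):
` default_docker: {
    image: "dockerrepo/mydocker:custom"
    # one --env docker argument per variable (names are examples only)
    arguments: ["--env GIT_SSL_NO_VERIFY=true", "--env MY_EXTRA_VAR=some_value",]
} `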
Is this an env var?
CLEARML_CONFIG_FILE
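If it helps, this is how I'd point the SDK at an alternate config file (the path and script name are placeholders):
` export CLEARML_CONFIG_FILE=/path/to/custom_clearml.conf
python train.py `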
Hi, the problem is the same.
I noticed that it's not checking out the latest version in GitLab. This latest version would contain the requirements.txt.
` Using cached repository in "/root/.clearml/vcs-cache/pytorchmnist.f220373e7227ec760b28c7f4cd99b534/pytorchmnist"
warning: redirecting to
Note: checking out 'cfb833bcc70f3e10d3b6a96cfad3225ed682382b'. `
But I'm guessing this block below applied the diff. Does it include the requirements.txt though?
` HEAD is now at cfb833b Upload New Fil...
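A quick way I'd check whether that commit actually contains the requirements.txt (commit hash taken from the log above):
` git ls-tree -r --name-only cfb833bcc70f3e10d3b6a96cfad3225ed682382b | grep requirements.txt `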
OK. I noted this is due to the venv_update setting. It needs to be disabled, as it has a dependency on an internet URL. We can close this.
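For reference, a minimal sketch of the change, assuming the setting lives under the agent section of clearml.conf as venv_update (worth verifying against your agent version's default config):
` agent {
    # venv_update speeds up venv creation but pulls from an external URL,
    # so it must be off in a restricted-internet setup
    venv_update {
        enabled: false
    }
} `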
Running ` git diff ` in my terminal in this repo gave nothing. Nothing at all.
OK, that works. Thanks.
Yes, for both clearml and clearml-agent.
Thanks, that did solve the problem; the tasks are running again.
Is there any way to see an error log from that?
Hi AgitatedDove14, I'm trying out passing the env via the code instead:
` task.set_base_docker("nvcr.io/nvidia/tensorflow:19.11-tf2-py3 --env TRAINS_AGENT_GIT_USER=git_username_here --env TRAINS_AGENT_GIT_PASS=git_password_here") `
So the strange thing is that when my k8s glue pulls a task, this happens.
` Pulling task xxxxxxxxxx launching on kubernetes cluster
Pushing task xxxxxxxxxx into temporary pending queue
Kubernetes scheduling task id=xxxxxxxxxxxx
skipping docker argument TRAINS_AGENT_GIT_USE...
I think in general, the 'published' action can be considered an 'approval'. The question is, how do we control who has the authority to 'publish'? The Web UI today does not support any uploads outside of the coding environment; it would be nice if that were supported. But for now, the only workaround is to include parameters that store document URLs in the user properties.
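As a sketch of that workaround, assuming clearml's Task.set_user_properties (the project/task names and the URL are placeholders):
` from clearml import Task

task = Task.init(project_name="demo", task_name="approval example")
# store a pointer to the approval document as a user property,
# visible in the Web UI's USER PROPERTIES section
task.set_user_properties(approval_doc_url="https://docs.example.com/approval.pdf") `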
Do you have more info on vault?
Actually it only makes sense if the entire department or organisation is saving its models in a common repo. In our case this is not possible due to client security (e.g. training data from clients could potentially be 'reverse engineered' from trained models in the future). So each department, and even each project, will need its own repo.
Thanks SuccessfulKoala55, how might I do this cleanup? Does this grow with more use of ClearML? And to add, we save all artifacts onto a remote S3 server.
Hi, it makes sense if I only had to change hyperparameters, but that's not the case when I'm still changing the model architecture (training code), then training and repeating.
Hi, I don't think clearml-agent actually ran at that point in time. All I can see in the pod is:
apt install of the libpthread-stubs, libx11, libxau and libxcb1 packages
pip install of clearml-agent
After the above are successful, the pod just hangs there.
Thanks SuccessfulKoala55. I can try my hand at a patch. But the pod spin-up is handled by the k8s glue, which has no link to the client side. How should the client pass the key over to the k8s glue at runtime via the ClearML server?
No, I didn't mention this particular issue in the Git issue. Only the apply template.yml part is in the issue.
[root@2c7498711bef elasticsearch]# curl -XGET
`
yellow open events-training_stats_scalar-d1bd92a3b039400cbafc60a7a5b1e52b 4hAFNtGkRr-CHNGnUYfbTA 1 1 4724 271 660.9kb 660.9kb
yellow open events-log-d1bd92a3b039400cbafc60a7a5b1e52b M3qgFy1HRU2PibDOr1YOdw 1 1 1221 20 1013.6kb 1013.6kb
red open worker_stats_d1bd92a3b039400cbafc60a7a5b1e52b_2021-05 EQK8mnlhRxCrrKK3clcUFA 1 1
red open queue_metrics_d1bd92a3b039400cbafc60a7a5b1e52b_...
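If it's useful for digging into the red indices, the standard Elasticsearch endpoints (assuming the default port on the ES container):
` curl -XGET 'http://localhost:9200/_cluster/health?pretty'
curl -XGET 'http://localhost:9200/_cluster/allocation/explain?pretty' `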
What's the difference between --template-yaml and --overrides-yaml? I used the latter to ensure the GPU is passed in.
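For context, the kind of overrides file I'm using to pass the GPU through; this is only a sketch using the standard k8s pod-spec resource keys, and the exact nesting the glue expects is worth verifying:
` # overrides.yaml (sketch): request one GPU via standard pod-spec resource keys
resources:
  limits:
    nvidia.com/gpu: 1 `
It's then passed on the glue's command line, e.g. ` python k8s_glue_example.py --queue gpu_queue --overrides-yaml overrides.yaml ` (the queue name is a placeholder).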
This is probably the whole script:
` kubectl get nodes
pip install clearml-agent
python k8s_glue_example.py `
Yes it is! But ClearML didn't support multi-node training out of the box in a way that streamlines the process. So we are trying to figure out a way to do it.
I think the default action of the clearml-agent k8s glue when running a task is to create a virtual env and install the dependencies. So I'm just checking how to change that behaviour to use the global packages instead.
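The knob I'm looking at, assuming it's agent.package_manager.system_site_packages in clearml.conf (it appears in the agent's default config; worth double-checking for the k8s glue image):
` agent {
    package_manager {
        # when true, the venv the agent builds inherits the system/global
        # site-packages instead of starting from an empty environment
        system_site_packages: true
    }
} `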
Space is way above nominal. What created this folder that it's trying to process? What processing is this?
` Processing /tmp/build/80754af9/attrs_1604765588209/work `
Are there any paths on the agent machine that I can clear out to remove any possible issues from previous versions?
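The caches I'd try clearing, assuming the default locations under ~/.clearml (the vcs-cache path shows up in the log earlier):
` rm -rf ~/.clearml/venvs-builds \
       ~/.clearml/pip-download-cache \
       ~/.clearml/vcs-cache `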
Hi CostlyOstrich36, what you described is a task. I was referring to the pipeline controller.
Is there enterprise support for k8s glue on OpenShift?