Okie, now I get it. I set up the clearml-agent on an EC2 instance, and it works now.
Thanks
Could it be another application's "elasticsearch-pv" and not clearml's?
It'd be good if there was a YAML file to deploy clearml-agents into the k8s system.
However, I am able to get it to work if I launch a clearml-agent outside the Kubernetes ecosystem.
When I push a job to an agent node, I get this error:
"Error response from daemon: network None not found"
Hi, sorry for the delayed response. Btw, all the pods are running fine.
This is where I downloaded the log. Seems like some docker issue, though I can't seem to figure it out. As an alternative, I spawned a clearml-agent outside the k8s environment and it was able to execute well.
Hi, will proceed to close this thread. We found some issue with the underlying docker on our machines. We have now shifted to another k8s cluster of EC2 instances in AWS.
Btw, this is just the example code from the clearml repo.
I just had to set up the clearml-agent on my machine. Closing this issue.
Hmm, unfortunately it is still pending, as in nothing is running.
Currently, in the diagram here, the ClearML File Server is shown as a local storage drive. Our two primary concerns:
Is there any way we can scale this file server when our data volume explodes? Maybe it wouldn't be an issue in the k8s environment anyway. Or can it also be configured such that all data is stored in HDFS (which helps with scalability)? Is there any security to protect the data in this storage?
Is there any documentation on how we can use this ports mode? I didn't seem to find any. Thanks.
Hi AgitatedDove14, now we prefer to run dynamic agents instead, using python3 k8s_glue_example.py
In this case, is it still possible to pass --order-fairness at the queue level, or is this more of an Enterprise edition feature?
AgitatedDove14 I am confused now. Is this feature not available in the k8s glue? Or is it going to be implemented?
Hi AgitatedDove14, just re your reply on https://github.com/allegroai/clearml-agent/issues/50#issuecomment-811554045 : "Basically as jobs are pulled by order, they are pushed into the k8s, then if we hit the max k8s instance limit, we stop pulling jobs until a k8s job is completed, then we continue."
For this scenario,
k8s has an instance limit of 10 (let's say)
I run Optimization (it has about 100 jobs), but only the first 10 will be pulled into k8s. After this, I run a single Deep Learning (DL)...
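Just to check my understanding of the quoted behavior, here's a rough sketch of that pull-until-limit loop. This is my own illustration, not ClearML's actual code; names like MAX_K8S_JOBS and dispatch are hypothetical.

```python
from collections import deque

MAX_K8S_JOBS = 10  # hypothetical cluster instance limit

def dispatch(queue, running, limit=MAX_K8S_JOBS):
    """Pull tasks in queue order and push them to k8s until the
    instance limit is hit; remaining tasks stay queued."""
    while queue and len(running) < limit:
        running.append(queue.popleft())
    return queue, running

# 100 optimization jobs queued first, then one DL job behind them
queue = deque([f"opt-{i}" for i in range(100)] + ["dl-task"])
running = []
queue, running = dispatch(queue, running)
# only the first 10 optimization jobs run; the DL task waits in the
# queue until completed jobs free up slots and dispatch runs again
```

If this is right, the DL task can only start after enough of the 100 optimization jobs finish, which is exactly why I'm asking about --order-fairness below.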
Hi AgitatedDove14, thanks for the explanation.
python k8s_glue_example.py --queue high_priority_q --ports-mode --num-of-services 10
python k8s_glue_example.py --queue low_priority_q --ports-mode --num-of-services 2
Would the above be a good way to simulate the below?
clearml-agent daemon --queue high_priority_q low_priority_q
Hi AgitatedDove14
I am still not very clear on using this, even after looking at k8s_glue_example.py's code.
Is it possible to give a sample usage of how this works?
python k8s_glue_example.py --ports-mode --num-of-services
Another question: I am still not sure how this resolves my original question.
https://github.com/allegroai/clearml-agent/issues/50#issuecomment-811554045
How will imposing an instance limit prevent or allow the --order-fairness feature, for example, which ex...
Yup, I updated this in my local clearml.conf. Or should I be updating this elsewhere as well?
Yeah, currently we are evaluating Seldon. But I was wondering whether the ClearML enterprise version would do something similar?
I ran this on my local machine:
clearml-task --project playground --name tensorboard_toy --script tensorboard_toy.py --requirements requirements.txt --queue myqueue