Thanks, that seems to work. I have a question: does it save the best model or the model from the last epoch?
Hi, I'm going to hijack this thread a bit. My community uses ClearML and is looking at various model deployment strategies. We are looking for a seamless integration with Triton, but noted that Triton does not support deployment strategies. ClearML-Serving seems to, but the strategies are rather limited. Is there a roadmap to expand ClearML-Serving?
Do you have more info on vault?
Actually it only makes sense if the entire department or organisation is saving their models in a common repo. In our case this is not possible due to client security (e.g. training data from clients could potentially be 'reverse engineered' from trained models in the future), so each department, and even each project, will need its own repo.
Hi, we are still not getting the model repo to work, mainly due to clearml.storage failing to save the models.
We tried vanilla boto3 code and it works, but we can't figure out why we get ConnectionResetError 104 when ClearML does it.
How do we configure ClearML to correspond to the following boto3 code?
```
s3 = boto3.resource(
    's3',
    endpoint_url='https://ecs.ai',
    aws_access_key_id='mykey',
    aws_secret_access_key='mysecret',
    config=Config(signature_version='s3v4'),
    region_name='us-east-1',
    ve...
```
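For reference, the closest clearml.conf mapping we could come up with looks like this (the host:port and the secure flag are our guesses for the ecs.ai endpoint):
```
sdk {
    aws {
        s3 {
            credentials: [
                {
                    # non-AWS S3 endpoint: host includes the port
                    host: "ecs.ai:443"   # assumed port
                    key: "mykey"
                    secret: "mysecret"
                    multipart: false
                    secure: true         # https endpoint
                }
            ]
        }
    }
}
```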
Ok thanks, that explains a lot. We have been doing this wrongly the whole time, thinking that the clearml.conf on the client side would be acknowledged by the remote agent execution. In reality, only the api section is used.
I see. Can I take it that when the client uses `task.execute_remotely(queue_name="1gpu", exit_process=True)`, then none of the content in its clearml.conf will be used except for the api part, and ClearML simply uses whatever is on the agent side?
```
api {
    # Notice: 'host' is the api server (default port 8008), not the web server.
    api_server:
    web_server:
    files_server:
    # Credentials are generated using the webapp, ...
    # Override with os environment: ...
}
```
Going back to the open source version: I think that adding the credentials as part of the source code might allow the "credentials" to auto-populate as part of the remote execution, wdyt?
Not sure how this will work when I can't supply the credentials to ClearML programmatically.
Yes, it's on purpose: each user would have their own AWS credentials for default_output_uri.
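Roughly what we had in mind (endpoint, bucket and keys are placeholders; I'm assuming ClearML's S3 driver, boto3 underneath, picks up the standard AWS env vars when no key/secret is set in clearml.conf):
```python
import os
from clearml import Task

# each user exports their own credentials before launching;
# boto3's default credential chain should pick these up
os.environ['AWS_ACCESS_KEY_ID'] = 'per-user-key'         # placeholder
os.environ['AWS_SECRET_ACCESS_KEY'] = 'per-user-secret'  # placeholder

task = Task.init(
    project_name='demo',
    task_name='per-user-output-uri',
    # non-AWS endpoint syntax: s3://host:port/bucket (assumed)
    output_uri='s3://ecs.ai:443/clearml-models',
)
```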
I thought of another potential way but not sure if the SDK supports it.
We would perform a manual save and upload of the model using vanilla boto3, with credentials passed in as env vars, and then use the ClearML SDK to update the Model Repo with the location of the model, without ClearML uploading it explicitly.
Would the above work?
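Something like this sketch, if the SDK supports it (bucket and paths are made up; I'm assuming `update_weights(register_uri=...)` registers the link without uploading anything):
```python
import boto3
from clearml import Task, OutputModel

# 1) upload the weights ourselves with vanilla boto3,
#    credentials coming from the usual AWS_* env vars
s3 = boto3.resource('s3', endpoint_url='https://ecs.ai')
s3.Bucket('clearml-models').upload_file('model.pt', 'proj/model.pt')

# 2) only register the final location with ClearML, no SDK upload
task = Task.init(project_name='proj', task_name='manual-model-upload')
output_model = OutputModel(task=task)
output_model.update_weights(
    register_uri='s3://ecs.ai:443/clearml-models/proj/model.pt'
)
```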
I didn't track which version this change in behaviour happened in, but the last time I tried, it was able to download the content after I provided the credentials.
Hi,
I'm running on a Dell ECS storage appliance, which offers S3 compatibility.
Yes, http://ECS.ai is the DNS name of the server.
ClearML-models is the bucket.
Let me try with ip:port.
Thanks, it's attached.
I also noted that the status on ClearML is always 'Pending', unlike others which say 'Running'. Is this a side effect of using the k8s glue?
Hi, thanks. How about the Agent: does its docker mode or k8s mode require docker.sock to be exposed?
Hi, please correct me if I am wrong; to use the glue, I need the following:
- A k8s cluster
- A kubectl that is connected to the k8s cluster
- A pip install of clearml-agent 0.17.1
I did all of the above, but I'm not sure what is meant by running the entire thing on my own machine.
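Concretely, what I ran (assuming the k8s_glue_example.py script from the clearml-agent repository and our 1gpu queue):
```bash
pip install clearml-agent==0.17.1
# launch the k8s glue: it polls the queue and spawns a pod per task
python k8s_glue_example.py --queue 1gpu
```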
Unfortunately it's not. The problem previously encountered with the docker method surfaced again. In this case, the BASE DOCKER IMAGE `nvidia/cuda:10.1-runtime-ubuntu18.04 --env GIT_SSL_NO_VERIFY=true` is not taking effect with the k8s glue.
Thanks 👍. Should I create an issue on GitHub?
I meant the dataset id.
Hi, it makes sense to automate this part just like you automate the rest of the MLOps flow, especially when you already support Data Versioning/Lineage; Data Provenance (how a dataset ties into the experiment and serves as a model source) should be in too. Although I agree that, technically, it's probably not possible to tell whether users actually used the indicated datasets after they do a datasets.get_copy().
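For context, the access pattern I mean is roughly this (dataset ID is made up, and I'm using `Dataset.get(...).get_local_copy()` as the concrete form of the get-copy call):
```python
from clearml import Dataset

# fetch a local, read-only copy of a registered dataset version;
# after this point ClearML has no way to verify the files were actually used
ds = Dataset.get(dataset_id='a1b2c3d4e5')  # hypothetical ID
local_path = ds.get_local_copy()
print(local_path)
```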
AgitatedDove14, I'm Jax, not Manoj! lol 😅 😅
Sorry AgitatedDove14, can you bump me to that thread?
Does the bash script need clearml-agent to be able to communicate with the HTTPS clearml-server first? If yes, there's a chicken-and-egg problem here.
Sorry, in case I misunderstood you: are you referring to the extra_docker_shell_script?
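i.e. the agent-side setting that injects shell lines into the task container before anything else runs; a sketch of what I think we'd put there (the CA file and mount path are assumptions for our self-signed cert):
```
agent {
    # each line runs inside the task container at startup,
    # before the agent process in the container talks to the server
    extra_docker_shell_script: [
        "cp /mnt/certs/my-ca.crt /usr/local/share/ca-certificates/",  # assumed mount path
        "update-ca-certificates",
    ]
}
```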
Some breakthrough: the problem is that we switched the web, api and files servers to HTTPS (SSL) endpoints. I switched back to HTTP endpoints to test this theory.
Although it's not printing the error, I suspect it's not able to connect due to the lack of the self-signed cert. Previously this wasn't an issue; not sure what changed in clearml_agent=1.1.0.
There's a secondary issue resulting from this; I will put it in a new thread.
It's running as a long-running pod on K8s. I'm using log -f to track its stdout.
Is there a way for the k8s glue to pass self-signed cert information on to the agent pods?
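In the meantime, I guess the blunt workaround on the agent side would be to disable verification in clearml.conf (the proper fix is still installing the CA into the pods):
```
api {
    # workaround for a self-signed server cert: skip SSL verification
    verify_certificate: false
}
```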
Ok, I get the logic now: extra_docker_shell_script executes before clearml-agent talks to the clearml server.
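So, as I understand it (my paraphrase of the startup order, not the actual entrypoint):
```bash
# 1. the pod starts and runs each agent.extra_docker_shell_script line
#    (e.g. installing our self-signed CA into the container's trust store)
# 2. only then does clearml-agent connect to the HTTPS clearml-server
# 3. the agent clones the repo, installs requirements, and runs the task
```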