SubstantialElk6

117 Questions, 310 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

282 × Eureka!

Questions 117
Answers 310

0 Votes

8 Answers

981 Views

0 Votes 8 Answers 981 Views

I Just Getting This In My Agent Run Task. Would Appreciate If Someone Can Advise Where I Externalrequirement Is Pointing At.

I just getting this in my agent run task. Would appreciate if someone can advise where i externalrequirement is pointing at. RequirementsManager handler rais...

mlops

3 years ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Just Wondering, Why Aren'T You Guys Getting Yourselves Known In Gtc?

Just wondering, why aren't you guys getting yourselves known in GTC?

clearml

3 years ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Hi, Is There A Way I Can Supply Credentials To Clearml-Data (Cli And Python) Without Going Thru The Clearml.Conf?

Hi, is there a way I can supply credentials to clearml-data (cli and python) without going thru the ClearML.conf?

dataset

3 years ago

0 Votes

3 Answers

997 Views

0 Votes 3 Answers 997 Views

Hi, I Got This Screenshot From Our Workers Tab And Would Like To Clarify Some Understanding. I Noticed That The

Hi, i got this screenshot from our workers tab and would like to clarify some understanding. I noticed that the Total number of workers and Number of active ...

clearml

2 years ago

0 Votes

8 Answers

1K Views

0 Votes 8 Answers 1K Views

Hi, I'M Using The K8S Glue And Have A Few Questions.

Hi, I'm using the k8s glue and have a few questions. Noted that it's not requesting the http://nvidia.com/gpu thus the pod created doesn't have a GPU resourc...

clearml

3 years ago

0 Votes

17 Answers

1K Views

0 Votes 17 Answers 1K Views

Hi I'M Using Clearml Datasets. How Do I Tell From The Clearml Ui Which Datasets Version Am I Using?

Hi I'm using clearml datasets. How do I tell from the ClearML UI which datasets version am I using?

clearml

3 years ago

0 Votes

1 Answers

1K Views

0 Votes 1 Answers 1K Views

[Security] Hi, One Of Our Teams Noted That Previews Of Clearml-Data Datasets Are Saved In The Files_Server (Indicated In Clearml.Conf) Instead Of The Indicated Output_Uri In The Dataset.Create Argument. This Results In A Security Breach. May I Ask If This

[Security] Hi, one of our teams noted that previews of clearml-data datasets are saved in the files_server (indicated in ClearML.conf) instead of the indicat...

dataset

one year ago

0 Votes

1 Answers

992 Views

0 Votes 1 Answers 992 Views

Hi, I'Ve Three Questions Regarding Clearml Pipelines.

Hi, I've three questions regarding clearml pipelines. - can I check when we use a clearml pipeline and data get transferred from stage to stage, do the data ...

clearml

one year ago

0 Votes

9 Answers

1K Views

0 Votes 9 Answers 1K Views

Hi, We Are Having Issues With Clearml-Session For Vscode. Apparently It'S Hardcoded To Download From

Hi, we are having issues with clearml-session for vscode. Apparently it's hardcoded to download from https://github.com/microsoft/vscode-python/releases but ...

remote-ssh

3 years ago

0 Votes

1 Answers

1K Views

0 Votes 1 Answers 1K Views

Hi, Would You Have A Working Example On This?

Hi, would you have a working example on this?

clearml

3 years ago

0 Votes

10 Answers

1K Views

0 Votes 10 Answers 1K Views

Hi, V1 Of Agent Seems To Have Removed Agent.Package_Manager.Force_Repo_Requirements_Txt. Is This Still Available In Other Forms?

Hi, v1 of agent seems to have removed agent.package_manager.force_repo_requirements_txt. Is this still available in other forms?

mlops

3 years ago

0 Votes

12 Answers

1K Views

0 Votes 12 Answers 1K Views

Can I Ask How Often Does The Hosted Clearml Reset? I'M In A Hackathon And Thought Of Using It.

Can i ask how often does the hosted clearml reset? I'm in a hackathon and thought of using it.

clearml

3 years ago

0 Votes

20 Answers

1K Views

0 Votes 20 Answers 1K Views

Hi, We Are Having An Interesting Issue Here. We Serve Many Users And Each User Has Their Own Credentials In Accessing The Private Git Repo. We Can'T Seem To Find A Way For The End User To Pass In Their Git Credentials When They Run Their Codes In Both Age

Hi, we are having an interesting issue here. We serve many users and each user has their own credentials in accessing the private git repo. We can't seem to ...

mlops

3 years ago

0 Votes

2 Answers

940 Views

0 Votes 2 Answers 940 Views

Hi, We Have Been Using Clearml In Our Development Environment To Train Our Models And Benchmarking Them. I Was Wondering What Is Clearml'S Role In Transition To (Production. Two Specific Points, Deployment, And Automated Retraining Pipeline.

Hi, we have been using ClearML in our development environment to train our models and benchmarking them. I was wondering what is ClearML's role in transition...

clearml

3 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Hi I Upgraded Some Of My K8S Glue To Use The Latest Clearml-Agent 1.1.0 And Receive This Error When The Agent Pulls A Task. Pulling Task Xxxbbbxxxbb Launching On Kubernetes Cluster Pushing Task Xxxbbbxxxbb Into Temporary Pending Queue Kubernetes Scheduli

Hi I upgraded some of my k8s glue to use the latest clearml-agent 1.1.0 and receive this error when the agent pulls a task. Pulling task xxxbbbxxxbb launchin...

kubernetes mlops

3 years ago

0 Votes

2 Answers

911 Views

0 Votes 2 Answers 911 Views

Hi, Is There A Way To Use Api To Return The Urls Of All The Models In The Clearml Repository?

Hi, is there a way to use API to return the URLs of all the models in the ClearML repository?

clearml

one year ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Hi, I Was Adding Data Using Clearml-Data And Get The Following Consistent Errors.

Hi, i was adding data using clearml-data and get the following consistent errors. Retrying (Retry(total=237, connect=237, read=240, redirect=240, status=240)...

dataset

2 years ago

0 Votes

3 Answers

982 Views

0 Votes 3 Answers 982 Views

Hi, I Have A Docker Image That Needs To Be Run In Privileged Mode. How Should I Do The Following?

Hi, i have a docker image that needs to be run in privileged mode. How should i do the following? clearml-session: Pass the --privileged option along --docker ?

clearml

3 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Hi, Can I Choose Not Print The Clearml-Agent Config Logs In The Console? Reason Is We Are Passing Credentials Via Env Var To The K8S Glue And Its Being Displayed In The Console As ...

Hi, can i choose not print the clearml-agent config logs in the console? Reason is we are passing credentials via env var to the k8s glue and its being displ...

clearml

3 years ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Hi, How Do I Switch Clearml Server To Run On Https With A Self Signed Cert?

Hi, how do I switch clearml server to run on https with a self signed cert?

clearml

3 years ago

0 Votes

2 Answers

1K Views

0 Votes 2 Answers 1K Views

Hi, I Am Working On Creating Retraining Pipelines In Production. The Way I'M Doing This Is To Install Clearml-Server On My Production. Then I Recreate The Ingestion, Preprocessing And Training/Opt Tasks Into A Clearml-Pipeline. Thereafter, I Would Call

Hi, i am working on creating retraining pipelines in production. The way i'm doing this is to install clearml-server on my production. Then i recreate the in...

clearml

2 years ago

0 Votes

1 Answers

1K Views

0 Votes 1 Answers 1K Views

[Distributed Training] Hi, I Have A Clearml Setup With K8Sglue That Spins Up Pods Of 4 Gpus When Picking Tasks Off The Clearml Queue. We Would Now Want To Proceed With Multi-Node Training, And Some Of The Examples We Are Trying Are Here.

[Distributed Training] Hi, i have a ClearML setup with K8SGlue that spins up pods of 4 GPUs when picking tasks off the clearml queue. We would now want to pr...

clearml

one year ago

0 Votes

5 Answers

1K Views

0 Votes 5 Answers 1K Views

Hi, I'M Running The Following And Encountering Some Ssl Errors.

Hi, i'm running the following and encountering some SSL errors. SSL_CERT_FILE=ca.crt clearml-data upload --id 12314jhg42342j4j --storage clearml.storage - ER...

aws dataset

3 years ago

0 Votes

30 Answers

1K Views

0 Votes 30 Answers 1K Views

Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Hi, i have been getting the following for a while. Is there a more detailed log i can look into? This happens on both https and http. 2021-05-27 08:47:02,539...

clearml

3 years ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Hi, I Noted That If I Run My Codes On My Laptop With Remote_Execute Off A Python3.8 Venv, And When The Remote Task Starts Executing But The Image Is Installed With A Different Version Of Python, Say Python3.8, We Would Encounter Errors With Venv. At This

Hi, i noted that if I run my codes on my laptop with remote_execute off a python3.8 venv, and when the remote task starts executing but the image is installe...

clearml

3 years ago

0 Votes

4 Answers

1K Views

0 Votes 4 Answers 1K Views

Hi, I Would Like To Start Logging How Often My Users Uses Clearml. How Might I Query This Kind Of Information?

Hi, i would like to start logging how often my users uses ClearML. How might i query this kind of information?

clearml

2 years ago

0 Votes

7 Answers

980 Views

0 Votes 7 Answers 980 Views

Hi, I'M Attempting To Upgrade My Clearml Server On Offline Env. I Wish To Retain All Existing Data. Can I Check If It Suffice To Just Docker-Compose Down --Remove-Orphans Replace Clearml-Server:Latest And Clearml-Agent-Services:Latest With Latest Pull.

Hi, I'm attempting to upgrade my clearml server on offline env. I wish to retain all existing data. Can I check if it suffice to just docker-compose down --r...

clearml

3 years ago

0 Votes

1 Answers

1K Views

0 Votes 1 Answers 1K Views

Hi, Can You Run Clearml Experiments On Docker Images That Does Not Have Root?

Hi, can you run clearml experiments on docker images that does not have root?

clearml

2 years ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Hi, The `

Hi, the https://github.com/allegroai/trains/blob/master/examples/services/jupyter-service/execute_jupyter_notebook_server.py file linked by following page is...

clearml

3 years ago

0 Votes

1 Answers

1K Views

0 Votes 1 Answers 1K Views

Hi, Can Clearml-Server Support Replicaset In K8S?

Hi, can Clearml-Server support ReplicaSet in K8S?

clearml

2 years ago

Show more results

0 I'M Getting This When Running With Keras Framework. Clearml.Storage - Error - Failed Uploading: [Errno 21] Is A Directory: 'Model.Savedmodel'.

thanks. That seems to work. I got a question, does it save the best model or the model in the last epoch?

3 years ago

0 Prev, I Worked With Clearml (1 Year Back) And Back Then, We Config Seldon Core For The Deployment And Clearml For The Training.. Now There Is Clearml-Serving, Does It And Can It Fulfill A Similar Objective ?

Hi, i'm gonna hijack this thread a bit. My community uses ClearML and is looking at various model deployment strategies. We are looking at a seamless integration with Triton but noted they Triton does not support deployment strategies. ClearML-Serving seems to but the strategies are rather limited. Is there a roadmap to expand Clearml-serving?

2 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Do you have more info on vault?
Actually it only make sense if the entire department or organisation are saving their models in a common repo. In our case this is not possible due to client security (e.g. training data from clients can potentially be 'reverse engineered' from trained models in future). So each department and even projects will need their own repo.

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Hi, we are still not getting the model repo to work, mainly due to clearml.storage failing to save the models.
We tried a vanilla boto3 code and it works, but we can't figure out why we get connectionreseterror 104 when clearml does it.

How do we configure clearml in correspondence to following boto code?

S3= boto3.resource('s3', endpoint_url=' https://ecs.ai ', aws_access_key_id='mykey', aws_secret_access_key='mysevret', config=Config(signature_version='s3v4'), region_name='us-east-1', ve...

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Ok thanks. that explains alot. We have been doing this wrongly the whole time, thinking that the clearml.conf on the client side would be acknowledged by the remote agent execution. In reality, only the API section is utilised.

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

i see. Can i take it that when the client uses
task.execute_remotely(queue_name="1gpu", exit_process=True)then none of the content in its clearml.conf will be used, except for the API part. And Clearml simply uses whatever is on the Agent side.
api { # Notice: 'host' is the api server (default port 8008), not the web server. api_server: web_server: files_server: # Credentials are generated using the webapp, `
# Override with os environment: ...

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Going back to the open source, I think that adding the credentials as part of the source code might allow to have "credentials" auto populate as part of the remote execution, wdyt?

Not sure how this will work when i can't supply the credentials to ClearML programatically.

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

yes its on purpose, each user would have their own AWS credentials for default_output_uri.

3 years ago

0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

I thought of another potential way but not sure if the SDK supports it.
We will perform manual save and upload of model using vanilla boto3 and credentials passed in as env var. Use ClearML SDK to update the Model Repo on the location of the model, without ClearML uploading it explicitly.Would the above work?

3 years ago

0 Hi, We Recently Upgraded Clearml To 1.1.1-135 . 1.1.1 . 2.14. The Task Init Is

I didn't track the version on this change in behaviour. But last I tried it was able to download the content after I provide the credentials.

3 years ago

0 Hi, We Recently Upgraded Clearml To 1.1.1-135 . 1.1.1 . 2.14. The Task Init Is

Hi,
I'm running on Dell ECS storage appliance, which offers S3 compatibility.
yes http://ECS.ai is the DNS name of the server.
ClearML-models is the bucket.
Let me try with ip:port.

3 years ago

0 Hi, My Devsecops Team Has Raised Some Issues Of Us Deploying Clearml For Use. In Particular, They Are Not Happy With Docker.Sock Configuration As It Would Potentially Expose The Entire Cluster To Unauthorised View. Can We Do Without It?

Thanks, its attached.
I also noted that the status on the ClearML is always in 'pending', unlike others which says 'Running'. Is this a side effect of using k8s glue?

3 years ago

Hi thanks. How about Agent, does its docker mode or k8s mode require docker.sock to be exposed?

3 years ago

Hi, please correct me if i am wrong, to use the glue, i need the following.
A k8s cluster A kubectl that is connected to the k8s cluster A pip install of clearml-agent 0.17.1
So i did all the above, I'm not what it meant by running the entire thing on own machine.

3 years ago

Unfortunately it's not. The problem previously encountered with the docker method surfaced again. In this case, the BASE DOCKER IMAGE
nvidia/cuda:10.1-runtime-ubuntu18.04 --env GIT_SSL_NO_VERIFY=true is not taking effect with the k8s glue.

3 years ago

Is this fix coming soon?

3 years ago

Thanks 👍 . Should i create an issue on Github?

3 years ago

Sorry i forgot to paste the logs.

3 years ago

It has always been there.

3 years ago

0 Hi I'M Using Clearml Datasets. How Do I Tell From The Clearml Ui Which Datasets Version Am I Using?

I meant the dataset id.

3 years ago

0 Hi I'M Using Clearml Datasets. How Do I Tell From The Clearml Ui Which Datasets Version Am I Using?

Hi, it make sense to automate this part just like how you automate the rest of the MLOps flow, especially when you already support Data Versioning/Lineage, Data Provenance (How it works with the experiment and as a model source) should be in too. Although i agree technically it's probably not possible to tell if the users actually used the indicated datasets after they do a datasets.get_copy() .

3 years ago

0 Hi I'M Using Clearml Datasets. How Do I Tell From The Clearml Ui Which Datasets Version Am I Using?

AgitatedDove14 , i'm Jax, not Manoj! lol. 😅 😅

3 years ago

0 Hi I'M Using Clearml Datasets. How Do I Tell From The Clearml Ui Which Datasets Version Am I Using?

Sorry AgitatedDove14 can you bump me to that thread?

3 years ago

0 Hi, Several Changes Occurred Recently And I Would Like To Know If There'S A Way To Verbose Catch All The Printout That Happening Within A K8S Glue Spawned Pod. We Have An Issue Where All Of Our New Remote_Execution Tasks Are Stuck In The 'Pending' Stage.

does the bash script need clearml-agent to be able to communicate to the https clearml-server first? If yes, there's a chicken/egg problem here.

3 years ago

Sorry, in case i misunderstood you. Are you refering to the extra_docker_shell_script .

3 years ago

Some breakthrough. The problem is because we switched the web, api and files server to use https (ssl) endpoint instead. I had switched back to http end points to test this theory.

Although its not printing the error, i suspect its not able to connect due to lack of the self signed cert. Previously this wasn't an issue, not sure what changed in clearml_agent=1.1.0.

There's a secondary issue resulting, i will put this on a new thread.

3 years ago

Its running as a long running POD on K8S. I'm using log -f to track its stdout.

3 years ago

Is there a way for k8s glue to pass on self signed cert information to the agent pods?

3 years ago

Ok i get the logic now. extra_docker_shell_script executes before clearml-agent talks to clearml server.

3 years ago

0 Can I Ask How Often Does The Hosted Clearml Reset? I'M In A Hackathon And Thought Of Using It.

ok thanks.

3 years ago

Show more results