I see. Can I take it that when the client uses task.execute_remotely(queue_name="1gpu", exit_process=True), none of the content in its clearml.conf will be used except for the api part, and ClearML simply uses whatever is on the Agent side?
api {
    # Notice: 'host' is the api server (default port 8008), not the web server.
    api_server:
    web_server:
    files_server:
    # Credentials are generated using the webapp,
    # Override with os environment: ...
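Roughly the client-side flow I mean, as a sketch (project and task names are just placeholders; the point is that once the task is enqueued, the agent machine's own clearml.conf governs execution, and the api credentials on each side are only used to reach the server):
```python
from clearml import Task

# Register the script, git info and captured requirements with the server.
task = Task.init(project_name="examples", task_name="remote run")

# Stop the local process and enqueue the task on the "1gpu" queue.
# From here on, the agent that pulls it runs under the agent machine's
# clearml.conf (docker, pip settings, etc.), not the client's.
task.execute_remotely(queue_name="1gpu", exit_process=True)

# Anything below this line executes only on the agent.
print("running on the agent")
```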
Yes! I definitely think this is important, and hopefully we will see something there
(or at least in the docs)
Hi AgitatedDove14 , any updates in the docs to demonstrate this yet?
It's hard to tell, but the agent change was a significant one. Unless Python versions have something to do with it.
I used the nvcr PyTorch image and instructed ClearML to inherit global dependencies. No need to install torch, and it works well.
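Roughly how I wired it up, as a sketch (the NGC image tag is only illustrative; agent.package_manager.system_site_packages: true in the agent's clearml.conf is what lets the task venv inherit the torch already inside the container):
```python
from clearml import Task

task = Task.init(project_name="examples", task_name="ngc pytorch base image")

# Ask an agent running in --docker mode to execute this task inside the
# NGC PyTorch container, where torch is already installed.
task.set_base_docker("nvcr.io/nvidia/pytorch:22.04-py3")

# With agent.package_manager.system_site_packages: true on the agent side,
# the task venv inherits the container's packages, so torch is not reinstalled.
```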
Do you mean this? Removing containers section: [{'image': 'clearml-agent:latest"', 'env': [{'name': 'PIP_INDEX_URL', 'value': ' '},
I'm also noticing a lot of this while the k8s glue is running. Ex: Expecting value: line 1 column 1 (char 0) K8S Glue pods monitor: Failed parsing kubectl output:
I see, I understand better now. Thanks.
Hi thanks.
So I suppose ClearML makes use of the information in the .git folder at the root of the script folder to gather that info.
I have yet to go through ClearML Agent thoroughly. TimelyPenguin76 , so if I run a training with uncommitted changes and don't commit/push afterwards, when I clone the task, won't the ClearML agent be unable to pull that script from the git repo?
Hi, I don't think the clearml agent actually ran at that point in time. All I can see in the pod is:
apt install of the libpthread-stubs, libx11, libxau and libxcb1 packages, and pip install of clearml-agent. After the above are successful, the pod just hangs there.
Thanks. Gonna try that out. But I hit another snag. Strangely, the Agent is not creating the right venv. This is what the Agent created.
```
pip:
- asn1crypto==0.24.0
- attrs==20.3.0
- certifi==2020.12.5
- chardet==4.0.0
- cryptography==2.1.4
- Cython==0.29.22
- furl==2.1.0
- future==0.18.2
- humanfriendly==9.1
- idna==2.6
- importlib-metadata==3.7.0
- jsonschema==3.2.0
- keyring==10.6.0
- keyrings.alt==3.0
- orderedmultidict==1.0.1
- pathlib2==2.3.5
- psutil==5.8.0
- pycrypto==2.6.1
- pygobject...
```
Thanks. Which brings me to the question. How does ClearML deal with all the CVEs? What is your process for response?
Hi, so this means if I want to use Kubernetes, I would have to 'manually' install multiple agents on all the worker nodes?
Which clearml.conf is it referring to? I'm executing on my client, which is then remotely executed by the agent. Both of them have ~/clearml.conf.
Where should I indicate this in the configuration?
Any idea?
Is there any way to see an error log from that?
Got that, thanks. Just to understand better: when clearml-data uploads my recursive folder of image data, it converts it into a compressed form with a different folder structure from the original dataset.
When my software pulls the data, I'm returned a str. How would we manipulate the data from there?
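For reference, the access pattern I mean, as a sketch (the dataset id and the .jpg glob are placeholders; the str seems to be just a path to a local, already-extracted cached copy of the dataset):
```python
from pathlib import Path
from clearml import Dataset

dataset = Dataset.get(dataset_id="<dataset-id>")  # placeholder id
local_dir = dataset.get_local_copy()  # str path to a cached, read-only copy

# From there it is an ordinary folder: iterate, open files, or point a dataloader at it.
for img in Path(local_dir).rglob("*.jpg"):
    print(img)
```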
I'm also beginning to think this is related to https://clearml.slack.com/archives/CTK20V944/p1620664770492400 . Previously, when I set force_repo_requirements_txt=true and system_site_packages: true , it seemed to work. Upgrading to v1.02 seems to change things.
So the context I'm asking in is that I realise I'll need to catalogue all the dataset ids created by people separately on a spreadsheet, and for each experiment I'll need to go into the code commit to see which id is being used. On the other hand, I thought I'd seen advertised use cases where the experiment can be directly linked to the dataset id being used. The brain's a bit rusty to recall how it was done.
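One pattern that might be what I'm half-remembering (a sketch only, with placeholder names): recording the dataset id on the experiment itself so it shows up in the web UI instead of only in the code commit:
```python
from clearml import Task, Dataset

task = Task.init(project_name="examples", task_name="train with dataset")

# Connect the dataset id as an experiment parameter so it is visible
# (and overridable when cloning) in the web UI.
params = task.connect({"dataset_id": "<dataset-id>"})

dataset = Dataset.get(dataset_id=params["dataset_id"])
data_dir = dataset.get_local_copy()
```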
If we run all the rank 0 and rank n tasks individually, it defeats the purpose of using ClearML.
ok thanks.
Hi SuccessfulKoala55 , just to add, my clearml.conf (client) and clearml.agent.conf (agent) can have differing values. I'm not sure which one takes precedence and if this could be the cause.
Thanks SuccessfulKoala55 , how might I do this clean up? Does this increase with more use of ClearML? And to add, we save all artifacts onto a remote S3 server.
Yes it is! But ClearML doesn't support multi-node training out of the box in a way that streamlines the process, so we are trying to figure out a way to do it.
From an efficiency perspective, we should be pulling data as we feed it into training. That said, it's always a good idea to uncompress large zip files and store them as smaller ones that allow you to pull batches for training.
Hi, it is missing --docker on the agent. Thanks! The Dynamic GPU option is only available with the Enterprise version, right?