Hi CostlyOstrich36, that's correct.
It's a local deployment. I was only presented with a username field, with no need to enter a password. Once I'm in, I don't see an option in my profile to set a password either, nor is there any integration with LDAP, for example.
The problem was resolved by doing a git push. Somehow the git diff didn't capture the change to requirements.txt in the project. I can't reproduce the issue after this either.
Hi AgitatedDove14, that's what I am trying to figure out as well. The task has nothing to do with torch, and the requirements.txt doesn't list any torch packages either.
Hi, I can't seem to find the source. In what kinds of situations would it try to install torch outside of the user requirements?
Hi.
We tried as advised above and it still didn't work.
Host: http://ecs.ai:443
output_uri = S3://ecs.ai:443/bucketname
This time round the client gave this error.
botocore.exceptions.ConnectionClosedError: Connection was closed before we received a valid response from endpoint URL: ' http://ecs.ai/bucketname/.clearml.test '.
It's quite apparent that whatever clearml passed to boto3 ends up as an http call instead of https, which is wrong.
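For what it's worth, a quick check outside ClearML with boto3 (credentials and names below are placeholders) can confirm whether the endpoint itself is reachable over https, which would point the finger at the URL ClearML builds. I'm also assuming the matching sdk.aws.s3.credentials entry in clearml.conf may need secure set to true for a non-AWS endpoint, but that's an assumption on my part.
` import boto3

# Sanity check outside ClearML (placeholder credentials): if this works,
# the ECS endpoint serves the bucket over https and the issue is in the
# URL ClearML hands to boto3, not in the endpoint itself.
s3 = boto3.client(
    "s3",
    endpoint_url="https://ecs.ai:443",
    aws_access_key_id="<key>",
    aws_secret_access_key="<secret>",
)
print(s3.list_objects_v2(Bucket="bucketname", MaxKeys=1).get("KeyCount")) `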
Hi. If we disable the API service, how will it affect the system? And how do we disable it?
Hi HelpfulDeer76 , I'm facing similar issues. Would you mind describing in detail how you deploy clearml-agent? Is it running as a pod on k8s?
Hi,
basically I run this block first and end the script:
` task = Task.init(project_name="afro-nmt", task_name=args.taskname, continue_last_task=args.taskid)
Logger.current_logger().report_scalar(title="BLEU", series="JW300", value=args.jwbleu, iteration=args.lastiter) `
Then I run another script, with a different series:
` task = Task.init(project_name="afro-nmt", task_name=args.taskname, continue_last_task=args.taskid)
Logger.current_logger().report_scalar(title="BLEU", series="SS900", value=arg... `
It didn't work as expected.
` task init
task report iter 10
task init
task report iter 10 `
The second task pushed the reporting iteration to 20 instead.
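A sketch of the workaround I have in mind, assuming Task.set_initial_iteration(0) clears the offset that continue_last_task adds (the task name, id and values are placeholders):
` from clearml import Task, Logger

# Continue the existing task but reset the iteration offset, so the new
# scalar lands at the iteration we pass instead of being shifted past the
# previous run's last reported iteration (assumption, not verified).
task = Task.init(project_name="afro-nmt", task_name="eval-bleu",
                 continue_last_task="<previous-task-id>")
task.set_initial_iteration(0)
Logger.current_logger().report_scalar(title="BLEU", series="SS900",
                                      value=0.0, iteration=10) `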
Hi TimelyPenguin76, I am adding a debug sample to an existing task using the above method. What should I put for the iteration? I do not want to overwrite existing ones, but I do not know what the last count is. This is for both scalar and media reporting.
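Something like this is what I had in mind, assuming Task.get_last_iteration() returns the last reported count (the task id, values and file name are placeholders):
` from clearml import Task

# Append new scalars/debug samples after the last reported iteration
# instead of guessing a number that might overwrite existing ones.
task = Task.get_task(task_id="<existing-task-id>")
next_iter = (task.get_last_iteration() or 0) + 1
logger = task.get_logger()
logger.report_scalar(title="BLEU", series="JW300", value=0.0, iteration=next_iter)
logger.report_image(title="debug", series="sample", iteration=next_iter,
                    local_path="sample.png") `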
It would make sense on a very large resource cluster. Unfortunately we have fewer than 50 GPUs to share. A multi-tenant SaaS would cut the resources into even smaller clusters and not help with efficiency. Or would you have a suggestion?
Ok, let me check this out first thing on Monday, thanks AgitatedDove14 .
Ok, that works. Thanks.
Can this issue be solved with vault? It doesn't make sense to expose secrets like that.
Previously we had similar issues when we switched the images used in the agent. Might want to check on that.
What type of pipeline steps are you running? From task, decorator or function?
We were trying 'from task' at the moment, but the question applies to all methods.
If they're all running on the same container why not make them the same task and do things in parallel?
The tasks were created by different teams, and their content is rather independent and modular. Using them is usually optional. For example, task1 performs 'image whitening' and task2 performs 'image resize'.
Thanks. This appears to be solely for the web UI and API. What if I want to orchestrate on K8s?
Thanks TimelyPenguin76 , let me try it out now.
Hi, when I tried ip:port, it references the right host and bucket... BUT... the file is not found on the ECS S3, even though I can see from the logs that it states Completed model upload to s3://ecs.ai:80/clearml-models/artifacts/ ...
I did notice that in the tmp folder, .clearml_agent.xxxxx.cfg does not exist.
And out of curiosity, what did you think we were talking about? Because I didn't see anywhere else that might print the secrets.
Thought this looked familiar.
https://clearml.slack.com/archives/CTK20V944/p1635323823155700?thread_ts=1635323823.155700&cid=CTK20V944
From an efficiency perspective, we should be pulling data as we feed it into training. That said, it's always a good idea to uncompress large zip files and store them as smaller ones that allow you to batch-pull for training.
And is there any roadmap on this? The organisation is firm on SSH auth. It could end up being impossible to use ClearML for remote execution.
I also think it makes sense that when you do certain definitive CI actions like publish, it would support running some custom scripts.
Ok thanks.
The first stage is a rank0 PyTorch script. The downstream stages are rankN scripts; they wait for the IP address of the first stage. But the first stage doesn't return, it simply waits for the rankN scripts to connect to it, while the rankN stages never start because the pipeline won't schedule them until the first stage finishes. So it's probably necessary to have just a single stage.
If I were to start a single rank0 task and subsequent rankN tasks, it would be rather messy on the ClearML dashboard. Best to have either a single clearml application...
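As a rough illustration of the single-task direction, here is a local, single-node sketch with placeholder names and world size (not our actual multi-node setup), where one ClearML task owns all ranks:
` import os
import torch.distributed as dist
import torch.multiprocessing as mp
from clearml import Task


def worker(rank, world_size):
    # Every rank joins the same process group; rank 0 hosts the rendezvous,
    # so no pipeline stage has to wait for another stage's IP address.
    os.environ["MASTER_ADDR"] = "127.0.0.1"
    os.environ["MASTER_PORT"] = "29500"
    dist.init_process_group("gloo", rank=rank, world_size=world_size)
    # ... per-rank training / evaluation goes here ...
    dist.destroy_process_group()


if __name__ == "__main__":
    Task.init(project_name="afro-nmt", task_name="distributed-train")
    world_size = 2  # placeholder
    mp.spawn(worker, args=(world_size,), nprocs=world_size) `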