Hi @<1597762318140182528:profile|EnchantingPenguin77> , you can set this in the docker extra arguments section of the task
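For example, that field takes raw docker run arguments; something like this (flags and values are just placeholders):
```
--env MY_VAR=1 --shm-size=8g
```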
Do you have the email associated with the account, or the workspace ID?
Hi @<1714451218161471488:profile|ClumsyChimpanzee54> , the Azure autoscaler is available only in the Scale/Enterprise plan. It functions the same as the GCP/AWS autoscalers: basically scaling from 0 up to as many machines as configured, then automatically spinning them all down once the workload is over, like you described
How did you configure the files_server in clearml.conf?
Also, how many GPUs are you trying to run with?
Hi RattyLouse61, how are you adding users? Are you adding them as fixed users in one of the configuration files?
ReassuredTiger98, BitterLeopard33, I think I've encountered this 4 GB HTTP limit before. It should be fixed in the next SDK release 🙂
Hi @<1570583227918192640:profile|FloppySwallow46> , please don't @ the entire channel for help 🙂
If a task is pending, it means no agent has picked it up yet. Maybe the agent is unavailable or the process crashed. Check in that direction
From the error you provided, it looks like virtualenv isn't installed in the environment
default_output_uri is for artifacts & models while files_server is for debug samples and plots (if they are files)
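In clearml.conf those map to two separate keys, e.g. (URLs are placeholders for your own endpoints/bucket):
```
api {
    # debug samples and plots (when they are files) are uploaded here
    files_server: "http://localhost:8081"
}
sdk {
    development {
        # artifacts and models are uploaded here
        default_output_uri: "s3://my-bucket/clearml"
    }
}
```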
Hi @<1597762318140182528:profile|EnchantingPenguin77> , can you please add the full log?
How are you building the pipeline?
Hi ReassuredTiger98,
I think it's something that was logged during the initial run; the clearml-agent then simply recreates the environment 🙂
Hi SoreDragonfly16, what is your usage when saving/loading those files? You can mute both the save/load messages, but not each one separately.
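If it's the framework auto-logging, a minimal sketch of muting it via Task.init (assuming PyTorch; swap in whichever framework you use):
```python
from clearml import Task

# Disabling the framework's auto-logging mutes both the save and the
# load messages together; they cannot be muted individually
task = Task.init(
    project_name="examples",
    task_name="muted model logging",
    auto_connect_frameworks={"pytorch": False},
)
```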
Also, do you see all these files as input models in UI?
2024-02-08 11:23:52,150 - clearml.storage - ERROR - Failed creating storage object
Reason: Missing key and secret for S3 storage access
This looks unrelated to the hotfix; it looks like you misconfigured something and that's why writing to S3 fails
You can use the API to call tasks.get_by_id and get that specific information. In the response it sits in data.tasks.0.completed
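For example, with the Python APIClient (the task ID is a placeholder):
```python
from clearml.backend_api.session.client import APIClient

client = APIClient()
# Fetch the task; 'completed' holds the completion timestamp
task = client.tasks.get_by_id(task="<your-task-id>")
print(task.data.completed)
```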
It's unrelated. Are you running the example and no scalars/plots are showing?
Hi @<1649221402894536704:profile|AdventurousBee56> , I'm not sure I understand. Can you add the full log and explain step by step what's happening?
Usually the location is on the file server at /opt/clearml/data/fileserver
The address seems strange, is this the hostname?
It seems like a mix between hostname and IP?
Well not really
Please elaborate 🙂
Hi @<1523701553372860416:profile|DrabOwl94> , can you check if there are some errors in the Elastic container?
Hi @<1523701062857396224:profile|AttractiveShrimp45> , can you please add the configuration of your HPO app and the log?
What versions of clearml-agent & clearml are you using? Is it a self hosted server?
Can you try it with clearml==1.6.0 please?
Also, can you list the exact commands you ran?
Hi @<1795626098352984064:profile|SoggyElk61> , is it possible you have multiple environments?
ShallowGoldfish8, I think the best approach would be storing them as separate datasets per day, and then having a "grand" dataset that includes all days, adding new days as you go (see the sketch below).
What do you think?
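Something along these lines, a sketch with hypothetical names/paths:
```python
from clearml import Dataset

# One dataset per day
day_ds = Dataset.create(
    dataset_name="data-2024-02-08",
    dataset_project="daily-data",
)
day_ds.add_files("/data/2024-02-08")
day_ds.upload()
day_ds.finalize()

# A "grand" dataset that lists the daily datasets as parents,
# extended with each new day as you go
grand = Dataset.create(
    dataset_name="all-days",
    dataset_project="daily-data",
    parent_datasets=[day_ds.id],  # append new day IDs here over time
)
grand.upload()
grand.finalize()
```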
Are you running in docker mode? You could maybe use another docker image that has python in it.
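For example, when starting the agent (queue name and image are just placeholders):
```
clearml-agent daemon --queue default --docker python:3.10-slim
```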
Hi AttractiveShrimp45. Did you input the min value as 0, the max value as 1, and the step as 1?
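For reference, a minimal sketch of why that matters (parameter name is hypothetical): with min 0, max 1 and step 1, a discrete integer range only ever yields the two values 0 and 1:
```python
from clearml.automation import UniformIntegerParameterRange

# min 0, max 1, step 1 -> the optimizer can only ever sample 0 or 1
param = UniformIntegerParameterRange(
    "General/my_param", min_value=0, max_value=1, step_size=1
)
print(param.to_list())  # enumerates every valid value: just 0 and 1
```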