GrievingTurkey78

34 Questions, 125 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

119 × Eureka!

0 Votes

9 Answers

1K Views

Hi! Does Clearml Have A Way To Turn On/Off Virtual Machines Depending If There Are Experiments On Queue?

Hi! Does ClearML have a way to turn on/off virtual machines depending if there are experiments on queue?

clearml

3 years ago

0 Votes

2 Answers

886 Views

Hi! I Have The Previous Trains Server Configured With Multiple Experiments; I Created It Using The Gcloud Images Provided. If I Want To Update The Server To The Newest Clearml Version Should I Follow These Steps

Hi! I have the previous trains server configured with multiple experiments; I created it using the gcloud images provided. If I want to update the server to ...

clearml

3 years ago

0 Votes

1 Answer

931 Views

Quick Question On The

Quick question on the clearml-data package, Can I add files to a dataset from google storage instead of having to download them?

dataset

3 years ago

0 Votes

2 Answers

916 Views

Hi All! Currently I Am Trying To Create A Tool That Can Perform Certain Operations On Dataset Ids, This Is A Skeleton Of What I Have In Mind (Based On The Examples):

Hi all! Currently I am trying to create a tool that can perform certain operations on dataset ids, this is a skeleton of what I have in mind (based on the ex...

clearml

3 years ago

0 Votes

17 Answers

890 Views

Hi! I Am Using The Modelcheckpoint Callback From Tensorflow To Save The Best Model. When The Experiment Finishes If I Go On The Server To Experiment > Artifacts > Output Model I Can See The Model And Subsequently By Clicking On It The Weights. How Can I

Hi! I am using the ModelCheckpoint callback from Tensorflow to save the best model. When the experiment finishes if I go on the server to Experiment > Artifa...

clearml

3 years ago

0 Votes

15 Answers

978 Views

Hi 👋 I am trying to set up a trains server on GCP. I followed all the steps listed here https://allegro.ai/docs/deploying_trains/trains_server_gcp/ . I also...

clearml

4 years ago

0 Votes

12 Answers

916 Views

Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

Hi all! Is there a way for trains to recognize the CLI arguments when using https://github.com/google/python-fire instead of argparse?

clearml

4 years ago

0 Votes

2 Answers

993 Views

Hi! I Changed From Trains To Clearml And Ran Some Experiments Using Keras But It Seems The Metrics Are Not Being Tracked Automagically, Has Anyone Run Into The Same Issue? I Can Even See The Metrics On The Progress Bar During The Fit Process.

Hi! I changed from trains to clearml and ran some experiments using keras but it seems the metrics are not being tracked automagically, has anyone run into t...

clearml

3 years ago

0 Votes

10 Answers

931 Views

I Am Also Experiencing A Weird Behaviour When Running A Script Using The Module Flag. For Example I Run:

I am also experiencing a weird behaviour when running a script using the module flag. For example I run: python -m module.script arg1 arg2 And after the scri...

clearml

4 years ago

0 Votes

2 Answers

954 Views

I Am Trying To Upgrade From Clearml Server 0.16 To The Newest Version But I Am Getting Some Errors When Spinning Up The New Containers:

I am trying to upgrade from clearml server 0.16 to the newest version but I am getting some errors when spinning up the new containers: WiredTiger error (-31...

clearml

3 years ago

0 Votes

6 Answers

1K Views

Hi! I Am Saving Some Intermediate

Hi! I am saving some intermediate .pt files on the experiments and clearml automatically detects them as models, this makes the clearml.model - INFO message ...

clearml

3 years ago

0 Votes

7 Answers

971 Views

Hi! I Am Trying To Download Data From Gs Using

Hi! I am trying to download data from GS using StorageManager.get_local_copy(). It works fine when I point it to a file, i.e. gs://bucket/dataset/image.png bu...

clearml

4 years ago

0 Votes

13 Answers

922 Views

Hi, I Was Getting A Really Weird Error Due To A Mismatch In The Versions Between The Installed Libraries In My Environment And The Ones Run On The Node (I Manually Changed The Installed Packages And Everything Worked). How Can I Force Trains To Use Exactly

Hi, I was getting a really weird error due to a mismatch in the versions between the installed libraries in my environment and the ones run on the node (I manu...

clearml

4 years ago

0 Votes

3 Answers

956 Views

Hi! I Have Some Clearml Agents On Gcp And Sometimes The Instance Seems To Reboot Making The Experiment Fail And All The Progress Is Lost. What Is The Best Way To Resume An Experiment?

Hi! I have some ClearML agents on GCP and sometimes the instance seems to reboot making the experiment fail and all the progress is lost. What is the best wa...

clearml

2 years ago

0 Votes

7 Answers

1K Views

Hi! If I Have A Pipeline On Gitlab That Uses Clearml For Some Tests Is There Some Way To Set Up The Credentials So That It Doesn't Fail?

Hi! If I have a pipeline on gitlab that uses ClearML for some tests, is there some way to set up the credentials so that it doesn't fail?

clearml

3 years ago

0 Votes

2 Answers

943 Views

Hi! What Would Be The Way For Manually Uploading A Model? I Have Intermediate

Hi! What would be the way for manually uploading a model? I have intermediate .pt files which I don't want to upload. Is there a way to turn off clearml capt...

clearml

3 years ago

0 Votes

11 Answers

992 Views

Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

Hi! Is there a way to run a task without reporting to the server? For example if I want to debug a script by running it locally without it appearing on the s...

clearml

3 years ago

0 Votes

4 Answers

954 Views

Hi! If I Have A Folder With Multiple

Hi! If I have a folder with multiple ckpt files would the manual way to upload them be the following: output_model = OutputModel(task) output_model.update_we...

clearml

2 years ago

0 Votes

30 Answers

962 Views

Hi! Is There Something Happening With The

Hi! Is there something happening with the ModelCheckpoint callback on tensorflow==2.4.0 ? Using 2.2.0 gave me an input model on the artifacts tab in the GUI 😢

clearml

3 years ago

0 Votes

21 Answers

962 Views

Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

Hi! Any idea why clearml fails to detect iteration reporting? ClearML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-fr...

clearml

3 years ago

0 Votes

4 Answers

1K Views

Hi, Is There A Way To Force The Requirements.Txt? I Have A Package I Installed Directly From Github But The Version Is Always Wrong. Any Other Way To Do This?

Hi, is there a way to force the requirements.txt? I have a package I installed directly from github but the version is always wrong. Any other way to do this?

clearml

3 years ago

0 Votes

3 Answers

996 Views

Hi! I Am Trying To Run Some Experiments On An Agent I Have Configured To Use The Requirements.Txt The Problem Is It Only Shows Cython On The List Of Installed Packages. It Crashes Due To Missing Packages.

Hi! I am trying to run some experiments on an agent I have configured to use the requirements.txt the problem is it only shows Cython on the list of installe...

mlops

3 years ago

0 Votes

5 Answers

1K Views

Hi 👋 I am logging some figures on pytorch lightning using the example here. The figures are correctly saved on Tensorboard's images tab but unfortunately ar...

pytorch

2 years ago

0 Votes

10 Answers

952 Views

Hi! I Am Getting The Following Error On An Agent:

Hi! I am getting the following error on an agent: /usr/local/bin/python3.8: No module named virtualenv clearml_agent: ERROR: Command '['python3.8', '-m', 'vi...

clearml

2 years ago

0 Votes

2 Answers

944 Views

Hi! While Restarting The Server

Hi! While restarting the server I got ERROR: for agent-services removal of container 8f1d8539340d6d073eb5b51294f5f5d802048a3614d459b5c4fb1d38a05ce538 is alr...

clearml

3 years ago

0 Votes

8 Answers

820 Views

Hello

Hello 👋 I am using a self-hosted clearml setup using the requirements file of the project. When I run the task it is failing and I get: Collecting torch==2.0...

clearml

one year ago

0 Votes

7 Answers

1K Views

Hi! I Am Currently Using Hydra+Clearml And Wanted To Know If There Are Still Some Updates Coming. At The Moment, If I Change The Defaults Hydra Uses From The

Hi! I am currently using Hydra+ClearML and wanted to know if there are still some updates coming. At the moment, if I change the defaults hydra uses from the...

clearml

3 years ago

0 Votes

3 Answers

920 Views

Hi! I Recently Updated My Server And My Clearml Version, Now When I Set A Task To Be Executed Remotely Its Default State Is Aborted Hence I Have To Reset And Enqueue, Is There Something I Am Doing Wrong (I Am Using Hydra Too)?

Hi! I recently updated my server and my clearml version, now when I set a task to be executed remotely its default state is aborted hence I have to reset and...

clearml

3 years ago

0 Votes

5 Answers

1K Views

Hi! I Was Taking A Look At The

Hi! I was taking a look at the https://pytorch-lightning.readthedocs.io/en/latest/common/lightning_cli.html and wanted to know if anyone has used clearml wit...

clearml

3 years ago

0 Votes

4 Answers

946 Views

Hi! I Am Having Some Problems With A Loss After A Good Amount Of Training, What Would Be The Best Way To Log A Value To Have A Better Idea Of What Is Happening?

Hi! I am having some problems with a loss after a good amount of training, what would be the best way to log a value to have a better idea of what is happening?

clearml

2 years ago

0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

I set the number to a crazy value and it fails around the same iteration

3 years ago

0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

Last question CostlyOstrich36, sorry to poke you! It seems that even if I set an extremely long time, it still fails when the first plots are reported. The first plots are generated automatically by pytorch lightning and track the CPU and GPU usage. Do you think this could be the cause, or should it detect the iteration anyway?

3 years ago

0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

AgitatedDove14 task.set_archived(True) + the cleanup service should do it 👌 If we run in debug mode the experiment goes directly to the archive and gets cleaned and we don’t pollute the main experiment page.
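A minimal sketch of the debug-mode idea described above, assuming `task` is a ClearML `Task` object (the post calls `task.set_archived(True)` on it). The helper name `archive_if_debug` and the `DEBUG_RUN` environment variable are hypothetical, introduced only for illustration; the cleanup service itself is not shown.

```python
import os


def archive_if_debug(task, debug=None):
    """Archive `task` when running in debug mode.

    `task` is assumed to expose `set_archived(bool)`, as a ClearML Task does.
    `DEBUG_RUN` is a hypothetical environment variable, not a ClearML setting.
    Returns True if the task was archived, False otherwise.
    """
    if debug is None:
        debug = os.environ.get("DEBUG_RUN", "0") == "1"
    if debug:
        # Sends the experiment straight to the archive so it does not
        # show up on the main experiment page.
        task.set_archived(True)
    return debug


# Possible usage with a real task (requires clearml and a configured server):
#   from clearml import Task
#   task = Task.init(project_name="my-project", task_name="debug run")
#   archive_if_debug(task)
```

Keeping the debug check in one helper means regular runs are untouched, and archived debug runs can later be removed by whatever cleanup process is in place.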

3 years ago

0 Hi! Is There Something Happening With The

This works:

    filepath = self.log_dir + os.sep + "checkpoint"
    self.callbacks.append(
        ModelCheckpoint(
            filepath,
            monitor="val_loss",
            mode="min",
            save_best_only=True,
            save_weights_only=True,
        )
    )

And this doesn’t:

    filepath = self.log_dir + os.sep + "checkpoint.hdf5"
    self.callbacks.append(
        ModelCheckpoint(
            filepath,
            ...

3 years ago

0 Hello

Sure! For torch I have:

torch==2.0.1
    # via
    #   monai
    #   pytorch-lightning
    #   torchio
    #   torchmetrics

one year ago
