GrievingTurkey78
Moderator
34 Questions, 125 Answers
  Active since 10 January 2023
  Last activity 8 months ago

Reputation

0

Badges 1

119 × Eureka!
0 Votes
7 Answers
591 Views
Hi! I am currently using Hydra+ClearML and wanted to know if there are still some updates coming. At the moment, if I change the defaults hydra uses from the...
3 years ago
0 Votes
17 Answers
529 Views
3 years ago
0 Votes
10 Answers
520 Views
I am also experiencing a weird behaviour when running a script using the module flag. For example I run: python -m module.script arg1 arg2 And after the scri...
3 years ago
0 Votes
15 Answers
519 Views
Hi 👋 I am trying to set up a trains server on GCP. I followed all the steps listed here https://allegro.ai/docs/deploying_trains/trains_server_gcp/ . I also...
3 years ago
0 Votes
8 Answers
414 Views
Hello 👋 I am using a self-hosted clearml setup using the requirements file of the project. When I run the task it is failing and I get: Collecting torch==2.0...
8 months ago
0 Votes
2 Answers
511 Views
Hi! I changed from trains to clearml and ran some experiments using keras, but it seems the metrics are not being tracked automagically. Has anyone run into t...
3 years ago
0 Votes
4 Answers
552 Views
Hi! I am having some problems with a loss after a good amount of training, what would be the best way to log a value to have a better idea of what is happening?
2 years ago
0 Votes
3 Answers
556 Views
Hi! I have some ClearML agents on GCP and sometimes the instance seems to reboot making the experiment fail and all the progress is lost. What is the best wa...
2 years ago
0 Votes
6 Answers
520 Views
Hi! I have some agents on GCP. Lately I have been getting some experiments that simply stop running (no signs that the experiment crashed). Here is a plot th...
2 years ago
0 Votes
2 Answers
545 Views
Hi! What would be the way for manually uploading a model? I have intermediate .pt files which I don't want to upload. Is there a way to turn off clearml capt...
2 years ago
0 Votes
2 Answers
530 Views
Hi AgitatedDove14! Regarding the Hydra integration, which pattern should be used? Call the task inside the decorated function? Will this store the parameter...
3 years ago
0 Votes
1 Answer
536 Views
Quick question on the clearml-data package: can I add files to a dataset from Google Storage instead of having to download them?
3 years ago
0 Votes
6 Answers
568 Views
Hi! I am saving some intermediate .pt files on the experiments and clearml automatically detects them as models, this makes the clearml.model - INFO message ...
2 years ago
0 Votes
5 Answers
541 Views
Hi! I was taking a look at the https://pytorch-lightning.readthedocs.io/en/latest/common/lightning_cli.html and wanted to know if anyone has used clearml wit...
2 years ago
0 Votes
3 Answers
558 Views
Hi! I am trying to run some experiments on an agent I have configured to use the requirements.txt the problem is it only shows Cython on the list of installe...
2 years ago
0 Votes
2 Answers
562 Views
I am trying to upgrade from clearml server 0.16 to the newest version but I am getting some errors when spinning up the new containers: WiredTiger error (-31...
2 years ago
0 Votes
21 Answers
552 Views
Hi! Any idea why clearml fails to detect iteration reporting? ClearML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-fr...
2 years ago
0 Votes
7 Answers
548 Views
Hi! I am trying to download data from GS using StorageManager.get_local_copy() . It works fine when I point it to a file i.e gs://bucket/dataset/image.png bu...
3 years ago
0 Votes
13 Answers
510 Views
3 years ago
0 Votes
2 Answers
510 Views
Hi! Regarding the artifact.get_local_copy() method, since there is no way to specify the path where the artifact will be downloaded, I wanted to confirm that...
3 years ago
0 Votes
30 Answers
538 Views
Hi! Is there something happening with the ModelCheckpoint callback on tensorflow==2.4.0 ? Using 2.2.0 gave me an input model on the artifacts tab in the GUI 😢
3 years ago
0 Votes
7 Answers
645 Views
Hi! If I have a pipeline on gitlab that uses ClearML for some tests is there some way to setup the credentials so that it doesn’t fail?
2 years ago
0 Votes
2 Answers
540 Views
Hi! While restarting the server I got ERROR: for agent-services removal of container 8f1d8539340d6d073eb5b51294f5f5d802048a3614d459b5c4fb1d38a05ce538 is alr...
2 years ago
0 Votes
2 Answers
512 Views
Hi! I have the previous trains server configured with multiple experiments; I created it using the gcloud images provided. If I want to update the server to ...
2 years ago
0 Votes
2 Answers
545 Views
Hi all! Currently I am trying to create a tool that can perform certain operations on dataset ids, this is a skeleton of what I have in mind (based on the ex...
3 years ago
0 Votes
5 Answers
546 Views
Hi, with the upcoming version of Hydra it seems the binding breaks. Specifically in the run_job function the argument order changed from https://github.com/f...
2 years ago
0 Votes
3 Answers
550 Views
Hi! I recently updated my server and my clearml version, now when I set a task to be executed remotely its default state is aborted hence I have to reset and...
2 years ago
0 Votes
4 Answers
644 Views
Hi, is there a way to force the requirements.txt? I have a package I installed directly from github but the version is always wrong. Any other way to do this?
2 years ago
0 Votes
5 Answers
612 Views
Hi 👋 I am logging some figures on pytorch lightning using the example here. The figures are correctly saved on Tensorboard's images tab but unfortunately ar...
2 years ago
0 Votes
11 Answers
589 Views
Hi! Is there a way to run a task without reporting to the server? For example if I want to debug a script by running it locally without it appearing on the s...
2 years ago
0 Hi! I Was Taking A Look At The

Yes AgitatedDove14, I am not sure what they use by default. Here is a simple working example:
` from typing import Optional

import torch
from clearml import Task
from pytorch_lightning import LightningDataModule, LightningModule
from pytorch_lightning.utilities.cli import LightningCLI
from torch.utils.data import DataLoader, Dataset, Subset


class RandomDataset(Dataset):
    def __init__(self, size, length):
        self.len = length
        self.data = torch.randn(length, size)

    def ...
2 years ago
0 Hi! I Was Taking A Look At The

Nice catch AgitatedDove14! Sure, I’ll open the issue right now.

2 years ago
0 Hi! I Am Trying To Run Some Experiments On An Agent I Have Configured To Use The Requirements.Txt The Problem Is It Only Shows Cython On The List Of Installed Packages. It Crashes Due To Missing Packages.

` [package_manager.force_repo_requirements_txt=true] Skipping requirements, using repository "requirements.txt"
Using base prefix '/opt/conda'
New python executable in /home/ramon/.clearml/venvs-builds/3.7/bin/python3.7
Also creating executable in /home/ramon/.clearml/venvs-builds/3.7/bin/python
Installing setuptools, pip, wheel...
2021-06-10 09:57:56
done.
Collecting pip<20.2
Using cached pip-20.1.1-py2.py3-none-any.whl (1.5 MB)
Installing collected packages: pip
Attempting uninstall: p...

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

I'll give that a try! Thanks CostlyOstrich36

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

CostlyOstrich36 That seemed to do the job! No message after the first epoch, with the caveat of losing resource monitoring. Any idea what could be causing this? Does the iteration detection fail if the resource monitor is the first plot? Are there any hacks to keep the resource monitoring? Thanks a lot! 🙌

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

Sure! Could you point me to how it's done?

2 years ago
0 Hi! I Am Saving Some Intermediate

So I would have to disconnect pytorch, and then upload the model at the end?

2 years ago
0 Hi! I Am Saving Some Intermediate

Thanks! This should work perfectly 👌

2 years ago
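For the intermediate .pt files thread above, "disconnect pytorch and upload the model at the end" could look roughly like the sketch below. It assumes ClearML's `auto_connect_frameworks` argument to `Task.init` and `OutputModel.update_weights`; the helper names `frameworks_without` and `upload_final_model`, and the project, task and file names in the usage comment, are hypothetical.

```python
def frameworks_without(*disabled):
    """Build an auto_connect_frameworks mapping that switches off the
    given frameworks (hypothetical helper; unlisted frameworks keep
    their default behaviour)."""
    return {name: False for name in disabled}


def upload_final_model(task, weights_path):
    """Register a single weights file as the task's output model.

    `task` is assumed to be a clearml.Task.
    """
    # Imported lazily so this sketch loads even without clearml installed.
    from clearml import OutputModel

    model = OutputModel(task=task)
    model.update_weights(weights_filename=weights_path)
    return model


# Usage (needs a configured ClearML setup; names are placeholders):
# from clearml import Task
# task = Task.init(
#     project_name="demo",
#     task_name="train",
#     auto_connect_frameworks=frameworks_without("pytorch"),
# )
# ... train and save intermediate .pt files without them being captured ...
# upload_final_model(task, "final.pt")
```

With automatic PyTorch capture off, only the file passed to `upload_final_model` should appear as a model on the task.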
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

Last question CostlyOstrich36, sorry to poke you! It seems that even if I set an extremely long time, it still fails when the first plots are reported. The first plots are generated automatically by pytorch lightning and track the CPU and GPU usage. Do you think this could be the cause, or should it detect the iteration regardless?

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

Oh I think I am wrong! Then it must be the clearml monitoring. Still it fails way before the timer ends.

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

I set the number to a crazy value and it fails around the same iteration

2 years ago
0 Hi! Any Idea Why Clearml Fails To Detect Iteration Reporting?

I set it to 200000! But the problem arises when the first plot is the clearml CPU and GPU monitoring. Were you able to reproduce it? Even with a fairly large number, the message appeared as soon as the monitoring plot was reported.

2 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

I feel it’s easier not to report than to clean up afterwards, but please correct me if I am overthinking it. I’ll check if I could wrap the code in something that calls Task.delete when debugging.

2 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

AgitatedDove14 Downloading a dataset would not be possible using this, right? I want to be able to access the data and just avoid reporting the experiment results.

2 years ago
0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

AgitatedDove14 task.set_archived(True) + the cleanup service should do it 👌 If we run in debug mode, the experiment goes directly to the archive and gets cleaned up, so we don’t pollute the main experiment page.

2 years ago
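The debug-mode idea in the answer above could be sketched roughly as follows. This is only an illustration: `finish_run` and the `debug` flag are hypothetical names, `task` is assumed to be a `clearml.Task` (anything exposing `set_archived` would work), and deleting archived experiments is left to a cleanup service configured separately.

```python
def finish_run(task, debug: bool) -> bool:
    """Archive the task when the script ran in debug mode.

    `task` is assumed to be a clearml.Task, which exposes set_archived().
    Returns True if the task was archived, False otherwise.
    """
    if debug:
        # Archived experiments stay off the main experiment page; a
        # separately configured cleanup service can delete them later.
        task.set_archived(True)
        return True
    return False
```

Because the task still reports to the server during the run, datasets and other server-side resources remain accessible while debugging.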
0 Hi! If I Have A Pipeline On Gitlab That Uses Clearml For Some Tests Is There Some Way To Setup The Credentials So That It Doesn’t Fail?

I need to fetch a dataset for some simple tests, but since the pipeline doesn’t have credentials for the self-hosted server, it won’t find the dataset.

2 years ago
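One way to approach the CI credentials question, sketched under assumptions: the ClearML SDK can read its server address and API keys from environment variables instead of a clearml.conf file, so they could be defined as masked CI/CD variables in GitLab. The helper `missing_clearml_vars` and the exact set of variables checked are illustrative choices; a given setup may also need others, such as `CLEARML_FILES_HOST`.

```python
import os

# Environment variables the ClearML SDK reads in place of clearml.conf.
# Which ones a given pipeline needs is an assumption of this sketch.
REQUIRED_VARS = (
    "CLEARML_API_HOST",
    "CLEARML_API_ACCESS_KEY",
    "CLEARML_API_SECRET_KEY",
)


def missing_clearml_vars(env=None):
    """Return the names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]
```

Calling `missing_clearml_vars()` at the start of the test job and failing early when the list is non-empty gives a clearer error than a dataset lookup that fails later.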
0 Hi! Is There Something Happening With The

This works:
filepath = self.log_dir + os.sep + "checkpoint"
self.callbacks.append(
    ModelCheckpoint(
        filepath,
        monitor="val_loss",
        mode="min",
        save_best_only=True,
        save_weights_only=True,
    )
)
And this doesn’t:
` filepath = self.log_dir + os.sep + "checkpoint.hdf5"
self.callbacks.append(
    ModelCheckpoint(
        filepath,
        ...

3 years ago
0 Hi! Is There Something Happening With The

Hey AgitatedDove14, after playing around it seems that if the callback filepath points to an hdf5 file, it is not uploaded.

3 years ago
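Based only on the observation above (a filepath without the hdf5 extension was picked up, one with it was not), a small workaround could be sketched like this. `checkpoint_filepath` is a hypothetical helper, not part of ClearML or Keras, and it is not a fix for the underlying behaviour.

```python
import os


def checkpoint_filepath(log_dir: str, name: str = "checkpoint") -> str:
    """Build a ModelCheckpoint path, dropping a trailing .hdf5/.h5 extension."""
    stem, ext = os.path.splitext(name)
    if ext.lower() in (".hdf5", ".h5"):
        # Drop the extension that was observed not to be uploaded.
        name = stem
    return os.path.join(log_dir, name)
```

Note that in TF 2.x dropping the extension also changes the on-disk format Keras uses when saving weights, so anything that loads the checkpoints has to expect that format.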