Thank you! Although it's still really weird how it was failing silently - would it be worth changing the logging level for that error somewhere?
Thanks John, but is there a way to do this via the CLI?
Or is Task.init() the only way?
Also is there a way to disable this by default?
The reason I ask is that I want to send many jobs to a queue via the CLI, so I don't really want to be messing around with Task.init().
I've even tried renaming my files to *pth and *.data to stop this behaviour
Which auto_connect_* arg do I use, and what value do I set it to? At the end of my training run I'm making .png plots of everything in my test set, and I don't want these to be logged as artifacts.
It's not covered here either: None
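For reference, the argument in question is most likely auto_connect_frameworks on Task.init, which accepts either a bool or a per-framework dict of flags. A minimal sketch, assuming the .png plots come from matplotlib and the checkpoints from torch.save; the project and task names below are placeholders:

from clearml import Task

task = Task.init(
    project_name="my-project",   # placeholder
    task_name="my-experiment",   # placeholder
    # Per-framework overrides; anything not listed keeps its default (enabled).
    auto_connect_frameworks={
        "matplotlib": False,  # don't auto-capture matplotlib figures
        "pytorch": False,     # don't auto-upload torch.save checkpoints
    },
)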
Thanks Martin - will try that and see what I can find. Really appreciate your patience with this! 🙂
Hi @<1523701087100473344:profile|SuccessfulKoala55> thanks for the reply! The output above is from grep -i network /var/log/syslog on the machine running the agent. That's good to hear that ClearML is pretty resilient to network outages 🙂. Do you have any suggestions on how we can start tracking down the cause of this?
This is the only clue that was logged to the console in the ClearML server: 2024-11-21 06:57:13 Process terminated by user. The first errors in the agent logs appea...
Ok so my train.py now looks like this:
print("Before import")
from pathlib import Path
import hydra
import lightning as L
import torch
from coolname import generate_slug
from omegaconf import DictConfig
from src.datasets import JobDataModule
from src.models import JobModel
from src.utils import LogSummaryCallback, get_num_steps, prepare_loggers_and_callbacks
from clearml import Task
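# Print the name of every CUDA device visible to this process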
for i in range(torch.cuda.device_count()):
    print(torch.cuda.get_device_properties(i).name)
...
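For completeness, a hedged sketch of how the rest of such a train.py might continue, showing where Task.init would typically sit inside the Hydra entry point; the config path/name, the constructor signatures of JobDataModule and JobModel, and the trainer settings are assumptions, not taken from the original file (Hydra >= 1.2 is assumed for version_base):

@hydra.main(config_path="conf", config_name="train", version_base=None)
def main(cfg: DictConfig) -> None:
    # Create the ClearML task first so everything after it is captured.
    task = Task.init(
        project_name="my-project",   # placeholder
        task_name=generate_slug(2),  # human-readable run name from coolname
    )

    datamodule = JobDataModule(cfg)  # assumed constructor signature
    model = JobModel(cfg)            # assumed constructor signature

    trainer = L.Trainer(max_epochs=cfg.get("max_epochs", 1))
    trainer.fit(model, datamodule=datamodule)


if __name__ == "__main__":
    main()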