Hi everybody, I’m getting errors with automatic model logging on pytorch (running on a dockered agent). 2022-07-14 10:24:06,334 - clearml.storage - INFO - St...
one year ago
Hi clearML community I am working on a pytorch neural transfer project and I'm trying to use clearml agent for remote execution. I'm trying to install requir...
Proper way to upload artifacts Good day to all. I am running a pytorch lightning notebook with ClearML integration. As was advised in lightning example just ...
one year ago
Hi, can i ask how i can make Clearml-Datasets in comparison with PyTorch datasets/dataloader? In particular, pytorch dataloaders would be able to batch pull ...
2 years ago
Hi, folks !! I have two questions on pytorch lightning example in particular logging. The comment says “Connecting ClearML with the current process, from her...
2 years ago
Hi, when trying to use a remote agent to train a model, the initial environment setup on the remote machine fails because the list of requirements located in...
anyone having problems with ClearML slowing down pytorch experiments? auto_connect_framework={“pytorch”: False} helps, but it’s not a great solution. we thin...
2 years ago
Hi, Anyone also stuck with the exception encountered uploading pytorch model file? The dataset upload works fine, though.
2 years ago
Hello, I am working with a pytorch model, and I would like to save it as an Outputmodel. I read that Clearml has pytorch auto-logging ebabled automatically. ...
2 years ago
How do you save models and store the classes as well? When we save the model using pytorch we just save the state dict and not the custom classes for differe...
2 years ago
Has anybody used albumentations with pytorch in remote execution on clearml? The task is freezing during DataLoader initialization every time I put any augme...
2 years ago
Hi, I am trying to upload a model but I am getting the following error: 2021-07-05 17:21:14,625 - clearml.storage - ERROR - Failed uploading: cannot schedule...
2 years ago
So, here's a question. Does clearml automatically save everything necessary to continue training a pytorch language model? Specifically, I've been looking at...
2 years ago
Hi, I am running a file like this python train_it.py Task.force_requirements_env_freeze(False) task = Task.init(project_name='playground', task_name='base') ...
2 years ago
Did someone here already try the https://ngc.nvidia.com/catalog/containers/nvidia:pytorch with clearml? It seems like there is a base conda environmenet wher...
2 years ago
Hi there, congrats for releasing v1 😄 I observed that with pytorch ignite (4.2.0), the metrics of the validation engines are delayed by one epoch. I am not ...
2 years ago
Fyi: Conda installation of PyTorch is broken again. My old tasks which worked before now fail since they do not find torch. However, I can see in the executi...
3 years ago
Current configuration (clearml_agent v0.17.2rc4, location: /root/clearml.conf): ---------------------- agent.worker_id = dgxstation-2:gpu3 agent.worker_name ...
3 years ago
Hi, i shifted my clearml setup to an on-premise disconnected env, which has a pip repo setup. I noted this warning, Trying pip install: /root/.clearml/venvs-...
3 years ago
Hi, I'm trying out the clearml-agent on an Azure machine connecting to your managed server. I see the worker on the queue, and the job reaches it - but nothi...
Hi, how can I get the logs from the pytorch ignite early stopping handler to be logged in clearml?
3 years ago
Hi, i’m currently running clearml with pytorch and everytime i run into torch.load(os.path.join(root, self.feat_pt))there’ll be a message: Task connect, seco...
3 years ago
I am using pytorch lightning with ddp accelerator on 4 gpus, and I found every checkpoint is recorded 4 times on web UI with different ids. One is on default...
3 years ago
Just want to share with you something that really excited me - I'm doing a workshop for ClearML today (intro only, nothing that will be shocking for you guys...
3 years ago
Hey guys, I am setting up a new machine with two rtx 3070 GPUs where I created two agents (one for each GPU). On both agents, my experiments fail with error:...
3 years ago
Hey guys. I tried running the pytorch mnist example on a train-agent by running it locally and then resetting the experiment and then enqueue-ing it to the d...
Hi guys! For some reason I can't get Trains to log images to Debug Samples tab using Tensorboard SummaryWriter.add_image . Also tried explicit reporting, pyt...
3 years ago
Hi everyone! I've had a problem. But when I was describing it here it was solved. Maybe it will help someone. I use pytorch and training accidentally freezes...
3 years ago
I am looking for a "hello world" example to do 3 Tasks: data=preprocessData() //local processing model=trainModel(data) // on cloud processing on a custom co...
3 years ago