AdventurousButterfly15

17 Questions, 77 Answers

Active since 10 January 2023

Last activity 2 years ago

Reputation

Badges 1

75 × Eureka!

Questions 17
Answers 77

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

When I Set Agent Management To Conda It Tries To Create Envs With Python 3.1 And Fails.

When I set agent management to conda it tries to create envs with python 3.1 and fails. Executing Conda: /home/adamastor/anaconda3/bin/conda create --yes --m...

mlops

3 years ago

0 Votes

5 Answers

2K Views

0 Votes 5 Answers 2K Views

Has Anyone Had Success Using Clearml With Huggingface Models? I Create My Hf

Has anyone had success using clearml with huggingface models? I create my HF Trainer with the ClearML callback, but the only thing I get in the logs is this ...

clearml

2 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

How Can I Extend A Dataset? On Community Edition I Have An Existing Dataset, I Want To Add Some Files And Make A New Version. I Tried Just Doing A

How can I extend a dataset? On community edition I have an existing dataset, I want to add some files and make a new version. I tried just doing a Dataset()....

clearml

2 years ago

0 Votes

22 Answers

2K Views

0 Votes 22 Answers 2K Views

Why Does My Task Execution Freeze After Pip Installation (Running Agent In Foreground Mode)? The Task Is:

Why does my task execution freeze after pip installation (running agent in foreground mode)? The task is: from clearml import Task task = Task.init(project_n...

mlops

3 years ago

0 Votes

21 Answers

2K Views

0 Votes 21 Answers 2K Views

Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

Do I understand correctly that python versions must match between client (my mac, sends task for remote execution) and clearml-agent? I don’t really get how ...

mlops pytorch

3 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

When I Run A Task With

When I run a task with Dataset.get the agent requests the dataset from a weird url. adamastor.gaiavf.local in this case. 2022-10-03 17:50:17,556 - clearml.st...

mlops

3 years ago

0 Votes

1 Answers

2K Views

0 Votes 1 Answers 2K Views

I Am Saving A Model With Pickle, But It Doesn’T Show Up As An Artifact. Why?

I am saving a model with pickle, but it doesn’t show up as an artifact. Why? Task.init(..., output_uri=True) model = SklearnPipeline() ... pickle.dump(model,...

clearml

2 years ago

0 Votes

14 Answers

2K Views

0 Votes 14 Answers 2K Views

Clearml Task Execution Fails Trying To Pull Data From Gitlab. The Credentials Are Correct (Username + Access Token), But I Get This Error:

ClearML task execution fails trying to pull data from Gitlab. The credentials are correct (username + access token), but I get this error: remote: HTTP Basic...

clearml

2 years ago

0 Votes

6 Answers

2K Views

0 Votes 6 Answers 2K Views

I Am Getting

I am getting CLEARML Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start . How do fix it? The docs are not helpfu...

pytorch

2 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

When I Try To Run Any Task The Agent Tries To Mount Something Vscode Related:

When I try to run any task the agent tries to mount something vscode related: 683637074988 adamastor:gpuall INFO Executing: ['docker', 'run', '-t', '--gpus',...

mlops

2 years ago

0 Votes

4 Answers

2K Views

0 Votes 4 Answers 2K Views

How Can I Stop Clearml From Uploading Temporary Models? I Am Running Cross_Validation, Training A Bunch Of Models In A Loop Like This:

How can I stop clearml from uploading temporary models? I am running cross_validation, training a bunch of models in a loop like this: models = [] for X_trai...

clearml

2 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

Is There A Way To Clear Clearml Caches, Maybe Some Command? My Server Ran Out Of Space And I Lost A Whole Weekend Of Training. My

Is there a way to clear ClearML caches, maybe some command? My server ran out of space and I lost a whole weekend of training. My venvs-cache folder was over...

clearml

2 years ago

0 Votes

22 Answers

2K Views

0 Votes 22 Answers 2K Views

Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

clearml-session fails ssh tunneling. It does not use key auth, instead sets up some weird password and then fails to auth: Remote machine is ready Setting up...

remote-ssh

3 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

Hey, Loving Clearml So Far. I Create An Agent With 1 Gpu And I Am Sending A Task To It. But It Says That It Couldn’T Create A Docker With Gpu Access. How Can I Fix That?

Hey, loving ClearML so far. I create an agent with 1 gpu and I am sending a task to it. But it says that it couldn’t create a docker with gpu access. How can...

mlops

3 years ago

0 Votes

11 Answers

3K Views

0 Votes 11 Answers 3K Views

I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I am trying to do a remote execution of a test task, but it fails during env setup due to trying to install an obscure version of pytorch. Been trying to sol...

mlops pytorch

3 years ago

0 Votes

5 Answers

2K Views

0 Votes 5 Answers 2K Views

I Am Training A Model With Pytorch Lightning. I Save

I am training a model with Pytorch lightning. I save .ckpt checkpoints. But they never get uploaded to clearml! How do I make clearml detect the checkpoints?...

clearml

2 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

I Trained A Model, Saved It. Now I Am Trying To Access It From Another Machine, But The Model Url Is A Local Path. How Can I Download Models From Clearml?

I trained a model, saved it. Now I am trying to access it from another machine, but the model url is a local path. How can I download models from Clearml?

clearml

2 years ago

0 Is There A Way To Clear Clearml Caches, Maybe Some Command? My Server Ran Out Of Space And I Lost A Whole Weekend Of Training. My

Thanks!

2 years ago

0 Has Anyone Had Success Using Clearml With Huggingface Models? I Create My Hf

I also use TB.

I solved the issue by implementing my own ClearML logger

2 years ago

0 Do I Understand Correctly That Python Versions Must Match Between Client (My Mac, Sends Task For Remote Execution) And Clearml-Agent? I Don’T Really Get How The Environments Are Managed. All I Want To Do Is Take My Code And Execute It On The Agent Machin

What I am seeing is that the agent always fails trying to install some packages when I am not asking it at all

3 years ago

I have no idea what it is doing

3 years ago

0 I Am Training A Model With Pytorch Lightning. I Save

I did that, but it didnt work

2 years ago

0 I Am Training A Model With Pytorch Lightning. I Save

Yes

2 years ago

CostlyOstrich36 in installed packages it has:
` # Python 3.10.6 | packaged by conda-forge | (main, Aug 22 2022, 20:41:22) [Clang 13.0.1 ]

Pillow == 9.2.0
clearml == 1.7.1
minio == 7.1.12
numpy == 1.23.1
pandas == 1.5.0
scikit_learn == 1.1.2
tensorboard == 2.10.1
torch == 1.12.1
torchvision == 0.13.1
tqdm == 4.64.1 `Which is the same as I have locally and on the server that runs clearml-agent

3 years ago

Yeah, pytorch is a must. This script is a testing one, but after this I need to train stuff on GPUs

3 years ago

0 I Trained A Model, Saved It. Now I Am Trying To Access It From Another Machine, But The Model Url Is A Local Path. How Can I Download Models From Clearml?

Thanks, I will try

2 years ago

0 I Am Getting

Version: 1.11.1

2 years ago

0 Hey, Loving Clearml So Far. I Create An Agent With 1 Gpu And I Am Sending A Task To It. But It Says That It Couldn’T Create A Docker With Gpu Access. How Can I Fix That?

I am doing clearml-agent --docker … --foreground --gpus 1

3 years ago

0 When I Set Agent Management To Conda It Tries To Create Envs With Python 3.1 And Fails.

CostlyOstrich36 CLEARML-AGENT version 1.3.0

3 years ago

0 How Can I Stop Clearml From Uploading Temporary Models? I Am Running Cross_Validation, Training A Bunch Of Models In A Loop Like This:

@<1523701205467926528:profile|AgitatedDove14> thanks!

2 years ago

Locally I have a conda env with some packages and a basic requirements file.
I am running this thing:
` from clearml import Task, Dataset
task = Task.init(project_name='Adhoc', task_name='Dataset test')
task.execute_remotely(queue_name="gpu")

from config import DATASET_NAME, CLEARML_PROJECT
print('Getting dataset')

dataset_path = Dataset.get(
dataset_name=DATASET_NAME,
dataset_project=CLEARML_PROJECT,
).get_local_copy()#.get_mutable_local_copy(DATASET_NAME)

print('Dataset path', d...

3 years ago

0 How Can I Stop Clearml From Uploading Temporary Models? I Am Running Cross_Validation, Training A Bunch Of Models In A Loop Like This:

"realmodelonly.pkl"

should be the full path, or just the file name?

2 years ago

Pytorch is configured on the machine that’s running the agent. It’s also in requirements

3 years ago

0 Hey, Loving Clearml So Far. I Create An Agent With 1 Gpu And I Am Sending A Task To It. But It Says That It Couldn’T Create A Docker With Gpu Access. How Can I Fix That?

The issue was that nvidia-docker2 was not installed on the machine where I was trying to run the agent. Following this guide fixed it:
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker

3 years ago

0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

Despite having manually installed this torch version, during task execution agent still tries to install it somehow and fails:
INFO:clearml_agent.commands.worker:Downloading " " to pip cache Collecting torch==1.12.1+cu116 File was already downloaded /home/boris/.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl Successfully downloaded torch INFO:clearml_agent.commands.worker:Downloading " ` " to pip cache
Collecting torchvision==0.13.1+cu116
File was...

3 years ago

0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I resolved the issues by making my own docker image and making all envs the same:
The env that runs clearml-agent The docker env for running tasks in The env that requests task execution (my client)

3 years ago

0 How Can I Extend A Dataset? On Community Edition I Have An Existing Dataset, I Want To Add Some Files And Make A New Version. I Tried Just Doing A

Thanks

2 years ago

0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

I mean if I enter my host machine ssh password it works. But we will disable password auth in future, so it’s not an option

3 years ago

0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

The task log is here:
the log on my local machine is here:

3 years ago

0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

All ports are open (both agent machine and client machine are working within same VPN)

3 years ago

0 I Am Getting

I dont have a short version.
I am using community clearml. How do I find out my version?

2 years ago

0 Is There A Way To Get A Task'S Docker Container Id/Name? I'M Generally Interested In Resource Profiling Of Each Container, So I Noticed I Can Use

For a hacky way you can do docker ps and see the docker run command. I believe it contains the task id, so you can grep by task id

3 years ago

0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

I will try soon

3 years ago

0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

Ok, it makes sense. But it’s running in docker mode and it is trying to ssh into the host machine and failing

3 years ago

0 Clearml-Session Fails Ssh Tunneling. It Does Not Use Key Auth, Instead Sets Up Some Weird Password And Then Fails To Auth:

I can telnet the port from my mac:
(base) *[main][~/Documents/plant_age]$ telnet 192.168.1.55 10022 Trying 192.168.1.55... Connected to 192.168.1.55. Escape character is '^]'. SSH-2.0-OpenSSH_8.4p1 Debian-5+deb11u1 ^C

3 years ago

0 I Am Trying To Do A Remote Execution Of A Test Task, But It Fails During Env Setup Due To Trying To Install An Obscure Version Of Pytorch. Been Trying To Solve This For Three Days! The Script:

Also manually installing this torch version succeeds:
` (base) boris@adamastor:~$ python3.10 -m pip install /home/boris/.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Processing ./.clearml/pip-download-cache/cu117/torch-1.12.1+cu116-cp310-cp310-linux_x86_64.whl
Requirement already satisfied: typing-extensions in ./miniconda3/lib/python3.10/site-packages (from torch==1.12.1+cu116) (4.3.0)
Installing collected packages: torch
Attempting uninstall: torch
...

3 years ago

Here’s the agent config. It’s basically default
https://justpaste.it/4ozm3

3 years ago

Show more results