I am getting an error when cloning a TFS repository. We use HTTPS to clone and cannot use SSH yet. "Check if remote-worker has valid cr...
2 years ago
Another AWS autoscaler question. The docker-compose.yml automatically adds a ClearML agent to the services queue. When I run python aws_autoscaler.py --remot...
2 years ago
AWS autoscale question: can the autoscaler use the IAM role of the EC2 instance it’s running on rather than needing to be provided AWS keys?
2 years ago
Hi there! :) I have an issue regarding the get_local_copy(..) function of the Model class. Whenever this function is called in the course of a hyperparameter...
2 years ago
Can a ClearML agent work on a GPU machine in an environment with no internet access, where we have a proxy for pip packages and Ubuntu updates? If yes, how do we configure these pro...
2 years ago
When I try to run any task the agent tries to mount something vscode related: 683637074988 adamastor:gpuall INFO Executing: ['docker', 'run', '-t', '--gpus',...
2 years ago
Is there an autoscaler equivalent for K8s? That is, a service that will launch pods based on incoming requests?
2 years ago
My autoscaled instance fails when running "git clone" on a private repo. I do have the SSH key placed at /root/.ssh/id_rsa on the machine, and when I SSH int...
2 years ago
Hello, I have the following scenario: - I am running a clearml-agent on a Windows machine; - To run on the agents, I am using virtual envs; - In the file tha...
2 years ago
Well, we accidentally leaked some super powerful credentials today. Is there a way to not have the UI or console logs show our AWS access keys, git credentia...
2 years ago
@SuccessfulKoala55 Hi Jake, is there any way to create a ClearML token from the shell (Linux)? The reason I am asking is that I am trying t...
2 years ago
Hi! I have deployed a CM server + agent on a BM Kubernetes cluster and am trying to set the default docker image for running trains. I have specified the def...
2 years ago
The file is available in the same directory, and the agent is also available.
2 years ago
Hi all! I want to run my task remotely on an agent, but I'm having trouble with the requirements setup. I have a requirements.txt with many packages to insta...
2 years ago
Need help! I am able to train models from our local machines and log everything on the ClearML server without any issues, but the same training gets stuck when ...
2 years ago
Hi all, I'm executing a task remotely via a queue. I don't want it to cache the env or install anything before the run, just to run the task on the agent mach...
2 years ago
clearml-session is not working for me. The agent is also there, but it shows "clearml-session not found".
2 years ago
Hi, I am using two different environments for MLOps and model training. I have set up the agent in the training environment. I can start the queue using comman...
2 years ago
Hi everyone, I have a task that installs a Python requirement from Bitbucket over SSH. The requirements are defined in a requirements.txt file. In the agent, ...
2 years ago
Hi all, I have seen this error elsewhere in the channel, where the docker agent gets stuck at this warning and does not move forward: Requirement alread...
2 years ago
Hi all! I would first like to say how amazing ClearML is! I am trying to set up an agent with ClearML, but for some reason it does not allow the docker image ...
2 years ago
Hey channel, I would like to set up Kubernetes for serving my models only. Does it mean I can use the clearml-serving helm chart alone? What would be the use case...
2 years ago
Hi, I have a remote task that I want to run on some of the GPUs of the remote machine. I set up the agent on the remote machine on two of the GPUs. How do I que...
2 years ago
Hello channel, I am struggling a lot with an issue linked to the ClearML agent and AWS Autoscaler. This issue is very problematic and urgent, please help me out! ...
2 years ago
Hi all! I'm struggling with a specific scenario, maybe you could help. I have 2 machine types (for example titan/a100) and 10 types of models with a generic...
2 years ago
Hello, I'm trying to run training for a detectron2 model from a ClearML pipeline. I tried to install all the required packages using Task.add_requirements() ...
2 years ago
Hi, when running a training task on the ClearML agent, it installs packages with versions different from those in my requirements.txt. I've installed an envi...
2 years ago
Hello, I have the same issue as this GitHub issue. I tried setting up my AWS autoscaler conf file with the following params: sdk.development.store_uncom...
2 years ago