hey, how can I add a private key in order to let the clearml agent to clone from a private git repository?
3 years ago
Hello! I am trying to play around with the platform in order to gain some understanding of it. I am using this example: https://github.com/allegroai/clearml/...
3 years ago
Hello everyone, I have a question about SSH/credentials: Let's say I have multiple users / multiple ssh credentials that I do not want to share with the clea...
3 years ago
Hi, we are having an interesting issue here. We serve many users and each user has their own credentials in accessing the private git repo. We can't seem to ...
3 years ago
Hi, we are running on disconnected on prem with a k8s glue. When a pod is spawned, we noted that an apt-get command is performed on the pod. SHort of changin...
3 years ago
Hi, would like to check. So an agent pulled a docker image and install the pip dependencies on it. What if I have OS library dependencies as well? (Apt insta...
3 years ago
Hi, coming back with the venv caching: with the following setting: I call Task._update_requirements(["."]) setup.py has the following install_requires=["my-p...
3 years ago
folks, could you please clarify/help? I correct understand, if --docker is enable that will means every new experiments will be executed into dedicated agent...
3 years ago
Hi, I started my agent using. clearml-agent daemon --gpus 0 --queue gpu --docker --foreground, with the following parameters in clearml.conf. default_docker:...
3 years ago
Hi, how can i pass a env variable to the docker that's running the agent when i run this? I'm havving issues with the agent's git clone where it requires ssl...
3 years ago
Hi! I was wondering if it's possible for a clearml agent to create an environment from a conda environment.yml file every time an experiment is run
4 years ago
is there a way to copy the entire folder an agent in a machine (different from server machine) builds to the server machine?
4 years ago
Hi, I am trying to using ClearML agent version 0.16 ("trains_agent"), and I always get a silent exit after starting task execution: > Environment setup compl...
4 years ago
Hi, I'm trying out the clearml-agent on an Azure machine connecting to your managed server. I see the worker on the queue, and the job reaches it - but nothi...
Hi, in one of my agents with CUDA Version: 11.1 (from nvidia-smi), clearml agent 0.17.1 detects version 100 (I can see from experiments logs: agent.cuda_vers...
4 years ago
Hi folks, Is it possible to use an aws p3 instance (which as several GPUs) with one agent per GPU, all controlled through ClearML AWS AutoScheduler? So Clear...
Also is there a way to just run a Dockerfile? So for example my project includes a Dockerfile in the top-level dir. When sending to the agent I would like it...
4 years ago
I have a self-hosted clearm-server and and clearml-agent started with clearml-agent daemon --queue default --foreground I try to execute the following simple...
4 years ago
Hi, I'm using the aws autoscaler to spin instances. I'd like to use the clearml agent on the created instances with docker containers. However even if I set ...
Is there a way to interface with ClearML agent (CLI?) to handle model repositories and data versioning (but so, not experimentation, tight integration, pipel...
4 years ago
what is the python version an agent runs a task with? the one the agent was launched with, or the task? i.e. if Iaunch an agent with python 3.8.5, but a task...
4 years ago
Hi there. I'm following the training instructions for testing clearml agent ( https://allegro.ai/clearml/docs/docs/tutorials/tutorial_tuning_exp.html#step-3-...
4 years ago
Hi again, it seems like the aws autoscaler is not spinning instances with the EBS configuration I configured. Here is the configuration: resource_configurati...
Hi all, Would it be possible to make the aws autoscaler log each scale in/out operation in the console to help debugging/understanding the course of events?
Hi there, I have a problem with PyJWT: I am using trains==0.16.4 and trains-agent==0.16.3 in my agents. I installed PyJWT==1.7.1 in the agent (through extra_...
4 years ago
Very weird error, trying to run an experiment through an agent in docker mode, and I get this error docker: Error response from daemon: create /home/elior/De...
4 years ago
Hey there, happy new year to all of you 🍾 I have several tasks that are stuck while training a model with pytorch/ignite, more precisely right after uploadi...
4 years ago
Continuing on https://allegroai-trains.slack.com/archives/CTK20V944/p1607012505242500 we'd like to minimize startup time for the agent-started experiments si...
4 years ago
Hey, great Product! I've installed trains agent on a python3 venv, but when I run a script on the worker, it calls python2 instead of python 3. How to change...
4 years ago