Is there a way to interface with ClearML agent (CLI?) to handle model repositories and data versioning (but so, not experimentation, tight integration, pipel...
3 years ago
what is the python version an agent runs a task with? the one the agent was launched with, or the task? i.e. if Iaunch an agent with python 3.8.5, but a task...
3 years ago
Hi there. I'm following the training instructions for testing clearml agent ( https://allegro.ai/clearml/docs/docs/tutorials/tutorial_tuning_exp.html#step-3-...
3 years ago
Hi again, it seems like the aws autoscaler is not spinning instances with the EBS configuration I configured. Here is the configuration: resource_configurati...
Hi all, Would it be possible to make the aws autoscaler log each scale in/out operation in the console to help debugging/understanding the course of events?
Hi there, I have a problem with PyJWT: I am using trains==0.16.4 and trains-agent==0.16.3 in my agents. I installed PyJWT==1.7.1 in the agent (through extra_...
3 years ago
Very weird error, trying to run an experiment through an agent in docker mode, and I get this error docker: Error response from daemon: create /home/elior/De...
3 years ago
Hey there, happy new year to all of you 🍾 I have several tasks that are stuck while training a model with pytorch/ignite, more precisely right after uploadi...
3 years ago
Continuing on https://allegroai-trains.slack.com/archives/CTK20V944/p1607012505242500 we'd like to minimize startup time for the agent-started experiments si...
3 years ago
Hey, great Product! I've installed trains agent on a python3 venv, but when I run a script on the worker, it calls python2 instead of python 3. How to change...
3 years ago
Hi guys, I would like to start using the AWS autoscaler shipped in trains. I need to create a IAM user to get and I would like to know what are the minimal p...
3 years ago
Hi! In my project I need to run a lot of experiments on different subsets of my trainset, collect score and perform some calculations based on it. I have mai...
3 years ago
Hi, I was running a trains agent (version 1.16.1 ) on a remote machine. I notice that even if in the trains.conf agent.git_user, agent.git_pass was set, the ...
4 years ago
Hello there, I would like to do run cleanup code in case the user aborts one task from the dashboard (the agent is not using the task in docker). What signal...
4 years ago
Hey guys. I tried running the pytorch mnist example on a train-agent by running it locally and then resetting the experiment and then enqueue-ing it to the d...
Quick question on trains-agent and HPO. Say I have 10 experiments enqueued to a trains-agent . I understand the agent runs the experiment one-by-one. But can...
4 years ago
Hi guys, Last night one of our agents (0.16.1) was disconnected from our trains-server while executing an experiment. I saw that because the experiment it wa...
4 years ago
I'm looking to utilize the Trains AWS autoscaler functionality, but after going through its docs a few times I still don't get it. Ultimately, my setup is th...
4 years ago
Thank you for your help so far. I have a question about trains authentication and privacy when deploying on k8s. I want integrate building a trains-server in...
4 years ago
Hey guys, another question about deploying my own trains server. I have a trains-server deployed on my k8s cluster using the trains helm chart (which is awes...
4 years ago
Hey there, is there a way to access the trains configuration programmatically at runtime in a task (the configuration that is dumped by the agent in the logs...
4 years ago
Hi, I'm using the dockerized version of trains get an understanding of trains. While trying to play with the trains.conf settings in ~/trains.conf I got in a...
4 years ago
Hi, I need your help setting up an trains agent running in docker. I have an python script calling wget as system command which runs fine on my dev engine. W...
4 years ago
Hi everyone, Looking for ML management tools I stumbled upon Trains, I must say it has been awesome so far. I just have a (probably stupid) question: I'm try...
4 years ago
Hi guys, Firstly, thank you for your efforts and your support. I'm trying to use allegro trains to handle the experiments of a git repo. The repo is structur...
4 years ago
Sorry for the bombarding with errors.. but here comes another one 🙂 I have torch installed locally (through the transformers library) and when sending it to...
4 years ago
hey, trying to use trains-agent to run an experiment on my computer. when trying to execute a job from the queue on my agent im getting an error that numpy i...
4 years ago
i am using 0.16.1 version of server and 0.16.2 version of agent
4 years ago