Hey there, happy new year to all of you 🍾 I have several tasks that are stuck while training a model with pytorch/ignite, more precisely right after uploadi...
4 years ago
Continuing on https://allegroai-trains.slack.com/archives/CTK20V944/p1607012505242500 we'd like to minimize startup time for the agent-started experiments si...
4 years ago
Hey, great Product! I've installed trains agent on a python3 venv, but when I run a script on the worker, it calls python2 instead of python 3. How to change...
4 years ago
Hi guys, I would like to start using the AWS autoscaler shipped in trains. I need to create a IAM user to get and I would like to know what are the minimal p...
4 years ago
Hi! In my project I need to run a lot of experiments on different subsets of my trainset, collect score and perform some calculations based on it. I have mai...
4 years ago
Hi, I was running a trains agent (version 1.16.1 ) on a remote machine. I notice that even if in the trains.conf agent.git_user, agent.git_pass was set, the ...
4 years ago
Hello there, I would like to do run cleanup code in case the user aborts one task from the dashboard (the agent is not using the task in docker). What signal...
4 years ago
Hey guys. I tried running the pytorch mnist example on a train-agent by running it locally and then resetting the experiment and then enqueue-ing it to the d...
Quick question on trains-agent and HPO. Say I have 10 experiments enqueued to a trains-agent . I understand the agent runs the experiment one-by-one. But can...
4 years ago
Hi guys, Last night one of our agents (0.16.1) was disconnected from our trains-server while executing an experiment. I saw that because the experiment it wa...
4 years ago
I'm looking to utilize the Trains AWS autoscaler functionality, but after going through its docs a few times I still don't get it. Ultimately, my setup is th...
4 years ago
Thank you for your help so far. I have a question about trains authentication and privacy when deploying on k8s. I want integrate building a trains-server in...
4 years ago
Hey guys, another question about deploying my own trains server. I have a trains-server deployed on my k8s cluster using the trains helm chart (which is awes...
4 years ago
Hey there, is there a way to access the trains configuration programmatically at runtime in a task (the configuration that is dumped by the agent in the logs...
4 years ago
Hi, I'm using the dockerized version of trains get an understanding of trains. While trying to play with the trains.conf settings in ~/trains.conf I got in a...
4 years ago
Hi, I need your help setting up an trains agent running in docker. I have an python script calling wget as system command which runs fine on my dev engine. W...
4 years ago
Hi everyone, Looking for ML management tools I stumbled upon Trains, I must say it has been awesome so far. I just have a (probably stupid) question: I'm try...
4 years ago
Hi guys, Firstly, thank you for your efforts and your support. I'm trying to use allegro trains to handle the experiments of a git repo. The repo is structur...
4 years ago
Sorry for the bombarding with errors.. but here comes another one 🙂 I have torch installed locally (through the transformers library) and when sending it to...
4 years ago
hey, trying to use trains-agent to run an experiment on my computer. when trying to execute a job from the queue on my agent im getting an error that numpy i...
4 years ago
i am using 0.16.1 version of server and 0.16.2 version of agent
4 years ago
Hello, I have a question, is it possible to create multiple train-agent per gpu? I see cases of multiple gpu's per agent on the git page but I'm wondering if...
4 years ago
Hi there, I am trying to start an agent in services mode with trains-server being on localhost (but not started together with the docker-compose!). My trains...
5 years ago
Hey Trains riders, This must be something simple I am missing, but still I couldn't realize what the problem is. I am trying to run trains-agent on my experi...
5 years ago
Hey, I have one question regarding the cleanup_service task in the DevOps project: Does it assume that the agent in services mode is in the trains-server mac...
5 years ago
Hello everyone! I'd like to mount some data on trains agent into docker container directory that contains cloned source code from repo, like this: BASE DOCKE...
5 years ago
Hi, I have an agent that is running two experiments at the same time: one that was running for a long time (11h) and one that the agent picked up afterwards,...
5 years ago