Hi folks, is it possible to use an AWS p3 instance (which has several GPUs) with one agent per GPU, all controlled through ClearML AWS AutoScheduler? So Clear...
Hi, I'm using the AWS autoscaler to spin up instances. I'd like to use the clearml agent on the created instances with docker containers. However, even if I set ...
Hi, I've been running the latest clearml server on AWS for a week now and I regularly run into the "Fetch Experiments failed" error. I can't really find any more inform...
4 years ago
Hi guys, since I am done with implementing the AWS autoscaler, I would like to share some pain points that I encountered in the process with the hope that th...
4 years ago
Hi again, it seems like the AWS autoscaler is not spinning up instances with the EBS configuration I specified. Here is the configuration: resource_configurati...
Hi all, Would it be possible to make the aws autoscaler log each scale in/out operation in the console to help debugging/understanding the course of events?
How can I integrate trains-server with the AWS EC2 API?
5 years ago
I'm having issues running trains-agent on my AWS instance; it seems unable to install pytorch... I have miniconda 4.8.2, python 3.7.6, trains-agent 0.14.1 i...
5 years ago