[Auto scaler / API client does not see tasks in queue] We had used the AWS auto scaler (based on the aws_autoscaler.py script in the repo) and it worked grea...
one year ago
How do I view Debug Samples images in the browser when the output_uri is on Google Cloud Storage? Unlike for AWS storage, I do not get a popup windo...
one year ago
[Task gets interrupted / aborted / reset when in offline mode] For local testing, we have added a --no-clearml option to our code that sets task.set_offline(...
2 years ago
Is it possible to run multiple agents on EC2 machines started by the Autoscaler? Or have the one agent run multiple Queue jobs at once? E.g. having the Autosc...
one year ago
[Caching of environment and storage when using AWS auto scaler] First off: We are aiming to set up ClearML for large-scale DL training for multiple projects...
2 years ago
[ClearML with PyTorch-based distributed training] Hi everyone! Is the combination of ClearML with torch.distributed.launch or torchrun actively supported? A ...
one year ago
[Injecting secrets into a ClearML Agent / accessing clearml.conf from within a Task] Hi everyone, we are using the ClearML AWS Autoscaler (still awesome 😉 )...
2 years ago
[Errors when migrating ClearML Server from AWS to GCP] Hi everyone! As we’re using ClearML quite a bit, we’d love to take it with us when migrating our cloud...
one year ago
[Instance AutoScaler for GCP] In case someone else is interested, we have built an AutoScaler for GCP, too. It works similarly to the AWS one in the ClearML re...
one year ago
Quick question: Is there a way for a task that is executing remotely to find out which ClearML queue it is in or was in?
one year ago
Hi everyone, quick question: Is there any easy way to get a task's full output directory? E.g. when I create a task with task = Task.init(..., output_uri=" ...
2 years ago
[WebUI-based options injection not working] Hey everyone! Since our training repo has gotten quite complex, we configure all setup in an options.yml file whi...
2 years ago
[Plot not showing up in UI when setting file_server to S3 bucket] As a somewhat in depth question, we’ve set our output_uri and file_server to an S3 bucket t...
2 years ago
Hi everyone, I’m getting an error during model upload to S3. The error shows up in the console like below and I don’t see any uploaded objects in S3: 2022-10...
2 years ago
[Potential bug where the script path option is changed for remote runs] Hi everyone! We’re still using ClearML quite a bit, usually by running the first, sma...
one year ago