Hi, I'm using the k8s glue and have a few questions. Noted that it's not requesting the http://nvidia.com/gpu thus the pod created doesn't have a GPU resourc...
4 years ago
Btw: There seems to be no support for videos in tensorboard/experiment view (e.g. https://tensorboardx.readthedocs.io/en/latest/tensorboard.html#tensorboardX...
4 years ago
Hello, I have an error while installing git dependencies of local package: So far I used task. update _requirements(“[.]“) with my local package referencing ...
4 years ago
Another strange behavior of the python SDK CLI: after executing python my_task.py, where my_task.py creates and send to the queue an experiment, the command ...
4 years ago
Hi all! Please tell me there are examples of ClearML and pytorch-lightning integration
4 years ago
Hi, I have a self-hosted instance running quite well, pretty good job. I'm wondering if there is any way to have a read-only user? Is it available in the api...
4 years ago
Hi friends! I'm trying to upgrade the https://aws.amazon.com/marketplace/pp/B085D8W5NM AMI over to ClearML. The steps seem easy enough, just docker compose d...
4 years ago
Hey all. I'm seeing a strange error when trying to run hyperparameter optimisation by cloning a base training task Action failed <500/0: tasks.clone/v1.0 (ke...
4 years ago
Hi people, are there any other good options to view a csv table on the ClearML UI other then the artifacts preview?
4 years ago
Hi, I would like to pass in some pip arguments that clearml-agent would include when setting up the venv on the containers. How should I specify this? The ar...
4 years ago
Hey there, I would like to increase the ulimit for the number of files opened at the same time in a ec2 instance. According to this https://stackoverflow.com...
4 years ago
https://github.com/allegroai/clearml/blob/master/examples/optimization/hyper-parameter-optimization/base_template_keras_simple.py Hi, I am running this code ...
4 years ago
Hello Everyone, I deployed clearml ( allegroai/clearml:017 ) on a kubernetes cluster in it works fine. I tried to limit access as mentioned here : https://al...
4 years ago
Hi, I am looking to upload "already trained models" as experiments in my ClearML Server. How should I go about doing that? ClearML picks up the Tensorboard a...
4 years ago
Hi, we are running on disconnected on prem with a k8s glue. When a pod is spawned, we noted that an apt-get command is performed on the pod. SHort of changin...
4 years ago
Hi! I have a question concerning dynamic environment variables. I managed to create some env variables from the apiserver.conf and now I would like to set so...
4 years ago
Hi, we are having an interesting issue here. We serve many users and each user has their own credentials in accessing the private git repo. We can't seem to ...
4 years ago
Hello everyone, I have a question about SSH/credentials: Let's say I have multiple users / multiple ssh credentials that I do not want to share with the clea...
4 years ago
hey, I have a question: I got the following message: trains_agent: ERROR: Failed getting token (error 401 from http://192.168.40.210:8008 ): Unauthorized (in...
4 years ago
I have a notebook which is uncommited. It is being run on a remote machine with clearml-agent through clearml-session. Everything with newest versions, serve...
4 years ago
I wonder if there is a way to setup Task such that it would raise an error if the env where execution happens is not configured to track things on our custom...
4 years ago
Hello guys 🙂 I have a question about parameters & Controller. From a Controller instance I would like to clone a task, and set some update parameters with s...
4 years ago
Feature request: group series in the Plots section like in the Scalars section. I'd like to group PR curves from different iterations. That's it 🙂
4 years ago
Hi all, I am getting a bunch of this kind of log messages "clearml.storage - INFO - Starting upload: /tmp/.clearml.upload_model_6ou50pb1.tmp =>" I am pretty ...
4 years ago
multiprocessing.pool.RemoteTraceback: """ Traceback (most recent call last): File "/usr/lib/python3.6/multiprocessing/pool.py", line 119, in worker result = ...
4 years ago
Fyi: Conda installation of PyTorch is broken again. My old tasks which worked before now fail since they do not find torch. However, I can see in the executi...
4 years ago
Hi! I deployed clearml server along with jupyterhub on Azure K8s (AKS). The way it works is that every user is assigned a new pod that is spawned with a dock...
4 years ago
Hey, I see this in between my training epochs, what could be causing this? Because I see no affect of the following INFO on the training or reporting to Clea...
4 years ago
Hello! I have started a reddit discussion that is gaining some momentum: https://www.reddit.com/r/MachineLearning/comments/mfca0p/d_whats_the_simplest_most_l...
4 years ago
Hey! I'm trying to play with the clearml-session , I started it on an existing queue in my hosted environment, and I see the task running without any errors....
4 years ago