Hi, I have a few questions regards to Publish Experiment , I would like to understand what is the use case for it? It turns a experiment into read-only(view)...
4 years ago
Is there a way to reset the clearml workspace?
one year ago
Hello everyone, is the logging of "AutoModelForCausalLM" models supported? Even with manually logging of Llama-2-7b-hf it does not appear in the model list (...
one year ago
I keep getting errors when trying to compare a lot of experiments at the same time (>10). what's evern worse is that trains start working much slower in gene...
5 years ago
Are the env variables passed to trains-agent available in experiments run by this trains-agent?
5 years ago
Hi, if I am starting my training with the following command: python -u -m torch.distributed.launch --nproc_per_node=2 --use_env train.py --config configs/tra...
3 years ago
[Auto scaler / API client does not see tasks in queue] We had used the AWS auto scaler (based on the aws_autoscaler.py script in the repo) and it worked grea...
2 years ago
where I can find logs of clearML server which should go into ELK (elastic search ) for security.
2 years ago
I tried using clearml.automaton.TriggerScheduler in the following way: """Add a trigger to test the model once it is published.""" from clearml.automation im...
11 months ago
Hello! I add to inject the configuration into clearml with task.connect_configuration( http://OmegaConf.to _container(cfg, resolve=True)) because I use hydra...
4 years ago
Hi Can i ask how ClearML support distributed training via K8SGlue? Kubeflow Operator support distributed training on Kubernetes cluster, managing the pods se...
11 months ago
Hi, can you help with this one: clearml_agent: ERROR: [Errno 2] No such file or directory: '/home/dhzus/.clearml/venvs-builds/3.10/bin/python The agent can't...
one year ago
Hi, what is the expect input of the series parameter of the logger.report_line_plot function? I am passing list of two lists (like [[1,2,3], [4,5,6]] ) to pl...
2 years ago
Hello, I'm a bit lost in the docs for the MLOps, I have script which already integrate clearml logging, should I use clearml-task to launch it on an agent ? ...
4 years ago
Hey, did someone saw this error? trains.Metrics - WARNING - Retrying, previous request failed : Could not find a suitable TLS CA certificate bundle, invalid ...
5 years ago
Hello everyone, is there a way to log scalars that were already written in a tensorboard file other than iterating over all values?
2 years ago
ClearML tracks the executed .py file, so you can re-run the experiment. But... - With Metaflow DAGs, you pass a run argument to your training script, e.g. py...
2 years ago
Hi there, would it be possible to add some Neural Architecture Search example, as for the HyperParameter Optimizer examples?
4 years ago
hey guys, do you have any plans to add functionality to export training config with all hyperparameters to the different formats, such as training command li...
5 years ago
Hello, I’m trying to update our ClearML server running on kubernetes (1.6.0-213) but I get this error: > helm upgrade clearml allegroai/clearml --namespace c...
2 years ago
clearml/automation/cloud_driver.py has bash_script_template and docker support is hardcoded >>python -m clearml_agent --config-file ~/clearml.conf daemon --q...
3 years ago
Hi, in the context of multi-gpu training, is Model.get_local_copy() multi-process safe? or should make sure only the first process calls it first, then others
3 years ago
apart from the server migration doc , is there another upgrade guide somewhere from trains to clearml?
4 years ago
Hi Could someone please point me to a page that shows how to set up google cloud storage so I can use it to push/pull data from it using clearml-data ? thanks
3 years ago
please tell me, no one faced the problem that clearml does not intercept tb images? although these images are in tb and the following errors appear in the te...
3 years ago
Hi - you're all welcome to join for updates concerning the ClearML SaaS Free/Pro Service
3 years ago
I have a problem: I’m upload my model weight on a self hosted clearml server, using the update_wights method. I was thinking that would delete the old weight...
4 months ago
Hello, I have an issue with self hosted clearml server, everything was running find but since today i get the following error: clearml.Metrics - ERROR - Acti...
2 years ago