
SubstantialElk6
117 Questions, 310 Answers
Active since 10 January 2023
Last activity: one year ago
Reputation: 0
Badges 1
282 × Eureka!
Hi, I upgraded some of my k8s glue to use the latest clearml-agent 1.1.0 and received this error when the agent pulls a task. Pulling task xxxbbbxxxbb launchin...
3 years ago
Hi, how can I make a stage in a ClearML pipeline non-blocking? The scenario is that stages downstream need runtime info from the first stage, however the f...
2 years ago
Hi, we have recurring disk space issues on our ClearML server (a drop of many GB in a few days). After some analysis, we noted /opt/clearml/data/elastic_7 to b...
3 years ago
Hi, I have a future roadmap question on clearml-datasets. The current implementation works well for small datasets but it's rather ineffective for very large...
3 years ago
Hi, in your latest changelog there's a new function, Task.launch_multi_node(), for distributed experiment execution. In the context of using it with K8S glue, wi...
one year ago
Current configuration (clearml_agent v0.17.2rc4, location: /root/clearml.conf): ---------------------- agent.worker_id = dgxstation-2:gpu3 agent.worker_name ...
4 years ago
Hi, I'm using ClearML datasets. How do I tell from the ClearML UI which dataset version I am using?
3 years ago
Hi, I was adding data using clearml-data and consistently get the following errors. Retrying (Retry(total=237, connect=237, read=240, redirect=240, status=240)...
2 years ago
Hi, can I do a quick check if all the documentation I find on TRAINS is still valid for ClearML? Specifically, I am looking at integration of ClearML and Ku...
4 years ago
Hi, I have this Python package that's located on my base image (e.g. /code/app/flair). Within the folder there's a package called flair and a data.py file....
4 years ago
Hi, is it possible to have a compatibility matrix between the ClearML SDK, ClearML agent, ClearML server, etc.? Recently we updated the ClearML SDK without updat...
2 years ago
Hi, what is the priority order of the configuration? Which one takes precedence? For example, output_uri default_output_uri in clearml.conf on client files_ser...
2 years ago
Hi, is there a way I can supply credentials to clearml-data (CLI and Python) without going through clearml.conf?
3 years ago
Hi, I have been getting the following for a while. Is there a more detailed log I can look into? This happens on both https and http. 2021-05-27 08:47:02,539...
3 years ago
Hi, I would like to check what the recommended hardware specs would be for the server hosting ClearML server. I had one configured with 32 CPU cores, 64GB RAM a...
4 years ago
clearml-agent doesn't seem to take the CA store from the OS. Where can I point ClearML to the CA certs, in particular for uploading models to S3? At the ...
3 years ago
Hi, can I ask how ClearML supports distributed training via K8SGlue? Kubeflow Operator supports distributed training on a Kubernetes cluster, managing the pods se...
4 months ago
Hi, we are planning to move to OpenShift. Can I ask if k8s-glue supports OpenShift?
3 years ago
Hi, we would like to incorporate an approval process in ClearML. One of the needs is to attach some PDFs and Word docs to a published experiment, preferably...
2 years ago
Hi, can you run ClearML experiments on Docker images that do not have root?
2 years ago
Hi, my DevSecOps team has raised some issues with us deploying ClearML. In particular, they are not happy with the docker.sock configuration as it would po...
4 years ago
Hi, I was trying out the steps in this ( https://github.com/allegroai/clearml/blob/master/docs/clearml-task.md ) and encountered the following error. Environ...
4 years ago
Hi, I was using the K8S Glue and it worked fine on one project but didn't work on another. At the point just before a git clone was executed, I get the error...
4 years ago
Hi, I have a question about clearml-data. clearml-data probably does well on data versioning, but when it comes to actually loading data, are there examples...
3 years ago
Hi, I'm trying to understand clearml-session. I have an agent running on a machine monitoring a queue. Then I ran clearml-session --queue myqueu --docker torch-im...
3 years ago
Hi, I was running my agent and had a few scripts for agent.extra_docker_shell_script, but when I looked through the logs, they were not executed. Any idea wh...
3 years ago
I'm just getting this in my agent run task. I would appreciate it if someone could advise where externalrequirement is pointing. RequirementsManager handler rais...
3 years ago
Hi, I am trying to use clearml-data to upload my data to S3, which is password protected. How should I indicate the credentials after I set --storage s3://.....
4 years ago
Hi, I'm getting this long error when running task.execute_remotely(queue_name="1gpu", exit_process=True) . I also noticed an error Failed to fetching activit...
3 years ago
Hi, we noted that using K8S glue, there are some situations where the task cannot be registered as error and will be stuck at pending. An example of one situ...
3 years ago