SubstantialElk6
Moderator
117 Questions, 310 Answers
  Active since 10 January 2023
  Last activity 2 years ago

Reputation

0

Badges 1

282 × Eureka!
0 Votes 8 Answers 2K Views
I'm just getting this in my agent run task. I would appreciate it if someone could advise where externalrequirement is pointing. RequirementsManager handler rais...
4 years ago
0 Votes 2 Answers 2K Views
4 years ago
0 Votes 7 Answers 2K Views
Hi, I would like to understand more about how ClearML deals with code. I noticed that I am able to read the source code of the Python script that I have used a...
4 years ago
0 Votes 4 Answers 2K Views
Hi, I recently upgraded clearml, clearml-server, and clearml-agent. Now running the k8s glue with clearml-agent=1.0.1rc1. python3 k8s_glue_example.py --queue 1b...
4 years ago
0 Votes 3 Answers 2K Views
Hi, i was adding data using clearml-data and get the following consistent errors. Retrying (Retry(total=237, connect=237, read=240, redirect=240, status=240)...
3 years ago
0 Votes 2 Answers 2K Views
2 years ago
0 Votes 10 Answers 2K Views
4 years ago
0 Votes 3 Answers 2K Views
Hi, I've multiple tasks setup in a complex pipeline. How can I; Define prior to running the pipeline, which tasks to be running on which remote queue using w...
3 years ago
0 Votes 1 Answers 2K Views
Hi, in your latest changelog. There's a new function. Task.launch_multi_node() for distributed experiment execution In the context of using with K8S glue, wi...
2 years ago
0 Votes 5 Answers 2K Views
Hi, how might i use the SDK to pull parameters of the agent's clearml.conf into my code during runtime? For example, if i wish to pull the configuration for ...
4 years ago
0 Votes 2 Answers 2K Views
Hi, we have been using ClearML in our development environment to train our models and benchmark them. I was wondering what ClearML's role is in transition...
4 years ago
0 Votes 23 Answers 2K Views
Hi i saw this on the clearml-agent docs but other than the docker image, i'm not sure how to integrate this with clearml py and clearml-server. Please advise...
4 years ago
0 Votes 3 Answers 2K Views
Hi, is there a way I can supply credentials to clearml-data (cli and python) without going thru the ClearML.conf?
4 years ago
0 Votes 11 Answers 2K Views
Hi, we have recurring disk space issues on our ClearML server (Drop of many GB in a few days). After some analysis, we noted /opt/clearml/data/elastic_7 to b...
3 years ago
0 Votes 11 Answers 2K Views
Hi, can I do a quick check on whether all the documentation I find on TRAINS is still valid for ClearML? Specifically, I am looking at integration of ClearML and Ku...
4 years ago
0 Votes 9 Answers 2K Views
Hi, just to check. Does the k8s glue install torch by default? I'm getting Warning: could not resolve python wheel replacement for torch==1.8.0 even though i...
4 years ago
0 Votes 7 Answers 2K Views
Hi, i was using the K8S Glue and it worked fine on one project but didn't work on another. At the point just before a git clone was executed, i get the error...
4 years ago
0 Votes 1 Answers 3K Views
[Distributed Training] Hi, i have a ClearML setup with K8SGlue that spins up pods of 4 GPUs when picking tasks off the clearml queue. We would now want to pr...
2 years ago
0 Votes 3 Answers 2K Views
4 years ago
0 Votes 0 Answers 2K Views
4 years ago
0 Votes 0 Answers 687 Views
Hi, can I ask how ClearML supports distributed training via K8SGlue? The Kubeflow Operator supports distributed training on a Kubernetes cluster, managing the pods se...
11 months ago
0 Votes 1 Answers 2K Views
So I've installed Allegro on Kubernetes using Helm; how do I perform trains-init?
5 years ago
0 Votes 1 Answers 2K Views
clearml-agent didn't seem to take the CA store from the OS. Where can I point ClearML to the CA certs, in particular for uploading models to S3? At the ...
4 years ago
0 Votes 0 Answers 2K Views
Hi, we are working on a mini project to 'integrate' ClearML Datasets with CKAN. Wondering if the community could share some ideas.
3 years ago
0 Votes 30 Answers 2K Views
Hi, I'm getting this long error when running task.execute_remotely(queue_name="1gpu", exit_process=True). I also noticed an error Failed to fetching activit...
4 years ago
0 Votes 30 Answers 2K Views
4 years ago
0 Votes 20 Answers 2K Views
4 years ago
0 Votes 6 Answers 2K Views
4 years ago
0 Votes 3 Answers 2K Views
2 years ago
0 Votes 4 Answers 2K Views
Hi, I am trying to understand clearml-data and only found this article explaining it. https://github.com/allegroai/clearml/blob/master/docs/datasets...
4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Setting the credentials on the agent machine means the users cannot use their own credentials, since a k8s glue agent serves multiple users.

Referencing your suggestion, we can configure output_uri on task.set_base_docker() but how should we do this for the credentials?

4 years ago
0 Hi, We Have Recurring Disk Space Issues On Our Clearml Server (Drop Of Many Gb In A Few Days). After Some Analysis, We Noted

ok thanks. this would mean that increasing the disk space for my ClearML is the only option as we are not at liberty to delete.

3 years ago
0 Hi, My Devsecops Team Has Raised Some Issues Of Us Deploying Clearml For Use. In Particular, They Are Not Happy With Docker.Sock Configuration As It Would Potentially Expose The Entire Cluster To Unauthorised View. Can We Do Without It?

Hi, clearml-agent==0.17.2rc3 did work. I'm on a 1.19 k8s cluster, and I get this error when a task is pulled. Is the glue not compatible with 1.19?
` Pulling task 3a90802d1dfa4ec09fbccba0beffbaa8 launching on kubernetes cluster
Pushing task 3a90802d1dfa4ec09fbccba0beffbaa8 into temporary pending queue
Kubernetes scheduling task id=3a90802d1dfa4ec09fbccba0beffbaa8
kubectl output:
Flag --replicas has been deprecated, has no effect and will be removed in the future.
Flag --generator has been depre...

4 years ago
0 Hi, Just To Check. Does The K8S Glue Install Torch By Default? I'M Getting

Hi AgitatedDove14 , that's what i am trying to figure out as well. The task has nothing to do with torch, and the requirements.txt doesn't have any torch packages as well.

4 years ago
0 Hi, I Shifted My Clearml Setup To An On-Premise Disconnected Env, Which Has A Pip Repo Setup. I Noted This Warning,

AlertBlackbird30 , Actually the log says 10.2.
docker_cmd = nvidia/cuda:10.2-devel-ubuntu18.04 -e GIT_SSL_NO_VERIFY=true

4 years ago
0 Hi, V1 Of Agent Seems To Have Removed Agent.Package_Manager.Force_Repo_Requirements_Txt. Is This Still Available In Other Forms?

I managed to find out why. The docker image I'm using is not set to run as the root user, hence the error. But I'm wondering why this is required, as Docker best practices do indicate that we should use a non-root user on production images.

4 years ago
0 Hi, How Can I Pass An Env Variable To The Docker That'S Running The Agent When I Run This? I'M Having Issues With The Agent'S Git Clone Where It Requires Sslverification To Be Disabled. Clearml-Agent Daemon --Gpus 0 --Queue Gpu --Docker --Foreground

Executing task id [228caa5d25d94ac5aa10fa7e1d02f03c]:
repository = https://192.168.50.88:18443/tkahsion/pytorchmnist
branch = master
version_num = cfb833bcc70f3e10d3b6a96cfad3225ed682382b
tag = 
docker_cmd = nvidia/cuda:10.1-runtime-ubuntu18.04
entry_point = pytorch_mnist.py
working_dir = .
Warning: could not locate requested Python version 3.9, reverting to version 3.6
Using base prefix '/usr'
New python executable in /root/.clearml/venvs-builds/3.6/bin/python3.6
Also creating executable i...

4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

It would make sense on a very large resource cluster. Unfortunately, we have fewer than 50 GPUs to share. A multi-tenant SaaS would cut the resources into even smaller clusters and not help with efficiency. Or would you have a suggestion?

4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Hi.

We tried as advised above and it still didn't work.
Host: http://ecs.ai:443
output_uri = S3://ecs.ai:443/bucketname

This time round the client gave this error.
botocore.exceptions.ConnectionClosedError: Connection was closed before we received a valid response from endpoint URL: 'http://ecs.ai/bucketname/.clearml.test'.

It's quite apparent that whatever ClearML passed to boto3 ends up as an HTTP call instead of HTTPS, which is wrong.
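
For what it's worth, the host quoted above pairs the http scheme with port 443, which is conventionally the HTTPS port. A minimal sketch of a sanity check for that kind of mismatch, using only the Python standard library; the helper name `scheme_port_mismatch` is hypothetical and this says nothing about how ClearML or boto3 actually resolve the endpoint:

```python
from urllib.parse import urlsplit

# Conventional default ports; used only to flag a suspicious scheme/port pairing.
DEFAULT_PORTS = {"http": 80, "https": 443}


def scheme_port_mismatch(endpoint: str) -> bool:
    """Return True if the URL uses one scheme with the other scheme's default port."""
    parts = urlsplit(endpoint)
    if parts.scheme not in DEFAULT_PORTS or parts.port is None:
        # Unknown scheme or no explicit port: nothing to compare.
        return False
    other = "https" if parts.scheme == "http" else "http"
    return parts.port == DEFAULT_PORTS[other]


if __name__ == "__main__":
    for url in ("http://ecs.ai:443", "https://ecs.ai:443"):
        verdict = "suspicious" if scheme_port_mismatch(url) else "consistent"
        print(url, "->", verdict)
```

A check like this only flags the configuration string; whether the server on that port really expects TLS still has to be confirmed separately.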

4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Going back to the open source, I think that adding the credentials as part of the source code might allow to have "credentials" auto populate as part of the remote execution, wdyt?

Not sure how this will work when I can't supply the credentials to ClearML programmatically.

4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Do you have more info on vault?
Actually, it only makes sense if the entire department or organisation is saving its models in a common repo. In our case this is not possible due to client security (e.g. training data from clients can potentially be 'reverse engineered' from trained models in future). So each department, and even each project, will need its own repo.

4 years ago
0 Hi, I Have Been Getting The Following For A While. Is There A More Detailed Log I Can Look Into? This Happens On Both Https And Http.

Ok thanks, that explains a lot. We have been doing this wrongly the whole time, thinking that the clearml.conf on the client side would be acknowledged by the remote agent execution. In reality, only the API section is utilised.

4 years ago
0 Hi, I Am Running Several Python Scripts But All For The Same Project/Task. Is It Possible To Task.Init To Existing Running/Completed Task And Adding On The Results?

Hi,
basically I ran this block first and ended the script.
task = Task.init(project_name="afro-nmt", task_name=args.taskname, continue_last_task=args.taskid)
Logger.current_logger().report_scalar(title="BLEU",series="JW300",value=args.jwbleu, iteration=args.lastiter)
Then I ran another script, with a different series.
` task = Task.init(project_name="afro-nmt", task_name=args.taskname, continue_last_task=args.taskid)
Logger.current_logger().report_scalar(title="BLEU",series="SS900",value=arg...
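
The two-script pattern described above (each script attaching to the same task and reporting a different series under one title) could be sketched roughly as follows. The helper names `scalar_kwargs` and `report_to_existing_task` are hypothetical, and the sketch assumes `continue_last_task` is given the ID string of the task to continue, as in the snippet above:

```python
def scalar_kwargs(series: str, value: float, iteration: int) -> dict:
    """Build the keyword arguments for report_scalar under one shared title."""
    return {"title": "BLEU", "series": series, "value": value, "iteration": iteration}


def report_to_existing_task(task_id: str, task_name: str, series: str,
                            value: float, iteration: int) -> None:
    """Attach to a previously created task and add one scalar point to it."""
    # Imported lazily so scalar_kwargs can be used without clearml installed.
    from clearml import Task

    task = Task.init(
        project_name="afro-nmt",
        task_name=task_name,
        continue_last_task=task_id,  # assumed: ID string of the task to continue
    )
    task.get_logger().report_scalar(**scalar_kwargs(series, value, iteration))
    task.close()
```

Each script would then call `report_to_existing_task` with the same task ID and its own series name, e.g. "JW300" in the first script and "SS900" in the second.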

4 years ago
0 Hi, Is There Any Code Examples Of How Dataops Is Being Established?

The first is probably done using pipeline controllers, the second using Datasets or HyperDatasets. It's not very clear how the last one is achieved, especially the searchable data catalogs.

4 years ago
0 Hi, We Are Having Issues With Clearml-Session For Vscode. Apparently It'S Hardcoded To Download From

The agent is running on a disconnected server in docker mode. I have a client that runs clearml-session, and I saw from the agent's logs that the installation of vscode fails.

4 years ago