
Reputation
Badges 1
55 × Eureka!I don't think I am on the pro version, is that a paid one?
configurations:
extra_clearml_conf: 'sdk.aws.s3.region="us-west-2"
agent.extra_docker_arguments=["--shm-size=90g"]
agent.extra_docker_shell_script=["git config --global credential.helper cache --timeout=604800",]'
extra_trains_conf: ''
extra_vm_bash_script: ''
queues:
gcp-v100:
- - gcp-v100
- 4
gcp-l4:
- - gcp-l4
- 4
gcp-cpu:
- - gcp-cpu
- 4
resource_configurations:
gcp-v100:
...
That is the configuration yaml.
I didn't write this conf, but it works.
I see. So, is it same thing when network is slow and there is a mistake in url?
You are right. My colleague wrote it I think starting from the aws autoscaler.
Yeah actually you are right. I can report the stats as a table, not the whole data.
Not this key pair. I removed all my previous key pairs and created a new one
I have started the autoscaler with python? @<1523701070390366208:profile|CostlyOstrich36>
import yaml
from clearml.automation.auto_scaler import AutoScaler, ScalerConfig
from gcp_driver import GCPDriver
with open('gcp_autoscaler.yaml') as f:
conf = yaml.load(f, Loader=yaml.SafeLoader)
driver = GCPDriver.from_config(conf)
conf = ScalerConfig.from_config(conf)
autoscaler = AutoScaler(conf, driver)
autoscaler.start()
That is the python code.
@<1523701070390366208:profile|CostlyOstrich36>
I don't say I am expert in this thing, but does the autoscaler have different version than the clearml?
@<1523705004920147968:profile|CloudySwallow27>
At least I can do that along with matplotlib
Another very related question is, does uncommitted changes work for sub modules as well? I mean, when there is a directory from a different repository cloned as a submodule.
@<1576381444509405184:profile|ManiacalLizard2>
But, it uses the committed changes instead of these values.
They are actually from tracked files. Actually, I get the uncommitted changes under Execution
tab.
deployed it myself. It worked fine before I changed to ubuntu24.04 yesterday. We have been using clearml for years this way
report_table seems to be the most straight forward without matplotlib integration. Do you think it has plotting features?
The worker machines are on gcp
clearml==1.14.1
That is the version.
It is an autoscaler for gcp. I think, there are unnecessary configs that were used in aws.
they are different tasks. I start a new task but it can be same commit sometimes.
Sorry, It just worked now. I think It was slow internet connection issue. It just went away today.
Oh, I think that is for a very small data. I don't think it works for me.
@<1523701070390366208:profile|CostlyOstrich36> I don't get that.
It works on other machines. Can I clearml-init on a virtual environment? I installed clearml in a virtual environment.