Can I do it while I have multiple training runs ongoing?
Yes, but does add_external_files make chunked zips like add_files does?
OK, is the dataset path stored in Mongo?
I'm unable to find it in Elasticsearch (debug images were here)
maybe someone on your end can try to parse such a config and see if they also have the same problem
You can check out the boto3 Python client (this is what we use to download/upload all S3 stuff), but minio-client probably already uses it under the hood.
We also use the AWS CLI for some downloads; it is way faster than Python.
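For the boto3 route, here is a rough sketch of downloading everything under a prefix. The function name `download_prefix` and its arguments are made up for illustration; it only assumes an S3 client object exposing `get_paginator("list_objects_v2")` and `download_file`, which is what `boto3.client("s3")` provides.

```python
from pathlib import Path


def download_prefix(s3_client, bucket, prefix, dest_dir):
    """Download every object under `prefix` in `bucket` into `dest_dir`.

    `s3_client` is expected to behave like boto3.client("s3").
    Returns the list of local paths that were written.
    """
    dest_root = Path(dest_dir)
    downloaded = []

    # list_objects_v2 returns at most 1000 keys per call, so paginate.
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # Pages with no matching objects have no "Contents" key.
        for obj in page.get("Contents", []):
            key = obj["Key"]
            if key.endswith("/"):
                continue  # skip "directory" placeholder keys

            # Mirror the object key as a relative path under dest_dir.
            target = dest_root / key
            target.parent.mkdir(parents=True, exist_ok=True)
            s3_client.download_file(bucket, key, str(target))
            downloaded.append(target)

    return downloaded
```

You would call it as `download_prefix(boto3.client("s3"), "my-bucket", "data/", "./local")`, where the bucket, prefix, and destination are placeholders. For a MinIO server you would probably also need to pass `endpoint_url` when creating the client.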
Regarding PDFs, yes, you have no choice but to preprocess them.
WebApp: 1.16.0-494 • Server: 1.16.0-494 • API: 2.30
But be careful, upgrading is extremely dangerous
How do I get rid of this auto-appended line?
I also think that if my package manager is set to uv, it should use only uv and ignore pip entirely.
We had a similar problem. ClearML doesn't support data migration (as far as I know).
So you have two ways to fix this:
- Recreate the dataset once the data is already in Azure
- Edit each Elasticsearch file entry to point to the new destination (we did this)
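To give an idea of the second option, here is a minimal sketch of the prefix rewrite itself, applied to one document after it has been fetched. The prefixes and the document shape are hypothetical, and `rewrite_prefix` is not part of ClearML or Elasticsearch; reading the entries and writing them back is left out because it depends on your setup.

```python
def rewrite_prefix(value, old_prefix, new_prefix):
    """Return a copy of `value` with `old_prefix` swapped for `new_prefix`.

    Walks nested dicts and lists; only strings that start with
    `old_prefix` are changed, everything else is returned as-is.
    """
    if isinstance(value, str):
        if value.startswith(old_prefix):
            return new_prefix + value[len(old_prefix):]
        return value
    if isinstance(value, dict):
        return {k: rewrite_prefix(v, old_prefix, new_prefix) for k, v in value.items()}
    if isinstance(value, list):
        return [rewrite_prefix(v, old_prefix, new_prefix) for v in value]
    return value


# Hypothetical entry; real field names will differ.
entry = {"url": "s3://old-bucket/datasets/img_001.png", "size": 2048}
updated = rewrite_prefix(
    entry,
    "s3://old-bucket/",
    "azure://myaccount.blob.core.windows.net/mycontainer/",
)
```

Back up the indexes before writing anything back, and try it on a single entry first.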
It has 8 cores, so nothing fancy.
Is it possible to split the large Elasticsearch indexes? I know Elasticsearch has something called rollover, but I'm not sure that ClearML supports it.
I guess I fucked up something when moving files
I'm basically trying to force the agent to use the uv-defined Python.
We fixed the issue, thanks. We had to update everything to the latest version.
I can add "source /workspace/.venv/bin/activate" to docker_init_bash_script in clearml.conf.
However, it then tries to access pip, which I don't need. How do I disable that? I already have my packages, and uv doesn't even require pip.