Because setting env vars and ensuring they exist on the remote machine during execution etc is more complicated ๐
There are always ways around, I was just wondering what is the expected flow ๐
Ah, the API server /users.get_all
, I see!
It's of course not an MLOps issue so I understand it's not high on the priority list, but would be kinda cool to just have a simple view presenting the content of users.get_all
๐
From the log you shared, the task is picked up by theย
worker_d1bd92a3b039400cbafc60a7a5b1e52b_4e831c4cbaf64e02925b918e9a3a1cf6_<hostname>:gpu0,1
ย worker
I can try and target the default one if it helps..?
We're using self hosted account
Thanks SuccessfulKoala55 and AgitatedDove14 ! We'll go through the hoops of setting up mongo on AWS then.
We're working to decouple the data from the helm chart, seems like a dangerous idea to store long term data on k8s in case of failure ๐
I realized it might work too, but looking for a more definitive answer ๐ Has no-one attempted this? ๐ค
TimelyPenguin76 that would have been nice but I'd like to upload files as artifacts (rather than parameters).
AgitatedDove14 I mean like a grouping in the artifact. If I add e.g. foo/bar
to my artifact name, it will be uploaded as foo/bar
.
Btw TimelyPenguin76 this should also be a good starting point:
First create the target directory and add some files:sudo mkdir /data/clearml sudo chmod 777 -R /data/clearml touch /data/clearml/foo touch /data/clearml/bar touch /data/clearml/baz
Then list the files using the StorageManager. It shouldn't take more than a few miliseconds.` from clearml import StorageManager
%%timeit
StorageManager.list("/data/clearml")
-> 21.2 s ยฑ 328 ms per loop (mean ยฑ std. dev. of 7 runs, 1 loop each) `
We have a mini default config (if you remember from a previous discussion we had) that actually uses the second form you suggested.
I wrote a small "fixup" script that combines this default with the one generated by clearml-init
, and it simply does:def_config = ConfigFactory.parse_file(DEF_CLEARML_CONF, resolve=False) new_config = ConfigFactory.parse_file(new_config_file, resolve=False) updated_new_config = ConfigTree.merge_configs(new_config, def_config)
When is the next release expected? ๐
Alternatively, it would be good to specify both some requirements and auto-detect ๐ค
It's self-hosted TimelyPenguin76
SuccessfulKoala55 WebApp: 1.4.0-175 โข Server: 1.4.0-175 โข API: 2.18
We're using the example autoscaler, nothing modified
I have seen this quite frequently as well tbh!
Will try!
Curious - is there a temporary changelog for 1.2.0? ๐ Always fun to poke at the upcoming features
EDIT: Wait, should the clearml RC be installed outside the venv for the agent as well?
Another side effect btw is that some of our log files (we add a file handler to the logger) end up at 0 bytes. This specifically happens with Ray and ClearML and does not reproduce locally
Without knowing anything, I'm assuming maybe ClearML patches plt.title
and not Axes.set_title
?
I'll try it out, but I would not like to rewrite that code myself maintain it, that's my point ๐
Or are you suggesting I Task.import_offline_session
?
Odd; switching to virtual environment results infatal: could not read Username for '
': terminal prompts disabled
even though it does earlier show that:agent.git_user = xxx
Sorry, not necessarily RBAC (although that is tempting ๐ ), but for now was just wondering if an average joe user has access to see the list of "registered users"?
This also appears in the error log:
` StorageManager.download_folder(cache_dir.as_posix(), local_folder=".")
File "/home/idan/.clearml/venvs-builds/3.7/lib/python3.7/site-packages/clearml/storage/manager.py", line 278, in download_folder
for path in helper.list(prefix=remote_url):
File "/home/idan/.clearml/venvs-builds/3.7/lib/python3.7/site-packages/clearml/storage/helper.py", line 596, in list
res = self._driver.list_container_objects(self._container, ex_prefix=prefix)
Fi...
Any thoughts @<1523701070390366208:profile|CostlyOstrich36> ?
I wouldnโt want to run the entire notebook, just a specific part of it.
I can elaborate in more detail if you have the time, but generally the code is just defined in some source files.
Iโve been trying to play around with pipelines for this purpose, but as suspected, it fails finding the definition for the pickled objectโฆ
Iโll give the create_function_task
one more try ๐ค
Yes, thanks AgitatedDove14 ! It's just that the configuration
object passed onwards was a bit confusing.
Is there a planned documentation overhaul? ๐ค