And how is the model being saved?
Also a small clarification:
ClearML doesn't build the docker image itself. You need to have a docker image already built to be used by ClearML
TartSeagull57 , you said the problem was with automatic reporting. Can you give an example of how you solved the issue for yourself?
AbruptWorm50 , it looks like the application issue was solved for us 🙂
ContemplativeGoat37 , Hi 🙂
You can do the following configuration in your ~/clearml.conf
sdk.development.default_output_uri: "
s3://my_bucket/ "
Hmmm, looking at F12 it appears to be version/build/api numbers
VexedCat68 , correct. But not only arg parse. The entire configuration section 🙂
I'm not sure, maybe @<1523701087100473344:profile|SuccessfulKoala55> might have an idea 🙂
Also, can you try with dataset.upload(output_url="/home/user/server_local_storage/clearml_training_dataset/")
(note the added '/' at the end of the line)
GiganticTurtle0 , are you using the latest release or the RC?
GiganticTurtle0 , I tried running the same script as before and added dataset.get_local_copy()
at the end and it managed to work fine. Do you have any other changes? Are you on the latest repo code?
I think that might be the issue. Transfering from pip to Conda package managers can sometimes be problematic. Try to manually edit the requirements to reflect the settings in https://pytorch.org/
Hi OddShrimp85 ,
Please note that Datasets in this context are part of the HyperDatasets feature. This is an advanced feature for managing unstructured data. So in context of HyperDatasets, a Dataset is a collection of DatasetVersions that are structured by some logic within.
Hi RotundSquirrel78 , can you try clearing local cache? For me everything is showing properly
Hi ShallowGoldfish8 ,
You can get specific chunks/files using the part
argument:
https://clear.ml/docs/latest/docs/references/sdk/dataset#get_local_copy
Hi ObedientToad56 , you can simply delete all of them since it's only cache. It's safe to delete cache 🙂
Hi SlimyDove85 ,
Regarding the easiest method to track models - you can try tagging them.
Can you please elaborate on your use case?
PanickyMoth78 , let me check on that 🙂
PanickyMoth78 , please try with us-east-1a
Hi AbruptHedgehog21 , what are you trying to do when you're getting this message? Are you running a self hosted server?
Can you seee if there are errors in the apiserver?
Hi @<1673501379764686848:profile|VirtuousSeaturtle4> , what do you mean? Connect to a server someone else set up?
It's a way to execute tasks remotely and even automate the entire process of data pre processing -> training -> output model 🙂
You can read more here:
https://github.com/allegroai/clearml-agent
Hi @<1603560525352931328:profile|BeefyOwl35> , can you please elaborate on what you mean by running the build command?
Can you add a full log?
Not from the top of my head, let me take a look 🙂
Hi AttractiveCockroach17 , in the first question - clearml
captures the packages used during the run. What does your script use and what does clearml
capture when running locally on your machine?
You can configure the clearml
to capture your entire environment as well.
Regarding 2:
Can you please expand on the entire process?