Which storage are you using? ClearML files server?
Can you verify the paths you are using in your script?
Trains does patch the torch save function 🙂
If you like, you can save the model for each epoch by giving each one a unique name. The model will be saved to the `output_uri` path you set in the `Task.init` call.
For example, this code will save a model for every epoch:

```python
for epoch in range(num_of_epoch):
    # save a checkpoint with a unique name per epoch
    torch.save(model, "model_epoch_{}".format(epoch))
```
Hey SubstantialElk6 ,
You can try adding environment vars with that info:
```python
os.environ["CLEARML_API_HOST"] = api_server
os.environ["CLEARML_WEB_HOST"] = web_server
os.environ["CLEARML_FILES_HOST"] = files_server
os.environ["CLEARML_API_ACCESS_KEY"] = access_key
os.environ["CLEARML_API_SECRET_KEY"] = secret_key
```
Hi UnsightlyShark53 ,
Trying to understand the scenario: you want the model to be saved in the `trains_storage` dir, but `trains` saves it in `trains_storage/trains_storage`? Or does `torch.save` not save to the path at all?
Hi LethalCentipede31
You can report plotly with `task.get_logger().report_plotly`, like in https://github.com/allegroai/clearml/blob/master/examples/reporting/plotly_reporting.py
For seaborn, once you call `plt.show` the figure will show up in the UI (example: https://github.com/allegroai/clearml/blob/master/examples/frameworks/matplotlib/matplotlib_example.py#L48 )
Hi PanickyMoth78 ,
Can you try with `pip install clearml==1.8.1rc0`? It should include a fix for this issue
Hi UnevenDolphin73 ,
which agent version are you using? Did you set up the env variable on the agent’s machine too?
- Can you set the env var `CLEARML_DOCKER_SKIP_GPUS_FLAG` to `true`?
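For example, on the agent’s machine before starting the agent (the queue name in the comment is a placeholder):

```shell
# make clearml-agent skip docker's --gpus flag
export CLEARML_DOCKER_SKIP_GPUS_FLAG=true
# then start the agent as usual, e.g.:
# clearml-agent daemon --queue default --docker
echo "$CLEARML_DOCKER_SKIP_GPUS_FLAG"  # prints: true
```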
Regarding this - https://clearml.slack.com/archives/CTK20V944/p1657525402861009?thread_ts=1657291641.224139&cid=CTK20V944 - can you add some more info? maybe the log?
So according to it, you are using the repo requirements, and you have torch there?
the controller task? same as here - https://github.com/allegroai/clearml/blob/master/examples/pipeline/pipeline_controller.py
Hi JitteryCoyote63 ,
thanks for reporting it, I was able to reproduce the issue, will update here once a fix is out 🙂
```python
logger.report_matplotlib_figure(title="some title", series="some series",
                                figure=fig, iteration=1, report_interactive=False)
```
Hi CooperativeSealion8 ,
`trains` is configured according to the `~/trains.conf` file; in this file you should define the app, api and files servers.
You can do it with our great wizard, just type `trains-init` in your CLI and follow the instructions:
```
❯ trains-init
TRAINS SDK setup process
Please create new trains credentials through the profile page in your trains web app (e.g. )
In the profile page, press "Create new credentials", then press "Copy to clipboard".
Paste cop...
```
We can certainly add a brief about the `trains.conf` file, thanks for the feedback 🙂
Thanks for the answer. So, for example (to make sure I understand): with the example you gave above, when I print the config I’ll see the newly edited parameters?
Correct
What about the second part of the question, would it be parsed according to the type hinting?
It should
DepressedChimpanzee34 how do you generate the task that’s running remotely? Once the agent has pulled the task, that is your running configuration (it will pull the same configuration from the server as you see in the UI)
`python invoked oom-killer` means the process ran out of memory. CloudySwallow27, in the scaler app task, can you check if you have scalars reporting?
Hi RoundMosquito25 ,
Are you running your project as part of a git repository? If so, you can just add a `Task.init()` call in the main script you are running (e.g. your train.py / main_file.py file) and everything should be logged automatically.
For reporting artifacts ( https://clear.ml/docs/latest/docs/guides/reporting/artifacts ), you can use your task object like so:
```python
task = Task.init(project_name="My project", task_name="My task")
# ...
task.upload_artifact(
    'my artifact name',
    artifact_object=my_object)  # my_object is whatever object you want to store
```
If you want to clear the parameters, you can try overriding with an empty dict:

```python
cloned_task.set_parameters({})
```
Why not using it directly from S3?
You can download it with the `StorageManager`: https://allegro.ai/clearml/docs/docs/examples/examples_storagehelper.html#downloading-a-file
I guess not many people use the local file storage
I’m using it 🙂
How can I reproduce this issue? What should I have as `cache_dir`? `~/.clearml`?
It should be fixed in one of the next versions
For general duration, the difference between the Started and Updated columns should be the total experiment duration
Hi @<1523707056526200832:profile|ScaryKoala63>,
Try using `task.upload_artifact` for manually uploading artifacts, like in this example; you can also configure the upload destination
Hi AverageRabbit65 ,
Is this part of a repository? If so, you can specify the repo in the `add_function_step` call
Hi DefeatedCrab47 , did you reload with Ctrl + F5? If not, can you try and update me?