Reputation
Badges 1
11 × Eureka!Yes. CostlyOstrich36
I’m running on an on-prem machine, and trying to automate the whole process of training. meaning bringing the data -> creating the dataset -> running the training task.
When creating a dataset and adding files it seems there is a Task being created in the background (which is awesome).
My question is - can I run this task on a remote machine instead of the machine which i’m running the CLI on?
I finally figured out the issue.
It seems like the configuration is cached in a way even when you change the CLI parameters.
After adding explicit JSON with configuration I managed to run it
@<1523701205467926528:profile|AgitatedDove14> @<1523701070390366208:profile|CostlyOstrich36> Thanks for the help
It cached my SSH parameters and finally after removing all of them it worked
Yes sure - this is what I see in the logs
> Setting up openssh-sftp-server (1:8.2p1-4ubuntu0.5) ...
> Setting up python3-distro (1.4.0-1) ...
Remote machine is ready
Setting up connection to remote session
Starting SSH tunnel
I see now an interesting warning
2023-02-15 12:49:22,813 - clearml - WARNING - Could not retrieve remote configuration named 'SSH'
The thing is - when I try to connect with normal SSH there are no issues
ssh user@ip
I’m trying to connect for Mac to Linux @<1523701070390366208:profile|CostlyOstrich36>
Clearml-agent is installed on another machine in the internal network @<1523701205467926528:profile|AgitatedDove14>
File "examples/notebook.ipynb", line 260, in <module> "execution_count": null, NameError: name 'null' is not defined
Hey @<1523701070390366208:profile|CostlyOstrich36> , do I need to set those up on the agent's machine or pass them to the task?
I mean SSH through the terminal works fine.
The issue is with Clearml-session.
I tried to remove the username/password and remote-host yesterday but it ended up asking me for the password when connecting and not accepting it.