The thing is - when I try to connect with normal SSH there are no issues
ssh user@ip
I’m trying to connect from Mac to Linux @<1523701070390366208:profile|CostlyOstrich36>
clearml-agent is installed on another machine in the internal network @<1523701205467926528:profile|AgitatedDove14>
Yes sure - this is what I see in the logs
> Setting up openssh-sftp-server (1:8.2p1-4ubuntu0.5) ...
> Setting up python3-distro (1.4.0-1) ...
> Remote machine is ready
> Setting up connection to remote session
> Starting SSH tunnel
@<1523701205467926528:profile|AgitatedDove14> @<1523701070390366208:profile|CostlyOstrich36> Thanks for the help
I finally figured out the issue.
It seems the configuration is cached and reused even when you change the CLI parameters.
After passing an explicit JSON configuration, I managed to run it.
It had cached my SSH parameters, and once I removed all of them it finally worked.
I now see an interesting warning:
> 2023-02-15 12:49:22,813 - clearml - WARNING - Could not retrieve remote configuration named 'SSH'
Hi @<1523704461418041344:profile|EnormousCormorant39>, is there any chance this could indeed be network-related, given that it does manage to work sometimes?
Can you add a larger portion of the log with errors?
Also, what type of machines are these? Linux to Linux?
I mean SSH through the terminal works fine.
The issue is with clearml-session.
I tried removing the username/password and remote-host yesterday, but it ended up asking me for the password when connecting and then not accepting it.
> It seems the configuration is cached and reused even when you change the CLI parameters.
@<1523704461418041344:profile|EnormousCormorant39> nice!
Yes, the configuration is cached, so that after you set it once you can just call clearml-session again without all the arguments.
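To see which arguments are being reused from the cache, something along these lines might help. It is only a sketch: it assumes the cached arguments live in `~/.clearml_session.json` (believed to be the default of the `--config-file` option, so please verify on your install), and the helper name and the list of "sensitive" key hints are made up for illustration.

```python
import json
from pathlib import Path

# Assumption: clearml-session stores its cached arguments in this JSON file.
# Verify the location on your own machine before relying on it.
ASSUMED_CACHE = Path.home() / ".clearml_session.json"

# Hypothetical hints used to decide which values should not be printed.
SENSITIVE_HINTS = ("password", "secret", "token")


def summarize_cached_config(path=ASSUMED_CACHE):
    """Return the cached settings as a dict, masking sensitive-looking values."""
    path = Path(path)
    if not path.exists():
        return {}  # nothing cached yet
    with path.open() as f:
        data = json.load(f)
    if not isinstance(data, dict):
        return {}  # unexpected layout, do not guess
    return {
        key: "***" if any(hint in key.lower() for hint in SENSITIVE_HINTS) else value
        for key, value in data.items()
    }
```

Calling `summarize_cached_config()` and printing the result should show whether an old gateway or username is still being picked up.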
What was the actual issue? Should we add something to the printout?
Hmm, any suggestions on making it more visible, or on the interface? (I mean, deleting the cache file is always a solution, but it sounded quite painful to debug, hence the question.)
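For the "delete the cache file" route, a slightly safer variant is to move the file aside instead of deleting it outright. A minimal sketch, again assuming the cache is `~/.clearml_session.json` (please verify); `reset_cached_config` is a hypothetical helper, not part of clearml-session.

```python
import shutil
from pathlib import Path

# Assumption: same cache location as clearml-session's default --config-file.
ASSUMED_CACHE = Path.home() / ".clearml_session.json"


def reset_cached_config(path=ASSUMED_CACHE):
    """Move the cached config to '<name>.bak' and return the backup path.

    Returns None when there is no cache file to reset.
    """
    path = Path(path)
    if not path.exists():
        return None
    backup = path.with_name(path.name + ".bak")
    # Moving rather than deleting keeps the old settings around for comparison.
    shutil.move(str(path), str(backup))
    return backup
```

After running it, the next clearml-session call should start from the CLI arguments alone, and the `.bak` file can be compared against them if something still looks off.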
Sometimes it works fine, but sometimes I get this error message
@<1523704461418041344:profile|EnormousCormorant39> can I assume there is a gateway at --remote-gateway <internal-ip>?
Could it be that this gateway has some network firewall blocking some of the traffic?
If this is all on a local network, why do you need to pass --remote-gateway?
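One quick way to check the firewall theory is to test whether a plain TCP connection to the gateway opens at all, from the same machine that runs clearml-session. This is a generic sketch using only the Python standard library; the host and port are placeholders, so substitute the gateway address and whichever SSH port your session actually reports.

```python
import socket


def can_reach(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened within timeout."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

If `can_reach("<internal-ip>", <port>)` flips between True and False across runs, that points at the network rather than at the cached configuration. A successful TCP connection only shows the port is open; it does not prove SSH authentication will succeed.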
> 2023-02-15 12:49:22,813 - clearml - WARNING - Could not retrieve remote configuration named 'SSH'
This is fine, it means it uses the default identity keys
> The thing is - when I try to connect with normal SSH there are no issues
Now I'm lost. So when exactly do you see the issue?