Hi ColossalReindeer77 , you can run the agent in --debug
mode to get more info
In general, the agent will simply use git to clone the repository, so the local system settings should apply
Hei SuccessfulKoala55 it just worked. Maybe there was some github refresh delay … not sure but thanks anyways for the debug
suggestion. 👍
Hi SuccessfulKoala55 it’s failing again.. I haven’t rebooted the agent or changed anything and I am able to connect with ssh with ssh -vT
git@github.com on a different tmux sess.
This is the error I am seeing running the agent with the -debug
flag:
Using cached repository in "/home/ubuntu/.clearml/vcs-cache/clearml-tutorial.git.e1c2351b09f3d661b6f0dbf85e92be2e/clearml-tutorial.git"
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.
Please make sure you have the correct access rights
and the repository exists.
error: Could not fetch origin
Repository cloning failed: Command '['git', 'fetch', '--all', '--recurse-submodules']' returned non-zero exit status 1.
clearml_agent: ERROR: Failed cloning repository.
but from a terminal I can do:
ubuntu@***:~/sw/clearml-tutorial$ git fetch --all --recurse-submodules
Fetching origin
and it works
Hey SuccessfulKoala55 just updating you here. I started from scratch, new EC2 instance, follow the installation step by step and the only change that I made was selecting rsa
instead of ed255190
for the generation of the SSH key (as per github docs ), and now I my agent can connect consistently to GitHub. Just thought of posting this in case someone else runs into a similar issue in the future, this is what worked for me!