And yes, I am using the agents that come with the Helm chart from Clearml repository
Hi Jake thanks for your answer!
So I just have a very simple file "project.py" with this content:
` from clearml import Task
task = Task.init(project_name='project-no-git', task_name='experiment-1')
import pandas as pd
print("OK") If I run
python project.py ` from a folder that is not in a git repository, I can clone the task and enqueue it from the UI, and ti runs in the agent with no problems.
If I copy the same file, in a folder that is in a git repository, when I enqueue the experiment it stays in a pending state.
So what I suspect is that the Pod can't be created because it can't access github (enterprise self hosted), but it's not clear to me why.
but I was a bit set off track seeing errors in the logs
Hi Jack, yes we had to customize the default one for some tools we use internally
Yeah, that sounds right. So when you do the first scenario (i.e. running outside of a Git repository), ClearML will just take the code file content and embed it on the task - that's why the agent can take it an run it (even if it doesn't have access to the git repository, because it's not required).
In the second scenario, ClearML basically stored the git repository details on the task, and the agent needs to access it somehow (using some sort of authentication)
Yes, I still see those errors, but queues are working :)
SarcasticSquirrel56 quick question - is it possible you're using a self-build webserver image?
I actually found out it was an indentation error 😅 and the credentials weren't picked
Hi SarcasticSquirrel56 , these look like the webserver logs - they will probably not be indicative.
However if I copy the same code out in a folder with no repository, than they are enqueue and executed correctly.
What do you mean? where is this folder located?
Are you running the ClearML Agents in k8s?
I'll ask the UI people to take a look at these errors anyway 🙂
Got it. So that's the reason, the github code-base does not include the default version.json we use when building our official images - I'll make sure we update it for the next release
So the question is how did you configure your PAT in the agent's configuration, and what is the repository URL format the ClearML SDK stored in the task's execution section - can you share the details?