Reputation
Badges 1
106 × Eureka!what i'm doing is getting
parent = Task.get_task(task.parent)
and then checkingparent.data.user
but the user is some unknown id that doesn't exist in the all_users
list
SmugSnake6 yep, that’s exactly it.
Hope the team is aware and will fix it
I am currently on vacation, I'll ask my team mates. But if not I'll get to it next week
The pipeline is a bit complex, but it did that with a very dumb example
tnx, i just can’t use 1.7.1 because of the pipeline problem from before
that makes more sense 🙂
would this work now as a workaround until the version is released?
don’t have one ATM
Artifacts, nothing is reaching s3
Hi, yes it's running with autoscaler so it's for sure in docker mode
Are you saying that it should've worked? I got 'docker' attribute doesn't exist error. Maybe it's the version of the clearml server?
Yes, but it’s more complex because i’m using a pipeline… where i don’t explicitly call Task.init()
BTW, i would expect this to happen automtically when running “local” and “debug”
@<1523701118159294464:profile|ExasperatedCrab78> Sorry only saw this now,
Thanks for checking it!
Glad to see you found the issue, hope you find a way to fix the second one. for now we will continue using the previous version.
Would be glad if you can post when everything is fixed so we can advance our version.
hi, yes we tried with the same result
yeah, it gets to that error because the previous issue is saved…i’ll try to work on a new example
CostlyOstrich36 This is for a step in the pipeline
for now we downgraded to 1.7.2, but of course prefer not to stay that way
` args.py #504:
for k, v in dictionary.items():
# if key is not present in the task's parameters, assume we didn't get this far when running
# in non-remote mode, and just add it to the task's parameters
if k not in parameters:
self._task.set_parameter((prefix or '') + k, v)
continue
task.py #1266:
def set_parameter(self, name, value, description=None, value_type=None):
# type: (str, str, Optional[str], O...
This is the next step not being able to find the output of the last step
ValueError: Could not retrieve a local copy of artifact return_object, failed downloading
It’s a lot of manual work that you need to remember to undo
also, i don’t need to change it during execution, i want it for a specific run
i get for one of the tasks, but then it fails because it seems that the fastai2 tensorboardcallback isn’t fit for distributed training (which i’m opening an issue for them now)