Reputation
Badges 1
533 × Eureka!the Task
object has a method called Task.execute_remotely
Look it up here:
https://allegro.ai/docs/task.html#trains.task.Task.execute_remotely
Also being able to separate their configurations files would be good (maybe there is and I don't know?)
after you create the pipeline object itself , can you get Task.current_task() ?
AgitatedDove14 no I can't... Just checked this. This is a huge problem for us, it used to work before and it just stopped working and I can't figure out why.
It's a problem for us because we made it a methodology of running some tasks under a pipeline task and saving summary iunfo to the pipeline task - but now since Task.current_task()
doesn't work on the pipeline object we have a serious problem
I suspect that it has something to do with remote execution / local execution of pipelines, because we play with this , so sometimes the pipeline task itself executes on the client, and sometimes on the host (where the agent is also)
The scenario I'm going for is never to run on the dev machine, so all I'll need to do once the server + agents are up is to add task.execute_remotely...
after the Task.init
line and after the execution of the script is called on the dev machine, it won't actually run but rather enqueue itself for the agent to run it?
As a part of a repo
AgitatedDove14 just a reminder if you missed this question 😄
I have a single IAM, my question is what kind of permissions I should associate with the IAM so that the autoscaler task will work
it seems that only the packages that are on the script are getting installed
Now I remind you that using the same credentials exactly, the auto scaler task could launch instances before
actually i was thinking about model that werent trained uaing clearml, like pretrained models etc
Yes, I'll prepare something and send
it's double weird, because also a task that the pipeline says is "in progress" is actually completed
I was refering to what is the returned object of Task.artifacts['...']
- when I call .get
I understand what I get, I'm asking because I want to see how the object I'm calling .get
on behaves
Do you have any idea as to why does that happen SuccessfulKoala55
AgitatedDove14 all I did was to cerate this metric as "last" and then turned on the "max" and "min" and then turned them off
I can't reproduce it now but:
I restarted the services and it didn't help I deleted the columns, and created them again after a while and it helped
is this already available or only on github?
It's kind of random, it works sometimes and sometimes it doesn't
So prior to doing any work on the trains autoscaler servcice, I should first create a auto scaling group in AWS?
The weirdest thing, is that the execution is "completed" but it actually failed