Are you getting the error from boto failing to launch additional EC2 instances?
Nope, AWS aren't approving the increased vCPU request. I've explained the use case several times and they've not approved it
Yep, this is a limitation of the "low tier" G instances... I guess they want you to switch to the P instances?
Which G are you using?
g4dn.xlarge (the best price for 16GB of GPU RAM). Not so surprising they would want a switch
Yep... they are pushing "heavy" users away from these instances. Nothing really you can do, maybe switch to Azure/GCP, but it might be the same there
This was the response from AWS:
"Thank you for sharing the requested details with us. As we discussed, I'd like to share that our internal service team is currently unable to support any G type vCPU increase request for limit increase.
The issue is we are currently facing capacity scarcity to accommodate P and G instances. Our engineers are working towards fixing this issue. However, until then, we are unable to expand the capacity and process limit increase."
AgitatedDove14 is anyone working on a GCP or Azure autoscaler at the moment?
GCP is being released to the SaaS and then should work its way to the open-source
Azure is still being worked on (only in beta at the moment)
Okay thanks for the update 🙂 the account manager got involved and the limit has been approved 🚀
AgitatedDove14 can you share if there is a plan to put the GCP autoscaler in the open source?