Nope AWS aren't approving the increased vCPU request. I've explained the use case several times and they've not approved
Are you getting the error from boto failing to launch additional ec2 instances ?
This was the response from AWS:
"Thank you for for sharing the requested details with us. As we discussed, I'd like to share that our internal service team is currently unable to support any G type vCPU increase request for limit increase.
The issue is we are currently facing capacity scarcity to accommodate P and G instances. Our engineers are working towards fixing this issue. However, until then, we are unable to expand the capacity and process limit increase."
AgitatedDove14 is any working on a GCP or Azura autoscaler at the moment?
Yep... they are pushing "heavy" users away from these instances. Nothing really you can do, maybe switch to Azure/GCP, but it might be the same there
gdn4.xlarge (the best price for 16GB of GPU ram). Not so surprising they would want a switch
Okay thanks for the update 🙂 the account manager got involved and the limit has been approved 🚀
Yep, this is a limitation of the "low tier" G instances... I guess they want you to switch to the P instances?
Which G are you using ?
AgitatedDove14 can you share if there is a plan to put the gcp autoscaler in the open source?
GCP is being released to the SaaS and then should work its way to the open-source
Azure is till being worked on (only in beta at the moment)