> Long-running tasks like batch mode completions and agent sessions may incur overages beyond your project spend cap.
> Billing data processing times can be delayed in AI Studio, up to around 10 minutes. You may experience overages beyond your project cap if billing data hasn't processed before more charges are accrued.
the question is will we experience resource constraints before we get there? what if the step up to post-scarcity is gated by a compute level just out of our reach?
i don't think so, it is not provided as a service. if you provide vpn service people can connect to from their router then you need to do age verification before giving them a key/password to connect to the server
Gemma 3n E4B has been crazy good for me - fine tune running on Google Cloud Run via Ollama, completely avoiding token based pricing at the cost of throughput limitations