There is no reason to ever use ollama.
I just checked their docs and can't see anything like it.
Did you mistake the command to just download and load the model?
And yes, it downloads the model, caches it, and then serves future loads of that model out of the cache if the file hasn't changed in the hf repo.
reply
Actually that shouldn't be a question, you clearly did.
Hint: it also opens Claude code configured to use that model