Ollama’s cloud is currently in preview.
Cloud Models
Ollama’s cloud models are a new kind of model in Ollama that can run without a powerful GPU. Instead, cloud models are automatically offloaded to Ollama’s cloud service while offering the same capabilities as local models, making it possible to keep using your local tools while running larger models that wouldn’t fit on a personal computer. Ollama currently supports the following cloud models, with more coming soon:gpt-oss:20b-cloud
gpt-oss:120b-cloud
deepseek-v3.1:671b-cloud
qwen3-coder:480b-cloud
Running Cloud models
Ollama’s cloud models require an account on ollama.com. To sign in or create an account, run:To run a cloud model, open the terminal and run: