Keep-Alive and Memory Control
Picking a Keep-Alive That Makes Sense
There's no official guidance on this; the right value depends on how often you reuse the model and how much memory pressure you have. Here are starting points worth using and adjusting:
| Use case | Starting value | Reasoning |
|---|---|---|
| Interactive development | 30m to 1h | Survives breaks without paying reload cost; frees VRAM overnight |
| Production API server | -1 | Never reload on live traffic |
| Shared workstation | 5m |
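
As a rough sketch of how these starting values could be applied per request, the snippet below maps each use case to a `keep_alive` value and sends it to Ollama's `/api/generate` endpoint. The dictionary keys, the function names, and the default base URL are assumptions made for illustration; only the `keep_alive` request field and the endpoint come from Ollama itself.

```python
import json
import urllib.request

# Starting values from the table above. The keys are hypothetical labels,
# not anything Ollama defines.
KEEP_ALIVE_BY_USE_CASE = {
    "interactive_dev": "30m",      # duration string
    "production_api": -1,          # negative number: keep loaded indefinitely
    "shared_workstation": "5m",
}


def build_generate_payload(model, prompt, use_case):
    """Build a non-streaming /api/generate request body with keep_alive set."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "keep_alive": KEEP_ALIVE_BY_USE_CASE[use_case],
    }


def generate(payload, base_url="http://localhost:11434"):
    """POST the payload to a local Ollama server and return the reply text.

    Assumes a server is listening on base_url; passing data makes this a POST.
    """
    req = urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With a server running, something like `generate(build_generate_payload("llama3.2", "Hello", "interactive_dev"))` would keep the model resident for 30 minutes after the call; the model name here is only a placeholder. A value set on a request applies to that model load, so different clients can use different values against the same server.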
