Feedback

Chat Icon

Local AI Engineering with Ollama

Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Keep-Alive and Memory Control
50%

Forcing an Unload Right Now

You don't have to wait for the timer. Send a request with keep_alive: 0 and an empty payload to drop the model immediately:

curl http://localhost:11434/api/generate -d '{
  "model": "granite3.3:2b",
  "keep_alive": 0
}'

Or use the CLI shortcut:

Local AI Engineering with Ollama

Run, understand, customize, fine-tune, and build agentic apps on your own hardware

Enroll now to unlock all content and receive all future updates for free.