Daniel Hiltgen cc269ba094 Remove no longer supported max vram var
The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios.  With Concurrency this was no longer wired up, and the simplistic
value doesn't map to multi-GPU setups.  Users can still set `num_gpu`
to limit memory usage to avoid OOM if we get our predictions wrong.
2024-07-22 09:08:11 -07:00
..
2024-07-22 09:08:11 -07:00
2024-07-13 20:56:24 -07:00
2024-07-13 20:56:24 -07:00
2024-06-04 11:13:30 -07:00