mirror of
https://github.com/ollama/ollama.git
synced 2025-11-13 03:46:44 +01:00
GGML has a function to report the allocated size of a backend buffer. However, this returns 0 if we tried to allocate a buffer and it failed. For memory management purposes, it's important to know how much we were trying to allocate. This extends the API to report attempted sizes for all buffers and whether it succeeeded.
6.9 KiB
6.9 KiB