ollama/llama/patches/0016-graph-memory-reporting-on-failure.patch at d73f8aa8c3979b33f5ea19b80406c20e88ee3b1b

mirror of https://github.com/ollama/ollama.git synced 2025-11-13 03:46:44 +01:00

Files

Jesse Gross 6db8a3771c ggml: Report graph memory for failed allocations

GGML has a function to report the allocated size of a backend buffer.
However, this returns 0 if we tried to allocate a buffer and it failed.
For memory management purposes, it's important to know how much we were
trying to allocate. This extends the API to report attempted sizes for
all buffers and whether it succeeeded.

2025-05-22 14:38:09 -07:00

6.9 KiB

Raw Blame History

View Raw

6.9 KiB Raw Blame History

6.9 KiB

Raw Blame History