ollama/ml/backend/ggml/ggml.go at 220e133fca8fe128dbf8fecef96c8484f991e39c

mirror of https://github.com/ollama/ollama.git synced 2025-11-13 07:29:35 +01:00

Files

Jesse Gross ef549d513c ggml: Increase maximum graph size

The initial implementation of qwen3-vl:235b exceeded the maximum graph
size based on the number of tensors. Although this was later fixed
through the use of the mrope operation, we are close to the limit in
some cases. This updates to track the current llama.cpp usage of GGML.

2025-11-03 16:05:37 -08:00

44 KiB

Raw Blame History

View Raw

44 KiB Raw Blame History

44 KiB

Raw Blame History