ollama

mirror of https://github.com/ollama/ollama.git synced 2025-11-10 22:07:45 +01:00

Files

Jesse Gross 19e6796eac llm: Support KV cache quantization with gpt-oss

With the new version of GGML in #12245, KV cache quantization
no longer causes a fallback to CPU.

2025-10-03 16:31:58 -07:00

2025-10-03 16:31:58 -07:00

…

…

config.go

…