ollama

mirror of https://github.com/ollama/ollama.git synced 2025-04-03 17:39:05 +02:00

History

Jesse Gross d773b7d671 backend: API to support full precision matmul

Most tensor backends try to optimize performance by using a lower
precision for matmuls. However, some operations (such as kq) on
some models are sensitive to this and require full precision.

2025-02-13 17:09:26 -08:00

backend

backend: API to support full precision matmul

2025-02-13 17:09:26 -08:00

next ollama runner (#7913 )

2025-02-13 16:31:21 -08:00

backend.go

backend: API to support full precision matmul

2025-02-13 17:09:26 -08:00