ollama

mirror of https://github.com/ollama/ollama.git synced 2025-04-09 20:29:23 +02:00

Fix embeddings memory corruption (#6467)

* Fix embeddings memory corruption

The patch was causing a buffer overrun. Once it was removed, though, parallelism
in server.cpp led to hitting an assert due to slot/seq IDs being >= token count. To
work around this, only use slot 0 for embeddings.

* Fix embed integration test assumption

The token eval count has changed with recent llama.cpp bumps (0.3.5+)

2024-08-22 14:51:42 -07:00

01-load-progress.diff

update llama.cpp submodule to d7fd29f (#5475)

2024-07-05 13:25:58 -04:00

02-clip-log.diff

Fix clip log import

2024-04-26 09:43:46 -07:00

03-load_exception.diff

update llama.cpp submodule to d7fd29f (#5475)

2024-07-05 13:25:58 -04:00

04-metal.diff

update llama.cpp submodule to d7fd29f (#5475)