ollama

mirror of https://github.com/ollama/ollama.git synced 2025-03-27 02:01:56 +01:00

History

Daniel Hiltgen 58d95cc9bd Switch back to subprocessing for llama.cpp

This should resolve a number of memory leak and stability defects by allowing
us to isolate llama.cpp in a separate process and shutdown when idle, and
gracefully restart if it has problems.  This also serves as a first step to be
able to run multiple copies to support multiple models concurrently.

2024-04-01 16:48:18 -07:00

ext_server

Switch back to subprocessing for llama.cpp

2024-04-01 16:48:18 -07:00

generate

Switch back to subprocessing for llama.cpp

2024-04-01 16:48:18 -07:00

llama.cpp @ ad3a0505e3

Bump llama.cpp to b2527

2024-03-25 13:47:44 -07:00

patches

Bump llama.cpp to b2474