mirror of
https://github.com/ollama/ollama.git
synced 2025-11-11 09:17:50 +01:00
* tests: reduce stress on CPU to 2 models This should avoid flakes due to systems getting overloaded with 3 (or more) models running concurrently * tests: allow slow systems to pass on timeout If a slow system is still streaming a response, and the response will pass validation, don't fail just because the system is slow. * test: unload embedding models more quickly
28 KiB
28 KiB