ollama/integration/embed_test.go at pdevine/parser-tidy

mirror of https://github.com/ollama/ollama.git synced 2025-11-11 09:17:50 +01:00

Daniel Hiltgen 6745182885 tests: reduce stress on CPU to 2 models (#12161)

* tests: reduce stress on CPU to 2 models

This should avoid flakes due to systems getting overloaded with 3 (or more) models running concurrently.

* tests: allow slow systems to pass on timeout

If a slow system is still streaming a response, and the response
will pass validation, don't fail just because the system is slow.

* test: unload embedding models more quickly

2025-09-09 09:32:15 -07:00

28 KiB
