ollama/integration/context_test.go at dc77bbcfa40dea8b8fc7713a2ecacbc6a9d08a25

mirror of https://github.com/ollama/ollama.git synced 2025-12-13 19:33:12 +01:00

Files

Daniel Hiltgen 73e2c8f68f Fix context exhaustion integration test for small gpus

On the smaller GPUs, the initial model load of llama2 took over 30s (the
default timeout for the DoGenerate helper)

2024-07-09 16:24:14 -07:00

View Raw