Our default behavior today is to try to fit the model onto a single GPU if possible. Some users would prefer the old behavior of always spreading the model across multiple GPUs, even when it would fit on one. This change exposes that behavior as a tunable.
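A minimal sketch of how the scheduler might read such a tunable, assuming it is exposed as an `OLLAMA_SCHED_SPREAD` environment variable (the exact variable name and parsing location are assumptions; the real parsing would live in the server's env config):

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// schedSpread reports whether the user asked to spread the model across
// all GPUs even when a single GPU has enough free memory.
// Sketch only: assumes the tunable is the OLLAMA_SCHED_SPREAD env var.
func schedSpread() bool {
	v := os.Getenv("OLLAMA_SCHED_SPREAD")
	if v == "" {
		// Default behavior: pack onto a single GPU when possible.
		return false
	}
	spread, err := strconv.ParseBool(v)
	if err != nil {
		// Unparseable values fall back to the default.
		return false
	}
	return spread
}

func main() {
	fmt.Println("spread across GPUs:", schedSpread())
}
```

Under that assumption, a user who wants the old spreading behavior would set the variable before starting the server, e.g. `OLLAMA_SCHED_SPREAD=1 ollama serve`, while leaving it unset keeps the single-GPU default.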