This website requires JavaScript.
Explore
Help
Register
Sign In
highperfocused
/
ollama
Watch
1
Star
0
Fork
0
You've already forked ollama
mirror of
https://github.com/ollama/ollama.git
synced
2025-07-06 09:13:05 +02:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
main
ollama
/
server
History
Michael Yang
d0b32def60
skip quantizing per_layer_token_embd (
#11207
)
...
this tensor isn't compatible with cuda when quantized to q4_K so skip it
2025-06-26 21:49:35 -07:00
..
internal
cache: fix comment function name in cache.go (
#11110
)
2025-06-18 05:21:45 -07:00
auth.go
…
create_test.go
…
create.go
…
download.go
…
fixblobs_test.go
…
fixblobs.go
…
images_test.go
Reapply "feat: incremental gguf parser (
#10822
)" (
#11114
) (
#11119
)
2025-06-20 11:11:40 -07:00
images.go
Reapply "feat: incremental gguf parser (
#10822
)" (
#11114
) (
#11119
)
2025-06-20 11:11:40 -07:00
layer.go
…
manifest_test.go
…
manifest.go
…
model.go
…
modelpath_test.go
…
modelpath.go
…
prompt_test.go
…
prompt.go
…
quantization_test.go
Reapply "feat: incremental gguf parser (
#10822
)" (
#11114
) (
#11119
)
2025-06-20 11:11:40 -07:00
quantization.go
skip quantizing per_layer_token_embd (
#11207
)
2025-06-26 21:49:35 -07:00
routes_create_test.go
…
routes_delete_test.go
…
routes_generate_test.go
…
routes_list_test.go
…
routes_test.go
…
routes.go
tools: loosen tool parsing to allow for more formats (
#11030
)
2025-06-12 14:18:54 -07:00
sched_test.go
Reapply "feat: incremental gguf parser (
#10822
)" (
#11114
) (
#11119
)
2025-06-20 11:11:40 -07:00
sched.go
Merge branch 'main' into drifkin/array-head-count-simple
2025-06-23 10:37:31 -07:00
sparse_common.go
…
sparse_windows.go
…
upload.go
…