highperfocused/ollama
mirror of https://github.com/ollama/ollama.git synced 2025-07-06 09:13:05 +02:00
main
ollama/server
Michael Yang d0b32def60 skip quantizing per_layer_token_embd (#11207)
this tensor isn't compatible with cuda when quantized to q4_K so skip it
2025-06-26 21:49:35 -07:00
internal
cache: fix comment function name in cache.go (#11110)
2025-06-18 05:21:45 -07:00
auth.go
…
create_test.go
…
create.go
…
download.go
…
fixblobs_test.go
…
fixblobs.go
…
images_test.go
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119)
2025-06-20 11:11:40 -07:00
images.go
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119)
2025-06-20 11:11:40 -07:00
layer.go
…
manifest_test.go
…
manifest.go
…
model.go
…
modelpath_test.go
…
modelpath.go
…
prompt_test.go
…
prompt.go
…
quantization_test.go
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119)
2025-06-20 11:11:40 -07:00
quantization.go
skip quantizing per_layer_token_embd (#11207)
2025-06-26 21:49:35 -07:00
routes_create_test.go
…
routes_delete_test.go
…
routes_generate_test.go
…
routes_list_test.go
…
routes_test.go
…
routes.go
tools: loosen tool parsing to allow for more formats (#11030)
2025-06-12 14:18:54 -07:00
sched_test.go
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119)
2025-06-20 11:11:40 -07:00
sched.go
Merge branch 'main' into drifkin/array-head-count-simple
2025-06-23 10:37:31 -07:00
sparse_common.go
…
sparse_windows.go
…
upload.go
…