highperfocused/ollama
mirror of https://github.com/ollama/ollama.git synced 2025-12-12 05:02:39 +01:00
1a1c99e3346da21bf2062fa266cf39da954c66a8
ollama/llm
Michael Yang e873841cbb deepseek v2 graph
2024-06-18 15:35:12 -07:00
Name | Last commit | Date
ext_server | Fix server.cpp for the new cuda build macros | 2024-06-14 14:51:40 -07:00
generate | Add back lower level parallel flags | 2024-06-17 13:44:46 -07:00
llama.cpp @ 7c26775adb | llm: update llama.cpp commit to 7c26775 (#4896) | 2024-06-17 15:56:16 -04:00
patches | llm: update llama.cpp commit to 7c26775 (#4896) | 2024-06-17 15:56:16 -04:00
filetype.go | Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) | 2024-05-23 13:21:49 -07:00
ggla.go | simplify safetensors reading | 2024-05-21 11:28:22 -07:00
ggml.go | deepseek v2 graph | 2024-06-18 15:35:12 -07:00
gguf.go | Revert "Merge pull request #4938 from ollama/mxyng/fix-byte-order" | 2024-06-11 15:56:17 -07:00
llm_darwin_amd64.go | …
llm_darwin_arm64.go | …
llm_linux.go | …
llm_windows.go | …
llm.go | revert tokenize ffi (#4761) | 2024-05-31 18:54:21 -07:00
memory_test.go | review comments and coverage | 2024-06-14 14:55:50 -07:00
memory.go | Handle models with divergent layer sizes | 2024-06-18 11:05:34 -07:00
payload.go | review comments and coverage | 2024-06-14 14:55:50 -07:00
server.go | Tighten up memory prediction logging | 2024-06-18 09:15:35 -07:00
status.go | …