201a987ff9
some more menu options...
2024-04-28 12:40:52 -04:00
2d8125042a
Touch ID for cli install; server restarts
2024-04-27 22:42:38 -04:00
776e7bb5e4
app: fix status item icons
2024-04-27 15:57:57 -04:00
b8d7ca1a7b
Native implementation of macOS app
2024-04-27 14:20:10 -04:00
2bed62926e
types/model: remove Digest (for now) ( #3970 )
...
The Digest type needs more thought and is not necessary at the moment.
2024-04-26 21:14:28 -07:00
aad8d128a0
also look at cwd as a root for windows runners ( #3959 )
2024-04-26 19:14:08 -04:00
ec1acbb867
Merge pull request #3968 from dhiltgen/win_generate
...
Fine grain control over windows generate steps
v0.1.33-rc5
2024-04-26 16:03:38 -07:00
e4859c4563
Fine grain control over windows generate steps
...
This will speed up CI which already tries to only build static for unit tests
2024-04-26 15:49:46 -07:00
8e30eb26bd
Updates the setup command to use llama3. ( #3962 )
2024-04-26 18:41:01 -04:00
0b5c589ca2
Merge pull request #3966 from dhiltgen/bump
...
Fix target in gen_windows.ps1
2024-04-26 15:36:53 -07:00
65fadddc85
Merge pull request #3964 from ollama/mxyng/weights
...
fix gemma, command-r layer weights
2024-04-26 15:23:33 -07:00
ed5fb088c4
Fix target in gen_windows.ps1
2024-04-26 15:10:42 -07:00
f81f308118
fix gemma, command-r layer weights
2024-04-26 15:00:55 -07:00
b1390a7b37
types/model: export ParseNameBare and Merge ( #3957 )
...
These are useful outside this package.
2024-04-26 14:58:07 -07:00
11d83386a5
Merge pull request #3951 from ollama/mxyng/zip
...
check file type before zip
2024-04-26 14:51:23 -07:00
bb31def011
return code 499
when user cancels request while a model is loading ( #3955 )
2024-04-26 17:38:29 -04:00
41e03ede95
check file type before zip
2024-04-26 14:18:07 -07:00
7fea1ecdf6
Merge pull request #3958 from ollama/mxyng/fix-workflow
...
use merge base for diff-tree
2024-04-26 14:17:56 -07:00
054894271d
.github/workflows/test.yaml: add in-flight cancellations on new push ( #3956 )
...
Also, remove a superfluous 'go get'
2024-04-26 13:54:24 -07:00
6fef042f0b
use merge base for diff-tree
2024-04-26 13:54:15 -07:00
5c0c2d1d09
Merge pull request #3954 from dhiltgen/ci_fixes
...
Put back non-avx CPU build for windows
2024-04-26 13:09:03 -07:00
37f9c8ad99
types/model: overhaul Name and Digest types ( #3924 )
2024-04-26 13:08:32 -07:00
2a80f55e2a
Update windows.md ( #3855 )
...
Fixed a typo
2024-04-26 16:04:15 -04:00
421c878a2d
Put back non-avx CPU build for windows
2024-04-26 12:44:07 -07:00
36666c2142
Merge pull request #3925 from dhiltgen/bump
...
Bump llama.cpp to b2737
v0.1.33-rc4
2024-04-26 10:09:38 -07:00
85801317d1
Fix clip log import
2024-04-26 09:43:46 -07:00
2ed0d65948
Bump llama.cpp to b2737
2024-04-26 09:43:28 -07:00
d459dc4ad1
Merge pull request #3950 from dhiltgen/windows_packaging
...
Fix exe name for zip packaging on windows
2024-04-26 09:27:37 -07:00
40bc4622ef
Fix exe name for zip packaging on windows
...
The zip file encodes the OS and architecture, so keep the short exe name
2024-04-26 09:18:05 -07:00
c0f818a07a
Merge pull request #3948 from dhiltgen/win_generate
...
Refactor windows generate for more modular usage
2024-04-26 09:17:20 -07:00
8671fdeda6
Refactor windows generate for more modular usage
2024-04-26 08:35:50 -07:00
2619850fb4
Merge pull request #3933 from dhiltgen/ci_fixes
...
Move cuda/rocm dependency gathering into generate script
v0.1.33-rc3
2024-04-26 07:01:24 -07:00
8feb97dc0d
Move cuda/rocm dependency gathering into generate script
...
This will make it simpler for CI to accumulate artifacts from prior steps
2024-04-25 22:38:44 -07:00
4e1ff6dcbb
Merge pull request #3926 from dhiltgen/ci_fixes
...
Fix release CI
v0.1.33-rc2
2024-04-25 17:42:31 -07:00
8589d752ac
Fix release CI
...
download-artifact path was being used incorrectly. It is where to
extract the zip not the files in the zip to extract. Default is
workspace dir which is what we want, so omit it
2024-04-25 17:27:11 -07:00
de4ded68b0
Merge pull request #3923 from ollama/mxyng/mem
...
only count output tensors
v0.1.33-rc1
2024-04-25 16:34:17 -07:00
9b5a3c5991
Merge pull request #3914 from dhiltgen/mac_perf
...
Improve mac parallel performance
2024-04-25 16:28:31 -07:00
00b0699c75
Reload model if num_gpu
changes ( #3920 )
...
* reload model if `num_gpu` changes
* dont reload on -1
* fix tests
2024-04-25 19:02:40 -04:00
993cf8bf55
llm: limit generation to 10x context size to avoid run on generations ( #3918 )
...
* llm: limit generation to 10x context size to avoid run on generations
* add comment
* simplify condition statement
2024-04-25 19:02:30 -04:00
7bb7cb8a60
only count output tensors
2024-04-25 15:24:08 -07:00
b123be5b71
Adjust context size for parallelism
2024-04-25 13:58:54 -07:00
ddf5c09a9b
use matrix multiplcation kernels in more cases
2024-04-25 13:58:54 -07:00
5f73c08729
Remove trailing spaces ( #3889 )
2024-04-25 14:32:26 -04:00
f503a848c2
Merge pull request #3895 from brycereitano/shiftloading
...
Move ggml loading to when attempting to fit
2024-04-25 09:24:08 -07:00
36a6daccab
Restructure loading conditional chain
2024-04-24 17:37:03 -06:00
ceb0e26e5e
Provide variable ggml for TestLoad
2024-04-24 17:19:55 -06:00
284e02bed0
Move ggml loading to when we attempt fitting
2024-04-24 17:17:24 -06:00
3450a57d4a
Merge pull request #3713 from ollama/mxyng/modelname
...
update copy handler to use model.Name
2024-04-24 16:00:32 -07:00
592dae31c8
update copy to use model.Name
2024-04-24 15:54:54 -07:00
2010cbc5fa
Merge pull request #3833 from ollama/mxyng/fix-from
...
fix: from blob
2024-04-24 15:13:47 -07:00