ollama

mirror of https://github.com/ollama/ollama.git synced 2025-07-09 03:20:15 +02:00

Author	SHA1	Message	Date
L. Jiang	716e365615	test: add test cases for HumanNumber (#9108 )	2025-02-18 11:35:26 -08:00
Azis Alvriyanto	32285a6d19	format: rename test file from byte_test.go to bytes_test.go (#8865 )	2025-02-06 13:06:15 -08:00
Azis Alvriyanto	8d8b9f83ae	format: byte formatting test coverage (#8692 ) Removed redundant checks and streamlined the switch-case structure. Added test cases for both HumanBytes and HumanBytes2 to cover a wide range of scenarios.	2025-02-05 12:23:07 -08:00
Michael Yang	b732beba6a	lint	2024-08-01 17:06:06 -07:00
Michael Yang	e40145a39d	lint	2024-06-04 11:13:30 -07:00
Patrick Devine	6845988807	Ollama `ps` command for showing currently loaded models (#4327 )	2024-05-13 17:17:36 -07:00
Daniel Hiltgen	bee2f4a3b0	Record GPU usage information This records more GPU usage information for eventual UX inclusion.	2024-05-08 14:45:39 -07:00
Bruce MacDonald	527e9be058	fix: store accurate model parameter size (#4058 ) - add test for number formatting - fix bug where 1B and 1M were not stored correctly - display 2 decimal points for million param sizes - display 1 decimal point for billion param sizes	2024-05-07 14:41:53 -07:00
Daniel Hiltgen	34b9db5afc	Request and model concurrency This change adds support for multiple concurrent requests, as well as loading multiple models by spawning multiple runners. The default settings are currently set at 1 concurrent request per model and only 1 loaded model at a time, but these can be adjusted by setting OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.	2024-04-22 19:29:12 -07:00
Michael Yang	7e33a017c0	partial offloading	2024-04-10 11:37:20 -07:00
Michael Yang	91b3e4d282	update memory calcualtions count each layer independently when deciding gpu offloading	2024-04-01 13:16:32 -07:00
Michael Yang	fd10a2ad4b	remove format/openssh.go this is unnecessary now that x/crypto/ssh.MarshalPrivateKey has been added	2024-02-23 16:52:23 -08:00
Michael Yang	424d53ac70	progress: fix bar rate	2023-11-28 11:44:56 -08:00
Jeffrey Morgan	93a108214c	only show decimal points for smaller file size numbers	2023-11-20 10:58:19 -05:00
Michael Yang	9f04e5a8ea	format bytes	2023-11-17 10:06:19 -08:00
Michael Yang	01ea6002c4	replace go-humanize with format.HumanBytes	2023-11-14 14:57:41 -08:00
Michael Yang	c5e1bbabda	instead of static number of parameters for each model family, get the real number from the tensors (#1022 ) * parse tensor info * refactor decoder * return actual parameter count * explicit rounding * s/Human/HumanNumber/	2023-11-08 17:55:46 -08:00
Michael Yang	2ce1793a1d	go fmt	2023-10-19 09:21:51 -07:00
Michael Yang	92189a5855	fix memory check	2023-10-13 14:47:29 -07:00
Michael Yang	b599946b74	add format bytes	2023-10-11 14:08:23 -07:00
Michael Yang	b5e08e3373	cleanup format time	2023-10-11 11:09:27 -07:00
Michael Yang	0dae34b6a7	remove unused openssh key types	2023-09-06 14:34:09 -07:00
Patrick Devine	9770e3b325	Generate private/public keypair for use w/ auth (#324 )	2023-08-11 10:58:23 -07:00
Patrick Devine	5bea29f610	add new list command (#97 )	2023-07-18 09:09:45 -07:00

24 Commits