mirror of https://github.com/ollama/ollama.git, synced 2025-04-01 00:19:43 +02:00
some tensors are expected to be used in repeating layers but are not themselves repeated. this change copies these tensors into the same backends as their repeating counterparts to minimize copying tensors between backends