Changed default local model to nomic (#1943)

hagen-danswer
2024-07-31 18:54:02 -07:00
committed by GitHub
parent 1654378850
commit 1be1959d80
12 changed files with 78 additions and 28 deletions


@@ -8,7 +8,7 @@ For general information, please read the instructions in this [README](https://g
This part is elaborated precisely in this [README](https://github.com/danswer-ai/danswer/blob/main/deployment/README.md) in the section *Docker Compose*. If you have any questions, please feel free to open an issue or get in touch on Slack for support.
## Deploy in a system with GPU support
Running Model servers with GPU support while indexing and querying can result in significant performance improvements. This is highly recommended if you have access to the resources. Currently, Danswer offloads the embedding model and tokenizers to GPU VRAM, and the amount needed depends on the chosen embedding model. The default embedding model `intfloat/e5-base-v2` takes up about 1GB of VRAM, and since it is needed for both the inference and embedding pipelines, you would need roughly 2GB of VRAM.
Running Model servers with GPU support while indexing and querying can result in significant performance improvements. This is highly recommended if you have access to the resources. Currently, Danswer offloads the embedding model and tokenizers to GPU VRAM, and the amount needed depends on the chosen embedding model. For example, the embedding model `nomic-ai/nomic-embed-text-v1` takes up about 1GB of VRAM, so running it for both the inference and embedding pipelines would require roughly 2GB of VRAM.
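If you want to sanity-check the VRAM footprint on your own hardware, a rough measurement is easy to script. The following is a minimal, hypothetical sketch (not part of Danswer itself), assuming `sentence-transformers` and a CUDA-enabled PyTorch build are installed:

```python
# Minimal sketch (not part of Danswer): roughly measure the VRAM occupied
# by the embedding model on a CUDA GPU.
import torch
from sentence_transformers import SentenceTransformer

# nomic-embed-text-v1 ships custom modeling code, hence trust_remote_code=True.
model = SentenceTransformer(
    "nomic-ai/nomic-embed-text-v1", trust_remote_code=True, device="cuda"
)
model.encode(["warm-up query to allocate inference buffers"])
print(f"VRAM allocated: {torch.cuda.memory_allocated() / 1024**3:.2f} GiB")
```

Note that `torch.cuda.memory_allocated()` only reports tensors PyTorch has allocated; CUDA context overhead comes on top of that, so treat the printed number as a lower bound.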
### Setup
To be able to use the NVIDIA runtime, the following is mandatory: