Commit Graph

38 Commits

Author SHA1 Message Date
2bd3833c55 Update search settings + chat/search handling (#2333)
* validate web list

* update search settings + chat/search handling

* remove accidentally added search manager

* minor build fix

* push from local
2024-09-06 00:07:39 +00:00
34ba3181ff Update auth for litellm proxy (#2316)
* update for auth

* validated embedding model names

* remove embedding provider

* remove logs

* add ability to delete search setting

* add abiility to delete models + more streamlined API endpoints

* remove upsert

* minor typing fix

* add connector utils
2024-09-04 20:59:07 +00:00
299cb5035c Add litellm proxy embeddings (#2291)
* add litellm proxy

* formatting

* move `api_url` to cloud provider + nits

* remove log

* typing

* quick tuyping fix

* update LiteLLM selection logic

* remove logs + validate functionality

* rename proxy var

* update path casing

* remove pricing for custom models

* functional values
2024-09-02 09:08:35 -07:00
3b035d791e Fix Model Server (#2191) 2024-08-20 17:57:09 -07:00
53387ab3eb Simplify index and model name swap logic (#2188) 2024-08-20 17:31:00 -07:00
8de04acb7f k 2024-08-20 14:06:49 -07:00
71c2b16a01 Pull out stripping of model suffix (#2175) 2024-08-20 11:32:03 -07:00
5ab4d94d94 Logging Level Update (#2165) 2024-08-18 21:53:40 -07:00
22573aba2a Improve Search (#2105) 2024-08-16 21:29:15 -07:00
61b5bd569b Reworked chunking to support mega chunks (#2032) 2024-08-14 22:18:53 -07:00
1c10f54294 GPU Model Server (#2135) 2024-08-14 11:04:28 -07:00
bbb8c5ff0b Speed up docker launch (#2099)
* use move instead of copy

* added logging

* fix overwrites

* tested throughly

* fixes

* clearer commenting
2024-08-13 00:45:05 +00:00
386b229ed3 Cohere Rerank (#2109) 2024-08-11 14:22:42 -07:00
ce666f3320 Propagate Embedding Enum (#2108) 2024-08-11 12:17:54 -07:00
8cd1eda8b1 Rework Rerankers (#2093) 2024-08-08 21:33:49 -07:00
7dcc42aa95 Intent Model Update (#2069) 2024-08-08 20:45:53 -07:00
51731ad0dd Fix issue where large docs/batches break openai embedding 2024-08-02 01:07:09 -07:00
1be1959d80 Changed default local model to nomic (#1943) 2024-07-31 18:54:02 -07:00
036d5c737e No Null Embeddings (#1982) 2024-07-30 19:54:49 -07:00
3938a053aa Rework tokenizer (#1957) 2024-07-29 23:01:49 -07:00
48a0d29a5c Fix empty / reverted embeddings (#1910) 2024-07-23 22:41:31 -07:00
6db4634871 Token Truncation (#1892) 2024-07-21 16:26:32 -07:00
5cfed45cef Handle Empty Titles (#1891) 2024-07-21 14:59:23 -07:00
0e8ba111c8 Model Touchups (#1887) 2024-07-21 12:31:00 -07:00
44820b4909 k 2024-07-21 10:27:57 -07:00
eb3e7610fc Added retries and multithreading for cloud embedding (#1879)
* added retries and multithreading for cloud embedding

* refactored a bit

* cleaned up code

* got the errors to bubble up to the ui correctly

* added exceptin printing

* added requirements

* touchups

---------

Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2024-07-20 22:10:18 -07:00
4d295ab97d Model Server Logging (#1839) 2024-07-15 09:00:27 -07:00
6fe3eeaa48 Fix model serer startup 2024-07-14 23:33:58 -07:00
0d52e99bd4 Improve confluence rate limiting 2024-07-14 16:40:45 -07:00
e7f81d1688 add third party embedding models (#1818) 2024-07-14 10:19:53 -07:00
e361e92230 Healthcheck for model server (#1350) 2024-04-18 16:22:38 -07:00
b59912884b Fix Model Server (#1320) 2024-04-10 23:13:22 -07:00
2db906b7a2 Always Use Model Server (#1306) 2024-04-07 21:25:06 -07:00
4b45164496 Background Index Attempt Creation (#1010) 2024-01-28 23:14:20 -08:00
e94fd8b022 Remove un-needed imports (#999) 2024-01-25 12:10:19 -08:00
2e6be57880 Model Server CVEs (#898) 2023-12-29 21:14:08 -08:00
c3cf9134bb Telemetry Revision (#868) 2023-12-24 17:39:37 -08:00
7433dddac3 Model Server (#695)
Provides the ability to pull out the NLP models into a separate model server which can then be hosted on a GPU instance if desired.
2023-11-06 16:36:09 -08:00