pablonyx
ab1b6b487e
descrease model server logspam ( #4166 )
2025-03-10 18:29:27 +00:00
rkuo-danswer
870b59a1cc
Bugfix/vertex crash ( #4181 )
...
* Update text embedding model to version 005 and enhance embedding retrieval process
* re
* Fix formatting issues
* Add support for Bedrock reranking provider and AWS credentials handling
* fix: improve AWS key format validation and error messages
* Fix vertex embedding model crash
* feat: add environment template for local development setup
* Add display name for Claude 3.7 Sonnet model
* Add display names for Gemini 2.0 models and update Claude 3.7 Sonnet entry
* Fix ruff errors by ensuring lines are within 130 characters
* revert to currently default onyx browser settings
* add / fix boto requirements
---------
Co-authored-by: ferdinand loesch <f.loesch@sportradar.com>
Co-authored-by: Ferdinand Loesch <ferdinandloesch@me.com>
Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>
2025-03-05 01:59:46 +00:00
Chris Weaver
f25e1e80f6
Add option to not re-index ( #4157 )
...
* Add option to not re-index
* Add quantizaton / dimensionality override support
* Fix build / ut
2025-03-03 10:54:11 -08:00
rkuo-danswer
fe8a5d671a
don't spam the logs with texts on auth errors ( #4085 )
...
* don't spam the logs with texts on auth errors
* refactor the logging a bit
---------
Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>
2025-02-21 13:40:07 -08:00
Richard Kuo (Danswer)
fb931ee4de
fixes
2025-02-07 17:28:17 -08:00
Richard Kuo (Danswer)
bc2c56dfb6
improve gpu detection functions and logging in model server
2025-02-07 16:59:02 -08:00
rkuo-danswer
03acb6587a
Feature/model server logging ( #3579 )
...
* improve model server logging
* improve exception logging with provider/model names
* get everything into one log line
---------
Co-authored-by: Richard Kuo <rkuo@rkuo.com>
2025-01-03 01:40:29 +00:00
pablodanswer
21ec5ed795
welcome to onyx
2024-12-13 09:56:10 -08:00
Yuhong Sun
6026536110
Model Server Async ( #3386 )
...
* need-verify
* fix some lib calls
* k
* tests
* k
* k
* k
* Address the comments
* fix comment
2024-12-11 01:33:44 +00:00
Chris Weaver
ac448956e9
Add handling for rate limiting ( #3280 )
2024-11-27 14:22:15 -08:00
Chris Weaver
36134021c5
Refactor + add global timeout env variable ( #2844 )
...
* Refactor + add global timeout env variable
* remove model
* mypy
* Remove unused
2024-10-18 18:25:27 +00:00
pablodanswer
e022e77b6d
Simpler azure embedding ( #2751 )
...
* functional but janky
* nit
* adapt for azure
* nit
* minor updates
* nits
* nit
* nit
* ensure access to litellm
* k
2024-10-15 23:23:11 +00:00
pablodanswer
d5b9a6e552
add vespa + embedding timeout env variables ( #2689 )
...
* add vespa + embedding timeout env variables
* nit: integration test
* add dangerous override
* k
* add additional clarity
* nit
* nit
2024-10-09 03:20:28 +00:00
pablodanswer
3a9b964d5c
Add Litellm Rerank proxy ( #2346 )
...
* add ability ot set reranking litellm proxy
* add fully functional rerank litellm cards
* minor formatting enforcement
* remove logs
2024-09-09 15:57:01 +00:00
pablodanswer
2bd3833c55
Update search settings + chat/search handling ( #2333 )
...
* validate web list
* update search settings + chat/search handling
* remove accidentally added search manager
* minor build fix
* push from local
2024-09-06 00:07:39 +00:00
pablodanswer
34ba3181ff
Update auth for litellm proxy ( #2316 )
...
* update for auth
* validated embedding model names
* remove embedding provider
* remove logs
* add ability to delete search setting
* add abiility to delete models + more streamlined API endpoints
* remove upsert
* minor typing fix
* add connector utils
2024-09-04 20:59:07 +00:00
pablodanswer
299cb5035c
Add litellm proxy embeddings ( #2291 )
...
* add litellm proxy
* formatting
* move `api_url` to cloud provider + nits
* remove log
* typing
* quick tuyping fix
* update LiteLLM selection logic
* remove logs + validate functionality
* rename proxy var
* update path casing
* remove pricing for custom models
* functional values
2024-09-02 09:08:35 -07:00
pablodanswer
53387ab3eb
Simplify index and model name swap logic ( #2188 )
2024-08-20 17:31:00 -07:00
pablodanswer
71c2b16a01
Pull out stripping of model suffix ( #2175 )
2024-08-20 11:32:03 -07:00
Yuhong Sun
5ab4d94d94
Logging Level Update ( #2165 )
2024-08-18 21:53:40 -07:00
pablodanswer
22573aba2a
Improve Search ( #2105 )
2024-08-16 21:29:15 -07:00
Yuhong Sun
386b229ed3
Cohere Rerank ( #2109 )
2024-08-11 14:22:42 -07:00
Yuhong Sun
ce666f3320
Propagate Embedding Enum ( #2108 )
2024-08-11 12:17:54 -07:00
Yuhong Sun
8cd1eda8b1
Rework Rerankers ( #2093 )
2024-08-08 21:33:49 -07:00
Weves
51731ad0dd
Fix issue where large docs/batches break openai embedding
2024-08-02 01:07:09 -07:00
hagen-danswer
1be1959d80
Changed default local model to nomic ( #1943 )
2024-07-31 18:54:02 -07:00
Yuhong Sun
036d5c737e
No Null Embeddings ( #1982 )
2024-07-30 19:54:49 -07:00
hagen-danswer
3938a053aa
Rework tokenizer ( #1957 )
2024-07-29 23:01:49 -07:00
pablodanswer
48a0d29a5c
Fix empty / reverted embeddings ( #1910 )
2024-07-23 22:41:31 -07:00
Yuhong Sun
6db4634871
Token Truncation ( #1892 )
2024-07-21 16:26:32 -07:00
Yuhong Sun
5cfed45cef
Handle Empty Titles ( #1891 )
2024-07-21 14:59:23 -07:00
Yuhong Sun
0e8ba111c8
Model Touchups ( #1887 )
2024-07-21 12:31:00 -07:00
Yuhong Sun
44820b4909
k
2024-07-21 10:27:57 -07:00
hagen-danswer
eb3e7610fc
Added retries and multithreading for cloud embedding ( #1879 )
...
* added retries and multithreading for cloud embedding
* refactored a bit
* cleaned up code
* got the errors to bubble up to the ui correctly
* added exceptin printing
* added requirements
* touchups
---------
Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2024-07-20 22:10:18 -07:00
Yuhong Sun
4d295ab97d
Model Server Logging ( #1839 )
2024-07-15 09:00:27 -07:00
Weves
6fe3eeaa48
Fix model serer startup
2024-07-14 23:33:58 -07:00
Weves
0d52e99bd4
Improve confluence rate limiting
2024-07-14 16:40:45 -07:00
pablodanswer
e7f81d1688
add third party embedding models ( #1818 )
2024-07-14 10:19:53 -07:00
Yuhong Sun
b59912884b
Fix Model Server ( #1320 )
2024-04-10 23:13:22 -07:00
Yuhong Sun
2db906b7a2
Always Use Model Server ( #1306 )
2024-04-07 21:25:06 -07:00
Yuhong Sun
4b45164496
Background Index Attempt Creation ( #1010 )
2024-01-28 23:14:20 -08:00
Yuhong Sun
c3cf9134bb
Telemetry Revision ( #868 )
2023-12-24 17:39:37 -08:00
Yuhong Sun
7433dddac3
Model Server ( #695 )
...
Provides the ability to pull out the NLP models into a separate model server which can then be hosted on a GPU instance if desired.
2023-11-06 16:36:09 -08:00