danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-03-27 02:02:18 +01:00

Author	SHA1	Message	Date
pablonyx	ab1b6b487e	descrease model server logspam (#4166 )	2025-03-10 18:29:27 +00:00
rkuo-danswer	870b59a1cc	Bugfix/vertex crash (#4181 ) * Update text embedding model to version 005 and enhance embedding retrieval process * re * Fix formatting issues * Add support for Bedrock reranking provider and AWS credentials handling * fix: improve AWS key format validation and error messages * Fix vertex embedding model crash * feat: add environment template for local development setup * Add display name for Claude 3.7 Sonnet model * Add display names for Gemini 2.0 models and update Claude 3.7 Sonnet entry * Fix ruff errors by ensuring lines are within 130 characters * revert to currently default onyx browser settings * add / fix boto requirements --------- Co-authored-by: ferdinand loesch <f.loesch@sportradar.com> Co-authored-by: Ferdinand Loesch <ferdinandloesch@me.com> Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-03-05 01:59:46 +00:00
Chris Weaver	f25e1e80f6	Add option to not re-index (#4157 ) * Add option to not re-index * Add quantizaton / dimensionality override support * Fix build / ut	2025-03-03 10:54:11 -08:00
rkuo-danswer	fe8a5d671a	don't spam the logs with texts on auth errors (#4085 ) * don't spam the logs with texts on auth errors * refactor the logging a bit --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-02-21 13:40:07 -08:00
Richard Kuo (Danswer)	fb931ee4de	fixes	2025-02-07 17:28:17 -08:00
Richard Kuo (Danswer)	bc2c56dfb6	improve gpu detection functions and logging in model server	2025-02-07 16:59:02 -08:00
rkuo-danswer	03acb6587a	Feature/model server logging (#3579 ) * improve model server logging * improve exception logging with provider/model names * get everything into one log line --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2025-01-03 01:40:29 +00:00
pablodanswer	21ec5ed795	welcome to onyx	2024-12-13 09:56:10 -08:00
Yuhong Sun	6026536110	Model Server Async (#3386 ) * need-verify * fix some lib calls * k * tests * k * k * k * Address the comments * fix comment	2024-12-11 01:33:44 +00:00
Chris Weaver	ac448956e9	Add handling for rate limiting (#3280 )	2024-11-27 14:22:15 -08:00
Chris Weaver	36134021c5	Refactor + add global timeout env variable (#2844 ) * Refactor + add global timeout env variable * remove model * mypy * Remove unused	2024-10-18 18:25:27 +00:00
pablodanswer	e022e77b6d	Simpler azure embedding (#2751 ) * functional but janky * nit * adapt for azure * nit * minor updates * nits * nit * nit * ensure access to litellm * k	2024-10-15 23:23:11 +00:00
pablodanswer	d5b9a6e552	add vespa + embedding timeout env variables (#2689 ) * add vespa + embedding timeout env variables * nit: integration test * add dangerous override * k * add additional clarity * nit * nit	2024-10-09 03:20:28 +00:00
pablodanswer	3a9b964d5c	Add Litellm Rerank proxy (#2346 ) * add ability ot set reranking litellm proxy * add fully functional rerank litellm cards * minor formatting enforcement * remove logs	2024-09-09 15:57:01 +00:00
pablodanswer	2bd3833c55	Update search settings + chat/search handling (#2333 ) * validate web list * update search settings + chat/search handling * remove accidentally added search manager * minor build fix * push from local	2024-09-06 00:07:39 +00:00
pablodanswer	34ba3181ff	Update auth for litellm proxy (#2316 ) * update for auth * validated embedding model names * remove embedding provider * remove logs * add ability to delete search setting * add abiility to delete models + more streamlined API endpoints * remove upsert * minor typing fix * add connector utils	2024-09-04 20:59:07 +00:00
pablodanswer	299cb5035c	Add litellm proxy embeddings (#2291 ) * add litellm proxy * formatting * move `api_url` to cloud provider + nits * remove log * typing * quick tuyping fix * update LiteLLM selection logic * remove logs + validate functionality * rename proxy var * update path casing * remove pricing for custom models * functional values	2024-09-02 09:08:35 -07:00
pablodanswer	53387ab3eb	Simplify index and model name swap logic (#2188 )	2024-08-20 17:31:00 -07:00
pablodanswer	71c2b16a01	Pull out stripping of model suffix (#2175 )	2024-08-20 11:32:03 -07:00
Yuhong Sun	5ab4d94d94	Logging Level Update (#2165 )	2024-08-18 21:53:40 -07:00
pablodanswer	22573aba2a	Improve Search (#2105 )	2024-08-16 21:29:15 -07:00
Yuhong Sun	386b229ed3	Cohere Rerank (#2109 )	2024-08-11 14:22:42 -07:00
Yuhong Sun	ce666f3320	Propagate Embedding Enum (#2108 )	2024-08-11 12:17:54 -07:00
Yuhong Sun	8cd1eda8b1	Rework Rerankers (#2093 )	2024-08-08 21:33:49 -07:00
Weves	51731ad0dd	Fix issue where large docs/batches break openai embedding	2024-08-02 01:07:09 -07:00
hagen-danswer	1be1959d80	Changed default local model to nomic (#1943 )	2024-07-31 18:54:02 -07:00
Yuhong Sun	036d5c737e	No Null Embeddings (#1982 )	2024-07-30 19:54:49 -07:00
hagen-danswer	3938a053aa	Rework tokenizer (#1957 )	2024-07-29 23:01:49 -07:00
pablodanswer	48a0d29a5c	Fix empty / reverted embeddings (#1910 )	2024-07-23 22:41:31 -07:00
Yuhong Sun	6db4634871	Token Truncation (#1892 )	2024-07-21 16:26:32 -07:00
Yuhong Sun	5cfed45cef	Handle Empty Titles (#1891 )	2024-07-21 14:59:23 -07:00
Yuhong Sun	0e8ba111c8	Model Touchups (#1887 )	2024-07-21 12:31:00 -07:00
Yuhong Sun	44820b4909	k	2024-07-21 10:27:57 -07:00
hagen-danswer	eb3e7610fc	Added retries and multithreading for cloud embedding (#1879 ) * added retries and multithreading for cloud embedding * refactored a bit * cleaned up code * got the errors to bubble up to the ui correctly * added exceptin printing * added requirements * touchups --------- Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>	2024-07-20 22:10:18 -07:00
Yuhong Sun	4d295ab97d	Model Server Logging (#1839 )	2024-07-15 09:00:27 -07:00
Weves	6fe3eeaa48	Fix model serer startup	2024-07-14 23:33:58 -07:00
Weves	0d52e99bd4	Improve confluence rate limiting	2024-07-14 16:40:45 -07:00
pablodanswer	e7f81d1688	add third party embedding models (#1818 )	2024-07-14 10:19:53 -07:00
Yuhong Sun	b59912884b	Fix Model Server (#1320 )	2024-04-10 23:13:22 -07:00
Yuhong Sun	2db906b7a2	Always Use Model Server (#1306 )	2024-04-07 21:25:06 -07:00
Yuhong Sun	4b45164496	Background Index Attempt Creation (#1010 )	2024-01-28 23:14:20 -08:00
Yuhong Sun	c3cf9134bb	Telemetry Revision (#868 )	2023-12-24 17:39:37 -08:00
Yuhong Sun	7433dddac3	Model Server (#695 ) Provides the ability to pull out the NLP models into a separate model server which can then be hosted on a GPU instance if desired.	2023-11-06 16:36:09 -08:00

43 Commits