danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-03-18 22:01:55 +01:00

Author	SHA1	Message	Date
pablodanswer	d5b9a6e552	add vespa + embedding timeout env variables (#2689 ) * add vespa + embedding timeout env variables * nit: integration test * add dangerous override * k * add additional clarity * nit * nit	2024-10-09 03:20:28 +00:00
pablodanswer	3a9b964d5c	Add Litellm Rerank proxy (#2346 ) * add ability ot set reranking litellm proxy * add fully functional rerank litellm cards * minor formatting enforcement * remove logs	2024-09-09 15:57:01 +00:00
Shukant Pal	362156f97e	Model inference for connector classifier on queries (#2137 )	2024-09-08 14:46:00 -07:00
pablodanswer	2bd3833c55	Update search settings + chat/search handling (#2333 ) * validate web list * update search settings + chat/search handling * remove accidentally added search manager * minor build fix * push from local	2024-09-06 00:07:39 +00:00
pablodanswer	34ba3181ff	Update auth for litellm proxy (#2316 ) * update for auth * validated embedding model names * remove embedding provider * remove logs * add ability to delete search setting * add abiility to delete models + more streamlined API endpoints * remove upsert * minor typing fix * add connector utils	2024-09-04 20:59:07 +00:00
pablodanswer	299cb5035c	Add litellm proxy embeddings (#2291 ) * add litellm proxy * formatting * move `api_url` to cloud provider + nits * remove log * typing * quick tuyping fix * update LiteLLM selection logic * remove logs + validate functionality * rename proxy var * update path casing * remove pricing for custom models * functional values	2024-09-02 09:08:35 -07:00
Yuhong Sun	3b035d791e	Fix Model Server (#2191 )	2024-08-20 17:57:09 -07:00
pablodanswer	53387ab3eb	Simplify index and model name swap logic (#2188 )	2024-08-20 17:31:00 -07:00
Yuhong Sun	8de04acb7f	k	2024-08-20 14:06:49 -07:00
pablodanswer	71c2b16a01	Pull out stripping of model suffix (#2175 )	2024-08-20 11:32:03 -07:00
Yuhong Sun	5ab4d94d94	Logging Level Update (#2165 )	2024-08-18 21:53:40 -07:00
pablodanswer	22573aba2a	Improve Search (#2105 )	2024-08-16 21:29:15 -07:00
hagen-danswer	61b5bd569b	Reworked chunking to support mega chunks (#2032 )	2024-08-14 22:18:53 -07:00
Yuhong Sun	1c10f54294	GPU Model Server (#2135 )	2024-08-14 11:04:28 -07:00
hagen-danswer	bbb8c5ff0b	Speed up docker launch (#2099 ) * use move instead of copy * added logging * fix overwrites * tested throughly * fixes * clearer commenting	2024-08-13 00:45:05 +00:00
Yuhong Sun	386b229ed3	Cohere Rerank (#2109 )	2024-08-11 14:22:42 -07:00
Yuhong Sun	ce666f3320	Propagate Embedding Enum (#2108 )	2024-08-11 12:17:54 -07:00
Yuhong Sun	8cd1eda8b1	Rework Rerankers (#2093 )	2024-08-08 21:33:49 -07:00
Yuhong Sun	7dcc42aa95	Intent Model Update (#2069 )	2024-08-08 20:45:53 -07:00
Weves	51731ad0dd	Fix issue where large docs/batches break openai embedding	2024-08-02 01:07:09 -07:00
hagen-danswer	1be1959d80	Changed default local model to nomic (#1943 )	2024-07-31 18:54:02 -07:00
Yuhong Sun	036d5c737e	No Null Embeddings (#1982 )	2024-07-30 19:54:49 -07:00
hagen-danswer	3938a053aa	Rework tokenizer (#1957 )	2024-07-29 23:01:49 -07:00
pablodanswer	48a0d29a5c	Fix empty / reverted embeddings (#1910 )	2024-07-23 22:41:31 -07:00
Yuhong Sun	6db4634871	Token Truncation (#1892 )	2024-07-21 16:26:32 -07:00
Yuhong Sun	5cfed45cef	Handle Empty Titles (#1891 )	2024-07-21 14:59:23 -07:00
Yuhong Sun	0e8ba111c8	Model Touchups (#1887 )	2024-07-21 12:31:00 -07:00
Yuhong Sun	44820b4909	k	2024-07-21 10:27:57 -07:00
hagen-danswer	eb3e7610fc	Added retries and multithreading for cloud embedding (#1879 ) * added retries and multithreading for cloud embedding * refactored a bit * cleaned up code * got the errors to bubble up to the ui correctly * added exceptin printing * added requirements * touchups --------- Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>	2024-07-20 22:10:18 -07:00
Yuhong Sun	4d295ab97d	Model Server Logging (#1839 )	2024-07-15 09:00:27 -07:00
Weves	6fe3eeaa48	Fix model serer startup	2024-07-14 23:33:58 -07:00
Weves	0d52e99bd4	Improve confluence rate limiting	2024-07-14 16:40:45 -07:00
pablodanswer	e7f81d1688	add third party embedding models (#1818 )	2024-07-14 10:19:53 -07:00
Yuhong Sun	e361e92230	Healthcheck for model server (#1350 )	2024-04-18 16:22:38 -07:00
Yuhong Sun	b59912884b	Fix Model Server (#1320 )	2024-04-10 23:13:22 -07:00
Yuhong Sun	2db906b7a2	Always Use Model Server (#1306 )	2024-04-07 21:25:06 -07:00
Yuhong Sun	4b45164496	Background Index Attempt Creation (#1010 )	2024-01-28 23:14:20 -08:00
Chris Weaver	e94fd8b022	Remove un-needed imports (#999 )	2024-01-25 12:10:19 -08:00
Yuhong Sun	2e6be57880	Model Server CVEs (#898 )	2023-12-29 21:14:08 -08:00
Yuhong Sun	c3cf9134bb	Telemetry Revision (#868 )	2023-12-24 17:39:37 -08:00
Yuhong Sun	7433dddac3	Model Server (#695 ) Provides the ability to pull out the NLP models into a separate model server which can then be hosted on a GPU instance if desired.	2023-11-06 16:36:09 -08:00

41 Commits