19 Commits

Author SHA1 Message Date
joachim-danswer
463340b8a1
Reduce ranking scores for short chunks without actual information (#4098)
* remove title for slack

* initial working code

* simplification

* improvements

* name change to information_content_model

* avoid boost_score > 1.0

* nit

* EL comments and improvements

Improvements:
  - proper import of information content model from cache or HF
  - warm up for information content model

Other:
  - EL PR review comments

* nit

* requirements version update

* fixed docker file

* new home for model_server configs

* default off

* small updates

* YS comments - pt 1

* renaming to chunk_boost & chunk table def

* saving and deleting chunk stats in new table

* saving and updating chunk stats

* improved dict score update

* create columns for individual boost factors

* RK comments

* Update migration

* manual import reordering
2025-03-13 17:35:45 +00:00
Richard Kuo (Danswer)
fb931ee4de fixes 2025-02-07 17:28:17 -08:00
Richard Kuo (Danswer)
bc2c56dfb6 improve gpu detection functions and logging in model server 2025-02-07 16:59:02 -08:00
pablonyx
27acd3387a
Auth specific rate limiting (#3463)
* k

* v1

* fully functional

* finalize

* nit

* nit

* nit

* clean up with wrapper + comments

* k

* update

* minor clean
2024-12-29 23:34:23 +00:00
Emerson Gomes
b5d0df90b9 Remove hardcoded root path for HF models 2024-12-16 19:03:15 -08:00
pablodanswer
21ec5ed795 welcome to onyx 2024-12-13 09:56:10 -08:00
pablodanswer
848d86b886
feat: sentry updates (#2929) 2024-10-26 19:06:46 +00:00
pablodanswer
61424de531
add sentry (#2786)
* add sentry

* nit

* nit

* add requirement to ee

* try to ensure sentry is installed in integration tests
2024-10-17 23:20:37 +00:00
Yuhong Sun
8de04acb7f k 2024-08-20 14:06:49 -07:00
Yuhong Sun
5ab4d94d94
Logging Level Update (#2165) 2024-08-18 21:53:40 -07:00
hagen-danswer
bbb8c5ff0b
Speed up docker launch (#2099)
* use move instead of copy

* added logging

* fix overwrites

* tested thoroughly

* fixes

* clearer commenting
2024-08-13 00:45:05 +00:00
Yuhong Sun
386b229ed3
Cohere Rerank (#2109) 2024-08-11 14:22:42 -07:00
Yuhong Sun
8cd1eda8b1
Rework Rerankers (#2093) 2024-08-08 21:33:49 -07:00
hagen-danswer
1be1959d80
Changed default local model to nomic (#1943) 2024-07-31 18:54:02 -07:00
Yuhong Sun
e361e92230
Healthcheck for model server (#1350) 2024-04-18 16:22:38 -07:00
Yuhong Sun
b59912884b
Fix Model Server (#1320) 2024-04-10 23:13:22 -07:00
Yuhong Sun
2db906b7a2
Always Use Model Server (#1306) 2024-04-07 21:25:06 -07:00
Yuhong Sun
4b45164496
Background Index Attempt Creation (#1010) 2024-01-28 23:14:20 -08:00
Yuhong Sun
7433dddac3
Model Server (#695)
Provides the ability to pull out the NLP models into a separate model server which can then be hosted on a GPU instance if desired.
2023-11-06 16:36:09 -08:00