56 Commits

Author SHA1 Message Date
Yuhong Sun
e90c66c1b6
Include Titles in Chunks (#1817) 2024-07-12 09:42:24 -07:00
hagen-danswer
991ee79e47
some qol improvements for search pipeline (#1809) 2024-07-11 17:42:11 -07:00
Yuhong Sun
08c6e821e7
Merge Sections Logic (#1801) 2024-07-10 20:14:02 -07:00
pablodanswer
09a11b5e1a
Fix citations + unit tests (#1760) 2024-07-10 10:05:20 -07:00
Yuhong Sun
39982e5fdc
Info propagating to allow Chunk Merging (#1794) 2024-07-09 18:15:07 -07:00
Yuhong Sun
f0888f2f61
Eval Script Incremental Write (#1784) 2024-07-06 15:43:40 -07:00
Yuhong Sun
d35d7ee833
Evaluation Pipeline Touchup (#1783) 2024-07-06 13:17:05 -07:00
hagen-danswer
ac14369716
Added search quality testing pipeline (#1774) 2024-07-06 11:51:50 -07:00
Weves
415960564d
Fix fast models 2024-06-29 15:19:09 -07:00
Yuhong Sun
680482bd06
Metadata filter for document search API (#53) 2024-06-25 15:07:56 -07:00
Yuhong Sun
123ec4342a
Relari (#1687)
Also includes some bugfixes
2024-06-22 18:52:48 -07:00
Weves
b723627e0c
Ability to pass through headers to LLM call 2024-06-10 13:18:31 -07:00
Weves
97d058b8b2
Fix mypy for mediawiki tests 2024-05-25 17:16:47 -07:00
Andrew Sansom
94018e83b0
Add MediaWiki and Wikipedia Connectors (#1250)
* Add MediaWikiConnector first draft

* Add MediaWikiConnector first draft

* Add MediaWikiConnector first draft

* Add MediaWikiConnector sections for each document

* Add MediaWikiConnector to constants and factory

* Integrate MediaWikiConnector with connectors page

* Unit tests + bug fixes

* Allow adding multiple mediawikiconnectors

* add wikipedia connector

* add wikipedia connector to factory

* improve docstrings of mediawiki connector backend

* improve docstrings of mediawiki connector backend

* move wikipedia and mediawiki icon locations in admin page

* undo accidental commit of modified docker compose yaml
2024-05-24 08:51:20 -07:00
Yuhong Sun
546815dc8c
Consolidate File Processing (#1449) 2024-05-11 23:11:22 -07:00
Yuhong Sun
a17060af5a
Provide Additional Context for Chunk Options in APIs (#1330) 2024-04-14 18:32:22 -07:00
Weves
f135ba9c0c
Rework LLM answering flow 2024-03-25 13:34:03 -07:00
Weves
1ba74ee4df
Refactor search pipeline 2024-03-25 13:34:03 -07:00
Yuhong Sun
6768c24723
Default LLM Update (#1042) 2024-02-05 01:25:51 -08:00
Yuhong Sun
bb2b517124
Relari Test Script (#1033) 2024-02-02 09:50:48 -08:00
Yuhong Sun
4b45164496
Background Index Attempt Creation (#1010) 2024-01-28 23:14:20 -08:00
Itay
a4053501d0
CI: adding prettier to pre-commit (#1009) 2024-01-28 13:03:39 -08:00
Itay
0ce992e22e
CI: Run Python tests (#1001) 2024-01-28 12:59:51 -08:00
Yuhong Sun
cf4ede2130
Embedding Models Table (#1006) 2024-01-26 18:40:53 -08:00
Jeremi Joslin
d07647c597
Fix typo in gmail test connector (#981) 2024-01-24 12:01:26 -08:00
Itay
692fdb4597
Gmail Connector (#946)
---------

Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2024-01-22 16:25:10 -08:00
Yuhong Sun
c3cf9134bb
Telemetry Revision (#868) 2023-12-24 17:39:37 -08:00
Yuhong Sun
5957b888a5
DanswerBot Chat (#831) 2023-12-17 18:18:48 -08:00
Yuhong Sun
65fde8f1b3
Chat Backend (#801) 2023-12-14 22:14:37 -08:00
Yuhong Sun
02095e9281
Restructure APIs (#803) 2023-12-02 14:48:08 -08:00
Chris Weaver
78d1ae0379
Customizable personas (#772)
Also includes a small fix to LLM filtering when combined with reranking
2023-11-28 00:57:48 -08:00
Yuhong Sun
d291fea020
Turn off Reranking for Streaming Flows (#770) 2023-11-26 16:45:23 -08:00
Yuhong Sun
2665bff78e
Option to turn off LLM for eval script (#769) 2023-11-26 15:31:03 -08:00
Yuhong Sun
13001ede98
Search Regression Test and Save/Load State updates (#761) 2023-11-23 00:00:30 -08:00
Yuhong Sun
fda377a2fa
Regression Script for Search quality (#760) 2023-11-22 19:33:28 -08:00
Weves
08909b40b0
Add rate limiting wrapper + add to Document360 2023-10-29 18:00:17 -07:00
Yuhong Sun
26b491fb0c
Prep for Hybrid Search (#648) 2023-10-29 00:13:21 -07:00
Yuhong Sun
fe117513b0
Reorganize and Cleanup for Hybrid Search (#643) 2023-10-28 14:24:28 -07:00
Weves
3554e29b8d
Add updated_at to UI + add time range selector 2023-10-23 23:32:16 -07:00
Yuhong Sun
e279918f95
Introduce Time Filters (#610) 2023-10-22 15:06:52 -07:00
Yuhong Sun
6a449f1fb1
Introduce Recency Bias (#592) 2023-10-19 12:54:35 -07:00
Chris Weaver
1bd76f528f
Document explorer admin page (#590) 2023-10-18 18:41:39 -07:00
Yuhong Sun
a5d2759fbc
Recreate Tables from HTML (#588) 2023-10-18 11:16:40 -07:00
Yuhong Sun
595f61ea3a
Add Retrieval to Chat History (#577) 2023-10-15 13:40:07 -07:00
Yuhong Sun
b416c85f0f
Add Metrics to Regression Test (#470) 2023-09-20 20:42:02 -07:00
Yuhong Sun
6b305c56b3
Use Sentence Aware Splitter (#452) 2023-09-16 16:28:16 -07:00
Weves
cf2bd8a40c
highlighting 2023-09-12 11:35:37 -07:00
Yuhong Sun
b5fc2a5775
Regression Test (#434) 2023-09-11 19:06:01 -07:00
Yuhong Sun
ec4d0b856c
Added boost to rerank step (#360) 2023-08-30 23:12:55 -07:00
Yuhong Sun
2a339ec34b
Prevent too many tokens to GPT (#245) 2023-07-28 16:00:26 -07:00