danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-04-05 10:28:26 +02:00

Author	SHA1	Message	Date
Richard Kuo (Onyx)	d7063e0a1d	expose acl link feature in onyx_vespa	2025-04-01 16:19:50 -07:00
pablonyx	3a3b2a2f8d	add user files (#4152)	2025-04-01 16:19:44 -07:00
evan-danswer	56f8ab927b	Contextual Retrieval (#4029) * contextual rag implementation * WIP * indexing test fix * workaround for chunking errors, WIP on fixing massive memory cost * mypy and test fixes * reformatting * fixed rebase	2025-03-30 18:49:09 +00:00
rkuo-danswer	aab777f844	Bugfix/acl prefix (#4377) * fix acl prefixing * increase timeout a tad * block access to init'ing DocumentAccess directly, fix test to work with ee/MIT * fix env var checks --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>	2025-03-28 05:52:35 +00:00
Weves	61366df34c	Add execute permission	2025-03-18 12:03:32 -07:00
Chris Weaver	1a444245f6	Memory tracking script (#4297) * Add simple container-level memory tracking script	2025-03-18 12:00:09 -07:00
joachim-danswer	463340b8a1	Reduce ranking scores for short chunks without actual information (#4098) * remove title for slack * initial working code * simplification * improvements * name change to information_content_model * avoid boost_score > 1.0 * nit * EL comments and improvements Improvements: - proper import of information content model from cache or HF - warm up for information content model Other: - EL PR review comments * nit * requirements version update * fixed docker file * new home for model_server configs * default off * small updates * YS comments - pt 1 * renaming to chunk_boost & chunk table def * saving and deleting chunk stats in new table * saving and updating chunk stats * improved dict score update * create columns for individual boost factors * RK comments * Update migration * manual import reordering	2025-03-13 17:35:45 +00:00
pablonyx	ecbd4eb1ad	add basic user invite flow (#4253)	2025-03-11 19:02:51 +00:00
rkuo-danswer	a7acc07e79	fix usage report pagination (#4183) * early work in progress * rename utility script * move actual data seeding to a shareable function * add test * make the test pass with the fix * fix comment --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-03-05 19:13:51 +00:00
pablonyx	20f2b9b2bb	Add image support for search (#4090) * add support for image search * quick fix up * k * k * k * k * nit * quick fix for connector tests	2025-03-05 17:44:18 +00:00
Chris Weaver	f25e1e80f6	Add option to not re-index (#4157) * Add option to not re-index * Add quantizaton / dimensionality override support * Fix build / ut	2025-03-03 10:54:11 -08:00
pablonyx	a98dcbc7de	Update tenant logic (#4122) * k * k * k * quick nit * nit	2025-02-26 03:53:46 +00:00
pablonyx	7d40676398	Heavy task improvements, logging, and validation (#4058)	2025-02-24 13:48:53 -08:00
rkuo-danswer	61d536c782	tool fixes (#4075)	2025-02-21 12:30:33 -08:00
pablonyx	47fd4fa233	Strict Tenant ID Enforcement (#3871) * strict tenant id enforcement * k * k * nit * merge * nit * k	2025-02-19 00:52:56 +00:00
Chris Weaver	f1fc8ac19b	Connector checkpointing (#3876) * wip checkpointing/continue on failure more stuff for checkpointing Basic implementation FE stuff More checkpointing/failure handling rebase rebase initial scaffolding for IT IT to test checkpointing Cleanup cleanup Fix it Rebase Add todo Fix actions IT Test more Pagination + fixes + cleanup Fix IT networking fix it * rebase * Address misc comments * Address comments * Remove unused router * rebase * Fix mypy * Fixes * fix it * Fix tests * Add drop index * Add retries * reset lock timeout * Try hard drop of schema * Add timeout/retries to downgrade * rebase * test * test * test * Close all connections * test closing idle only * Fix it * fix * try using null pool * Test * fix * rebase * log * Fix * apply null pool * Fix other test * Fix quality checks * Test not using the fixture * Fix ordering * fix test * Change pooling behavior	2025-02-16 02:34:39 +00:00
pablonyx	b70db15622	Bugfix Vespa Deletion Script (#3998)	2025-02-13 17:26:04 -08:00
pablonyx	c6434db7eb	Add delete all for tenants in Vespa (#3970)	2025-02-13 14:33:49 -08:00
pablodanswer	a202e2bf9d	Improvements to Redis + Vespa debugging	2025-02-06 13:30:06 -08:00
pablonyx	4affc259a6	Password reset tenant (#3895) * nots * functional * minor naming cleanup * nit * update constant * k	2025-02-05 03:17:11 +00:00
pablodanswer	125e5eaab1	various mypy improvements	2025-02-04 12:06:10 -08:00
rkuo-danswer	4fe99d05fd	add timings for syncing (#3798) * add timings for syncing * add more logging * more debugging * refactor multipass/db check out of VespaIndex * circular imports? * more debugging * add logs * various improvements * additional logs to narrow down issue * use global httpx pool for the main vespa flows in celery. Use in more places eventually. * cleanup debug logging, etc * remove debug logging * this should use the secondary index * mypy * missed some logging * review fixes * refactor get_default_document_index to use search settings * more missed logging * fix circular refs --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app> Co-authored-by: pablodanswer <pablo@danswer.ai>	2025-01-29 23:24:44 +00:00
pablonyx	7f10494bbe	Better vespa interface (#3781) * k * much cleaner vespa util class * log * typing * improvement * improve	2025-01-26 21:22:44 +00:00
Yuhong Sun	cbf98c0128	Fix Seeding Link for Support Use Case (#3784)	2025-01-25 19:39:36 -08:00
pablonyx	2a1bb4ac41	Vespa scripts + Redis script update (#3758) * update onyx redis script * looking good * simplify comments * remove unnecessary apps option * iterate * fix typing	2025-01-23 23:46:17 +00:00
pablonyx	ccb16b7484	Indexing latency check fix (#3747) * add logs + update dev script * update conig * remove prints * temporarily turn off * va * update * fix * finalize monitoring updates * update	2025-01-23 17:14:26 +00:00
skylares	af953ff8a3	Paginate Query History table (#3592) * Add pagination for query history table * Fix method name * Fix mypy	2025-01-17 15:31:42 -08:00
hagen-danswer	b1957737f2	refactored _add_user_filter usage (#3674) * refactored db.connector_credential_pair * Rerfactored the db.credentials user filtering * the restr	2025-01-14 23:35:52 +00:00
rkuo-danswer	2ae91f0f2b	Feature/redis prod tool (#3619) * prototype tools for handling prod issues * add some commands * add batching and dry run options * custom redis tool * comment * default to app config settings for redis --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-01-09 21:34:07 +00:00
pablonyx	d7bc32c0ec	Fully remove visit API (#3621) * v1 * update indexing logic * update updates * nit * clean up args * update for clarity + best practices * nit + logs * fix * minor clean up * remove logs * quick nit	2025-01-08 13:49:01 -08:00
pablonyx	ddec239fef	Improved indexing (#3594) * nit * k * add steps * main util functions * functioning fully * quick nit * k * typing fix * k * address comments	2025-01-05 23:31:53 +00:00
Chris Weaver	f895e5f7d0	Speedup orphan doc cleanup script (#3596) * Speedup orphan doc cleanup script * Fix mypy	2025-01-05 14:28:25 +00:00
skylares	c191e23256	Pagination Hook (#3494) * Backend changes for pagination hook + Paginated users table * Frontend changes for pagination hook * Fix invited users endpoint * Fix layout shift & add enter to submit user invites * mypy * Cleanup * Resolve PR concerns & remove UserStatus * Fix errors --------- Co-authored-by: hagen-danswer <hagen@danswer.ai>	2025-01-03 14:32:55 -08:00
Richard Kuo	3eb72e5c1d	Revert "More efficient Vespa indexing (#3552)" This reverts commit 27832167819f063bc2f4436765ff27f0ccba2b64.	2024-12-31 09:40:23 -08:00
pablonyx	2783216781	More efficient Vespa indexing (#3552) --------- Co-authored-by: Chris Weaver <25087905+Weves@users.noreply.github.com>	2024-12-30 18:51:14 -08:00
Yuhong Sun	814f97c2c7	MT Cloud Monitoring (#3465)	2024-12-15 16:05:03 -08:00
pablodanswer	21ec5ed795	welcome to onyx	2024-12-13 09:56:10 -08:00
hagen-danswer	ef9942b751	Related permission docs to cc_pair to prevent orphan docs (#3336) * Related permission docs to cc_pair to prevent orphan docs * added script * group sync deduping * logging	2024-12-04 21:00:54 +00:00
Weves	b01a1b509a	Add basic loadtest script	2024-12-04 10:53:48 -08:00
Yuhong Sun	62a4aa10db	Refactor Search (#3233)	2024-11-23 13:42:54 -08:00
hagen-danswer	fdc4811fce	doc sync celery refactor (#3084) * doc_sync is refactored * maybe this works * tested to work! * mypy fixes * enabled integration tests * fixed the test * added external group sync * testing should work now * mypy * confluence doc id fix * got group sync working * addressed feedback * renamed some vars and fixed mypy * conf fix? * added wiki handling to confluence connector * test fixes * revert google drive connector * fixed groups * hotfix	2024-11-12 23:57:14 +00:00
Yuhong Sun	55919f596c	PG Dev Max Connections (#3082)	2024-11-07 11:51:23 -08:00
Richard Kuo (Danswer)	0ed77aa8a7	Merge branch 'main' of https://github.com/danswer-ai/danswer into feature/reset_indexes	2024-10-25 12:00:25 -07:00
Richard Kuo (Danswer)	84d551eda4	Merge branch 'patch-1' of https://github.com/Yash-2707/danswer into feature/reset_indexes	2024-10-25 09:35:45 -07:00
Chris Weaver	4a47e9a841	Add strict json mode (#2917)	2024-10-24 22:38:46 -07:00
Yuhong Sun	b49a9ab171	Seeding (#2902) * checkpoint * k * k * k * fixed slack api calls * missed one --------- Co-authored-by: hagen-danswer <hagen@danswer.ai>	2024-10-24 23:45:48 +00:00
rkuo-danswer	2b9a751b96	working chat feedback dump script (with api addition) (#2891) * working chat feedback dump script (with api addition) * mypy fix * comment out pydantic models (but leave for reference) * small code review tweaks * bump to clear vercel issue?	2024-10-24 19:50:09 +00:00
pablodanswer	14e75bbd24	add default schema config (#2888) * add default schema config * resolve circular import * k	2024-10-23 23:12:17 +00:00
rkuo-danswer	9105f95d13	Feature/celery refactor (#2813) * fresh indexing feature branch * cherry pick test * Revert "cherry pick test" This reverts commit 2a624220687affdda3de347e30f2011136f64bda. * set multitenant so that vespa fields match when indexing * cleanup pass * mypy * pass through env var to control celery indexing concurrency * comments on task kickoff and some logging improvements * disentangle configuration for different workers and beats. * use get_session_with_tenant * comment out all of update.py * rename to RedisConnectorIndexingFenceData * first check num_indexing_workers * refactor RedisConnectorIndexingFenceData * comment out on_worker_process_init * missed a file * scope db sessions to short lengths * update launch.json template * fix types * code review	2024-10-22 22:57:36 +00:00
Yuhong Sun	eccec6ab7c	Notion Fix Nested Properties (#2877)	2024-10-22 14:10:31 -07:00


132 Commits