danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-03-18 22:01:55 +01:00

Author	SHA1	Message	Date
pablodanswer	b74f5ae46d	nit	2024-10-26 14:45:47 -07:00
pablodanswer	d0c4c8b172	add tenantJWTSrategy	2024-10-26 14:44:57 -07:00
pablodanswer	9b147ae437	Tenant integration tests (#2913 ) * check for index swap * initial bones * kk * k * k: * nit * nit * rebase + update * nit * minior update * k * minor integration test fixes * nit * ensure we build test docker image * remove one space * k * ensure we wipe volumes * remove log * typo * nit * k * k	2024-10-25 18:47:17 +00:00
Chris Weaver	4a47e9a841	Add strict json mode (#2917 )	2024-10-24 22:38:46 -07:00
Yuhong Sun	b49a9ab171	Seeding (#2902 ) * checkpoint * k * k * k * fixed slack api calls * missed one --------- Co-authored-by: hagen-danswer <hagen@danswer.ai>	2024-10-24 23:45:48 +00:00
rkuo-danswer	2b9a751b96	working chat feedback dump script (with api addition) (#2891 ) * working chat feedback dump script (with api addition) * mypy fix * comment out pydantic models (but leave for reference) * small code review tweaks * bump to clear vercel issue?	2024-10-24 19:50:09 +00:00
pablodanswer	0545fb4443	Multitenant redis update (#2889 ) * add multi tenancy to redis * rename context var * k * args -> kwargs * minor update to kv interface * robustify	2024-10-24 02:12:25 +00:00
pablodanswer	14e75bbd24	add default schema config (#2888 ) * add default schema config * resolve circular import * k	2024-10-23 23:12:17 +00:00
pablodanswer	8b72264535	Gating Notifications (#2868 ) * functional notifications * typing * minor * ports * nit * verify functionality * pretty	2024-10-23 20:20:20 +00:00
rkuo-danswer	9105f95d13	Feature/celery refactor (#2813 ) * fresh indexing feature branch * cherry pick test * Revert "cherry pick test" This reverts commit 2a624220687affdda3de347e30f2011136f64bda. * set multitenant so that vespa fields match when indexing * cleanup pass * mypy * pass through env var to control celery indexing concurrency * comments on task kickoff and some logging improvements * disentangle configuration for different workers and beats. * use get_session_with_tenant * comment out all of update.py * rename to RedisConnectorIndexingFenceData * first check num_indexing_workers * refactor RedisConnectorIndexingFenceData * comment out on_worker_process_init * missed a file * scope db sessions to short lengths * update launch.json template * fix types * code review	2024-10-22 22:57:36 +00:00
hagen-danswer	914da2e4cb	Confluence polish (#2874 )	2024-10-22 20:41:47 +00:00
hagen-danswer	802086ee57	Refactored Confluence Connector (#2859 ) * Refactored Confluence Connector * rename metadataconnector to slimconnector Finish rename * danswer->onyx * added rec * typo * refactored doc_sync for confluence * mypy + enable tests * tested and fixed for confluence cloud * fixed all server syncing * fixed connector test * mypy+connector test fixes * addressed richards comments * minor fix	2024-10-21 23:03:40 +00:00
pablodanswer	a24b465663	Minor tenant ID improvements (#2850 ) * add migration dockerfile * address edge case * k * k * k * nit * k * k * k * k * remove * k * add comment	2024-10-20 23:48:00 +00:00
rkuo-danswer	457e7992a4	missing tenant_id as optional param (#2851 ) Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-10-19 21:10:42 +00:00
rkuo-danswer	6913efef90	fresh indexing feature branch (#2790 ) * fresh indexing feature branch * cherry pick test * Revert "cherry pick test" This reverts commit 2a624220687affdda3de347e30f2011136f64bda. * set multitenant so that vespa fields match when indexing * cleanup pass * mypy * pass through env var to control celery indexing concurrency * comments on task kickoff and some logging improvements * use get_session_with_tenant * comment out all of update.py * rename to RedisConnectorIndexingFenceData * first check num_indexing_workers * refactor RedisConnectorIndexingFenceData * comment out on_worker_process_init * fix where num_indexing_workers falls back * remove extra brace	2024-10-18 22:40:05 +00:00
rkuo-danswer	5b78299880	use native rate limiting in the confluence client (#2837 ) * use native rate limiting in the confluence client * upgrade urllib3 to v2.2.3 to support retries in confluence client * improve logging so that progress is visible.	2024-10-18 18:15:43 +00:00
pablodanswer	a159779d39	prevent alembic from configuring logger (#2826 ) * k * k	2024-10-17 16:31:17 +00:00
hagen-danswer	938a65628d	rearrange logging	2024-10-17 09:01:51 -07:00
hagen-danswer	5d390b65eb	Added logging for when a member has no email or username	2024-10-17 08:47:46 -07:00
pablodanswer	db0779dd02	Session id: int -> UUID (#2814 ) * session id: int -> UUID * nit * validated * validated downgrade + upgrade + all functionality * nit * minor nit * fix test case	2024-10-16 22:18:45 +00:00
pablodanswer	1a9921f63e	Redirect with query param (#2811 ) * validated * k * k * k * minor update	2024-10-16 17:26:44 +00:00
pablodanswer	bfe963988e	various multi tenant improvements (#2803 ) * various multi tenant improvements * nit * ensure consistent db session operations * minor robustification	2024-10-15 20:10:57 +00:00
hagen-danswer	494fda906d	Confluence permission sync fix for server deployment (#2784 ) * initial commit * Made perm sync with with cql * filter fix * undo connector changes * fixed everything * whoops	2024-10-14 20:52:57 +00:00
pablodanswer	20df20ae51	Multi tenant vespa (#2762 ) * add vespa multi tenancy * k * formatting * Billing (#2667) * k * data -> control * nit * nit: error handling * auth + app * nit: color standardization * nit * nit: typing * k * k * feat: functional upgrading * feat: add block for downgrading to seats < active users * add auth * remove accomplished todo + prints * nit * tiny nit * nit: centralize security * add tenant expulsion/gating + invite user -> increment billing seat no. * add cloud configs * k * k * nit: update * k * k * k * k * nit	2024-10-12 23:53:11 +00:00
hagen-danswer	101b010c5c	Improved logging and added comments (#2763 ) * Improved logging and added comments * fix exception logging * cleanup	2024-10-10 17:37:27 +00:00
pablodanswer	f40c5ca9bd	Add tenant context (#2596 ) * add proper tenant context to background tasks * update for new session logic * remove unnecessary functions * add additional tenant context * update ports * proper format / directory structure * update ports * ensure tenant context properly passed to ee bg tasks * add user provisioning * nit * validated for multi tenant * auth * nit * nit * nit * nit * validate pruning * evaluate integration tests * at long last, validated celery beat * nit: minor edge case patched * minor * validate update * nit	2024-10-10 16:34:32 +00:00
hagen-danswer	804de3248e	google drive permission sync cleanup (#2749 )	2024-10-09 21:17:22 +00:00
rkuo-danswer	3404c7eb1d	Feature/background prune 2 (#2583 ) * first cut at redis * some new helper functions for the db * ignore kombu tables in alembic migrations (used by celery) * multiline commands for readability, add vespa_metadata_sync queue to worker * typo fix * fix returning tuple fields * add constants * fix _get_access_for_document * docstrings! * fix double function declaration and typing * fix type hinting * add a global redis pool * Add get_document function * use task_logger in various celery tasks * add celeryconfig.py to simplify configuration. Will be used in a subsequent commit * Add celery redis helper. used in a subsequent PR * kombu warning getting spammy since celery is not self managing its queue in Postgres any more * add last_modified and last_synced to documents * fix task naming convention * use celeryconfig.py * the big one. adds queues and tasks, updates functions to use the queues with priorities, etc * change vespa index log line to debug * mypy fixes * update alembic migration * fix fence ordering, rename to "monitor", fix fetch_versioned_implementation call * mypy * switch to monotonic time * fix startup dependencies on redis * rebase alembic migration * kombu cleanup - fail silently * mypy * add redis_host environment override * update REDIS_HOST env var in docker-compose.dev.yml * update the rest of the docker files * in flight * harden indexing-status endpoint against db changes happening in the background. Needs further improvement but OK for now. * allow no task syncs to run because we create certain objects with no entries but initially marked as out of date * add back writing to vespa on indexing * actually working connector deletion * update contributing guide * backporting fixes from background_deletion * renaming cache to cache_volume * add redis password to various deployments * try setting up pr testing for helm * fix indent * hopefully this release version actually exists * fix command line option to --chart-dirs * fetch-depth 0 * edit values.yaml * try setting ct working directory * bypass testing only on change for now * move files and lint them * update helm testing * some issues suggest using --config works * add vespa repo * add postgresql repo * increase timeout * try amd64 runner * fix redis password reference * add comment to helm chart testing workflow * rename helm testing workflow to disable it * adding clarifying comments * address code review * missed a file * remove commented warning ... just not needed * fix imports * refactor to use update_single * mypy fixes * add vespa test * multiple celery workers * update logs as well and set prefetch multipliers appropriate to the worker intent * add db refresh to connector deletion * add some preliminary locking * organize tasks into separate files * celery auto associates tasks created inside another task, which bloats the result metadata considerably. trail=False prevents this. * code review fixes * move monitor_usergroup_taskset to ee, improve logging * add multi workers to dev_run_background_jobs.py * update supervisord with some recommended settings for celery * name celery workers and shorten dev script prefixing * add configurable sql alchemy engine settings on startup (needed for various intents like API server, different celery workers and tasks, etc) * fix comments * autoscale sqlalchemy pool size to celery concurrency (allow override later?) * supervisord needs the percent symbols escaped * use name as primary check, some minor refactoring and type hinting too. * stash merge (may not function yet) * remove dead code * more cleanup * remove dead file * we shouldn't be checking for deletion attempts in the db any more * print cc_pair_id * print status on status mismatch again * add logging when cc_pair isn't present * don't indexing any ingestion type connectors, and don't pause any connectors that aren't active * add more specific check for deletion completion * remove flaky mediawiki test site * move is_pruning * remove unused code * remove old function --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-10-07 18:16:17 +00:00
pablodanswer	0da736bed9	Tenant provisioning in the dataplane (#2694 ) * add tenant provisioning to data plane * minor typing update * ensure tenant router included * proper auth check * update disabling logic * validated basic provisioning * use new kv store	2024-10-06 04:08:35 +00:00
hagen-danswer	c2088602e1	Implement source testing framework + Slack (#2650 ) * Added permission sync tests for Slack * moved folders * prune test + mypy * added wait for indexing to cc_pair creation * commented out check * should fix other tests * added slack channel pool * fixed everything and mypy * reduced flake	2024-10-02 23:16:07 +00:00
Chris Weaver	b3c367d09c	[tiny] adjust user group sync log (#2664 )	2024-10-02 18:01:40 +00:00
Yuhong Sun	fffb9c155a	Redis Cache for KV Store (#2603 ) * k * k * k * k	2024-10-01 18:31:18 +00:00
hagen-danswer	b0056907fb	Added permissions syncing for slack (#2602 ) * Added permissions syncing for slack * add no email case handling * mypy fixes * frontend * minor cleanup * param tweak	2024-09-30 15:14:43 +00:00
hagen-danswer	1cff2b82fd	Global Curator Fix + Testing (#2591 ) * Global Curator Fix * test fix	2024-09-28 20:14:39 +00:00
rkuo-danswer	fbf51b70d0	Feature/celery multi (#2470 ) * first cut at redis * some new helper functions for the db * ignore kombu tables in alembic migrations (used by celery) * multiline commands for readability, add vespa_metadata_sync queue to worker * typo fix * fix returning tuple fields * add constants * fix _get_access_for_document * docstrings! * fix double function declaration and typing * fix type hinting * add a global redis pool * Add get_document function * use task_logger in various celery tasks * add celeryconfig.py to simplify configuration. Will be used in a subsequent commit * Add celery redis helper. used in a subsequent PR * kombu warning getting spammy since celery is not self managing its queue in Postgres any more * add last_modified and last_synced to documents * fix task naming convention * use celeryconfig.py * the big one. adds queues and tasks, updates functions to use the queues with priorities, etc * change vespa index log line to debug * mypy fixes * update alembic migration * fix fence ordering, rename to "monitor", fix fetch_versioned_implementation call * mypy * switch to monotonic time * fix startup dependencies on redis * rebase alembic migration * kombu cleanup - fail silently * mypy * add redis_host environment override * update REDIS_HOST env var in docker-compose.dev.yml * update the rest of the docker files * in flight * harden indexing-status endpoint against db changes happening in the background. Needs further improvement but OK for now. * allow no task syncs to run because we create certain objects with no entries but initially marked as out of date * add back writing to vespa on indexing * actually working connector deletion * update contributing guide * backporting fixes from background_deletion * renaming cache to cache_volume * add redis password to various deployments * try setting up pr testing for helm * fix indent * hopefully this release version actually exists * fix command line option to --chart-dirs * fetch-depth 0 * edit values.yaml * try setting ct working directory * bypass testing only on change for now * move files and lint them * update helm testing * some issues suggest using --config works * add vespa repo * add postgresql repo * increase timeout * try amd64 runner * fix redis password reference * add comment to helm chart testing workflow * rename helm testing workflow to disable it * adding clarifying comments * address code review * missed a file * remove commented warning ... just not needed * fix imports * refactor to use update_single * mypy fixes * add vespa test * multiple celery workers * update logs as well and set prefetch multipliers appropriate to the worker intent * add db refresh to connector deletion * add some preliminary locking * organize tasks into separate files * celery auto associates tasks created inside another task, which bloats the result metadata considerably. trail=False prevents this. * code review fixes * move monitor_usergroup_taskset to ee, improve logging * add multi workers to dev_run_background_jobs.py * update supervisord with some recommended settings for celery * name celery workers and shorten dev script prefixing * add configurable sql alchemy engine settings on startup (needed for various intents like API server, different celery workers and tasks, etc) * fix comments * autoscale sqlalchemy pool size to celery concurrency (allow override later?) * supervisord needs the percent symbols escaped * use name as primary check, some minor refactoring and type hinting too. * addressing code review * fix import * fix prune_documents_task references --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-09-27 00:50:55 +00:00
hagen-danswer	b97cc01bb2	Added confluence permission syncing (#2537 ) * Added confluence permission syncing * seperated out group and doc syncing * minorbugfix and mypy * added frontend and fixed bug * Minor refactor * dealth with confluence rate limits! * mypy fixes!!! * addressed yuhong feedback * primary key fix	2024-09-26 22:10:41 +00:00
hagen-danswer	b73d66c84a	Cleaned up foreign key cleanup for user group deletion (#2559 ) * cleaned up fk cleanup for user group deletion * added test for user group deletion	2024-09-26 03:38:01 +00:00
hagen-danswer	8cfe80c53a	Added doc_set__user_group cleanup for user_group deletion (#2551 )	2024-09-24 16:09:52 +00:00
ThomaciousD	487250320b	fix saml email login upsert issue	2024-09-24 07:42:08 -07:00
Chris Weaver	34c2aa0860	Support svg navigation items (#2542 ) * Support SVG nav items * Handle specifying custom SVGs for navbar * Add comment * More comment * More comment	2024-09-23 13:22:20 -07:00
pablodanswer	18c62a0c24	Add additional custom tooling configuration (#2426 ) * add custom headers * add tool seeding * squash * tmep * validated * rm * update typing * update alembic * update import name * reformat * alembic	2024-09-20 23:12:52 +00:00
hagen-danswer	16d1c19d9f	Added bool to disable chat_session_id check for search_docs for api	2024-09-19 17:36:46 -07:00
pablodanswer	8a8e2b310e	Assistants panel rework (#2509 ) * update user model * squash - update assistant gallery * rework assistant display logic + ux * update tool + assistant display * update a couple function names * update typing + some logic * remove unnecessary comments * finalize functionality * updated logic * fully functional * remove logs + ports * small update to logic * update typing * allow seeding of display priority * reorder migrations * update for alembic	2024-09-19 23:36:15 +00:00
hagen-danswer	2274cab554	Added permission syncing (#2340 ) * Added permission syncing on the backend * Rewored to work with celery alembic fix fixed test * frontend changes * got groups working * added comments and fixed public docs * fixed merge issues * frontend complete! * frontend cleanup and mypy fixes * refactored connector access_type selection * mypy fixes * minor refactor and frontend improvements * get to fetch * renames and comments * minor change to var names * got curator stuff working * addressed pablo's comments * refactored user_external_group to reference users table * implemented polling * small refactor * fixed a whoopsies on the frontend * added scripts to seed dummy docs and test query times * fixed frontend build issue * alembic fix * handled is_public overlap * yuhong feedback * added more checks for sync * black * mypy * fixed circular import * todos * alembic fix * alembic	2024-09-19 22:07:36 +00:00
rkuo-danswer	f531d071af	Feature/background deletion (#2337 ) * first cut at redis * some new helper functions for the db * ignore kombu tables in alembic migrations (used by celery) * multiline commands for readability, add vespa_metadata_sync queue to worker * typo fix * fix returning tuple fields * add constants * fix _get_access_for_document * docstrings! * fix double function declaration and typing * fix type hinting * add a global redis pool * Add get_document function * use task_logger in various celery tasks * add celeryconfig.py to simplify configuration. Will be used in a subsequent commit * Add celery redis helper. used in a subsequent PR * kombu warning getting spammy since celery is not self managing its queue in Postgres any more * add last_modified and last_synced to documents * fix task naming convention * use celeryconfig.py * the big one. adds queues and tasks, updates functions to use the queues with priorities, etc * change vespa index log line to debug * mypy fixes * update alembic migration * fix fence ordering, rename to "monitor", fix fetch_versioned_implementation call * mypy * switch to monotonic time * fix startup dependencies on redis * rebase alembic migration * kombu cleanup - fail silently * mypy * add redis_host environment override * update REDIS_HOST env var in docker-compose.dev.yml * update the rest of the docker files * in flight * harden indexing-status endpoint against db changes happening in the background. Needs further improvement but OK for now. * allow no task syncs to run because we create certain objects with no entries but initially marked as out of date * add back writing to vespa on indexing * actually working connector deletion * update contributing guide * backporting fixes from background_deletion * renaming cache to cache_volume * add redis password to various deployments * try setting up pr testing for helm * fix indent * hopefully this release version actually exists * fix command line option to --chart-dirs * fetch-depth 0 * edit values.yaml * try setting ct working directory * bypass testing only on change for now * move files and lint them * update helm testing * some issues suggest using --config works * add vespa repo * add postgresql repo * increase timeout * try amd64 runner * fix redis password reference * add comment to helm chart testing workflow * rename helm testing workflow to disable it * adding clarifying comments * address code review * missed a file * remove commented warning ... just not needed * fix imports * refactor to use update_single * mypy fixes * add vespa test * add db refresh to connector deletion * code review fixes * move monitor_usergroup_taskset to ee, improve logging --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-09-18 16:50:11 +00:00
Chris Weaver	4218814385	Add flow to query history CSV (#2492 )	2024-09-18 14:23:56 +00:00
Chris Weaver	5f25b243c5	Add back llm_chunks_indices (#2491 )	2024-09-18 01:21:31 +00:00
Chris Weaver	7ba829a585	Add top_documents to APIs (#2469 ) * Add top_documents * Fix test --------- Co-authored-by: hagen-danswer <hagen@danswer.ai>	2024-09-16 23:48:33 +00:00
trial-danswer	8b2ecb4eab	EE movement followup for Standard Answers (#2467 ) * Move StandardAnswer to EE section of danswer/db/models * Move StandardAnswer DB layer to EE * Add EERequiredError for distinct error handling here * Handle EE fallback for slack bot config * Migrate all standard answer models to ee * Flagging categories for removal * Add missing versioned impl for update_slack_bot_config --------- Co-authored-by: danswer-trial <danswer-trial@danswer-trials-MacBook-Pro.local>	2024-09-16 22:05:53 +00:00
pablodanswer	2dd3870504	Add ability to specify persona in API request (#2302 ) * persona * all prepared excluding configuration * more sensical model structure * update tstream * type updates * rm * quick and simple updates * minor updates * te * ensure typing + naming * remove old todo + rebase update * remove unnecessary check	2024-09-16 21:31:01 +00:00

1 2 3 4

176 Commits