danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-04-04 18:12:23 +02:00

Author	SHA1	Message	Date
Evan Lohn	71304e4228	always persist in agent search	2025-02-03 20:10:51 -08:00
Evan Lohn	118e8afbef	reworked config to have logical structure	2025-02-03 20:10:51 -08:00
Evan Lohn	6c7f8eaefb	first pass at dead code deletion	2025-02-03 20:10:50 -08:00
Evan Lohn	1a22af4f27	AgentPromptConfig in Answer class	2025-02-03 20:10:50 -08:00
Evan Lohn	db2004542e	fixed chat tests	2025-02-03 20:10:50 -08:00
Evan Lohn	2032fb10da	removed print statements, fixed pass through handling	2025-02-03 20:10:50 -08:00
Evan Lohn	ca1f176c61	fixed basic flow citations and second test	2025-02-03 20:10:50 -08:00
Evan Lohn	dd260140b2	basic search restructure: WIP on fixing tests	2025-02-03 20:07:57 -08:00
pablonyx	3c34ddcc4f	E2e assistant tests (#3869 ) * adding llm override logic * update * general cleanup * fix various tests * rm * update * update * better comments * k * k * update to pass tests * clarify content * improve timeout	2025-02-01 20:05:53 +00:00
rkuo-danswer	4fe99d05fd	add timings for syncing (#3798 ) * add timings for syncing * add more logging * more debugging * refactor multipass/db check out of VespaIndex * circular imports? * more debugging * add logs * various improvements * additional logs to narrow down issue * use global httpx pool for the main vespa flows in celery. Use in more places eventually. * cleanup debug logging, etc * remove debug logging * this should use the secondary index * mypy * missed some logging * review fixes * refactor get_default_document_index to use search settings * more missed logging * fix circular refs --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app> Co-authored-by: pablodanswer <pablo@danswer.ai>	2025-01-29 23:24:44 +00:00
devin-ai-integration[bot]	b82123563b	Fix Unicode sanitization for Vespa document indexing (#3831 ) * Add support for filtering 0xFDD0-0xFDEF Unicode range - Update remove_invalid_unicode_chars to handle 0xFDD0-0xFDEF range - Add comprehensive test cases for Unicode character sanitization - Fix issue with illegal code point 0xFDDB in Vespa indexing Co-Authored-By: Chris Weaver <chris@onyx.app> * Remove unused pytest import Co-Authored-By: Chris Weaver <chris@onyx.app> --------- Co-authored-by: Devin AI <158243242+devin-ai-integration[bot]@users.noreply.github.com> Co-authored-by: Chris Weaver <chris@onyx.app>	2025-01-29 18:32:00 +00:00
Chris Weaver	5d6a18f358	Add support for more /models/list formats (#3739 )	2025-01-24 18:25:19 +00:00
Chris Weaver	8a4d762798	Fix follow ups in thread + fix user name (#3686 ) * Fix follow ups in thread + fix user name * Add back single history str * Remove newline	2025-01-16 02:40:25 +00:00
hagen-danswer	e329b63b89	Added Permission Syncing for Salesforce (#3551 ) * Added Permission Syncing for Salesforce * cleanup * updated connector doc conversion * finished salesforce permission syncing * fixed connector to batch Salesforce queries * tests! * k * Added error handling and check for ee and sync type for postprocessing * comments * minor touchups * tested to work! * done * my pie * lil cleanup * minor comment	2025-01-07 00:37:03 +00:00
pablonyx	ddec239fef	Improved indexing (#3594 ) * nit * k * add steps * main util functions * functioning fully * quick nit * k * typing fix * k * address comments	2025-01-05 23:31:53 +00:00
joachim-danswer	8750f14647	alignment & renaming of objects for initial (displayed) ranking and re-ranking/validation citations - renamed post-reranking/validation citation information consistently to final_... (example: doc_id_to_rank_map -> final_doc_id_to_rank_map) - changed and renamed objects containing initial ranking information (now: display_...) consistent with final rankings (final_...). Specifically, {} to [] for displayed_search_results - for CitationInfo, changed citation_num from 'x-th citation in response stream' to the initial position of the doc [NOTE: test implications] - changed tests: onyx/backend/tests/unit/onyx/chat/stream_processing/test_citation_processing.py onyx/backend/tests/unit/onyx/chat/stream_processing/test_citation_substitution.py	2025-01-05 15:44:34 -08:00
hagen-danswer	d1ec72b5e5	Reworked salesforce connector to use bulk api (#3581 )	2025-01-02 18:09:02 -08:00
pablonyx	788b3015bc	fix single quote block in llm answer (#3139 )	2024-12-16 20:37:47 +00:00
pablodanswer	e9b41bddc9	gmail configuration update	2024-12-14 14:53:02 -08:00
pablodanswer	64f0ad8b26	fix drive tests (nit)	2024-12-13 11:36:39 -08:00
pablodanswer	21ec5ed795	welcome to onyx	2024-12-13 09:56:10 -08:00
joachim-danswer	9455576078	Mismatch issue of Documents shown and Citation number in text fix (#3421 ) * Mismatch issue of Documents shown and Citation number in text fix When document order presented to LLM differs from order shown to user, wrong doc numbers are cited. Fix: - SearchTool.get_search_result returns now final and initial ranking - initial ranking is passed through a few objects and used for replacement in citation processing Notes: - the citation_num in the CitationInfo() object has not been changed. * PR fixes - linting - removed erroneous tab - added a substitution test case - adjusted original citation extraction use case * Included a key test and * Fixed extra spaces * Updated test documentation Updated: - test_citation_substitution (changed description) - test_citation_processing (removed data only relevant for the substitution)	2024-12-11 19:58:24 +00:00
Yuhong Sun	6026536110	Model Server Async (#3386 ) * need-verify * fix some lib calls * k * tests * k * k * k * Address the comments * fix comment	2024-12-11 01:33:44 +00:00
Yuhong Sun	ca988f5c5f	Max File Size (#3422 ) * k * k * k	2024-12-11 00:06:47 +00:00
Yuhong Sun	2a55696545	Move Answer (#3339 )	2024-12-04 16:30:47 -08:00
Yuhong Sun	aa1c4c635a	Combining Search and Chat Backend (#3273 ) * k * k * fix slack issues * rebase * k	2024-12-03 22:37:14 +00:00
pablodanswer	7c618c9d17	Unified UI (#3308 ) * fix typing * add filters display	2024-12-02 15:12:13 -08:00
Yuhong Sun	9bd0cb9eb5	Fix Citation Minor Bugs (#3294 )	2024-12-01 13:55:24 -08:00
Yuhong Sun	86d8666481	Add Test Case	2024-11-24 15:42:14 -08:00
Yuhong Sun	8abcde91d4	Fix Test (#3242 )	2024-11-24 14:31:28 -08:00
Yuhong Sun	62a4aa10db	Refactor Search (#3233 )	2024-11-23 13:42:54 -08:00
rkuo-danswer	5eddc89b5a	merge indexing and heartbeat callbacks (and associated lock reacquisi… (#3178 ) * merge indexing and heartbeat callbacks (and associated lock reacquisition). no db updates * review fixes	2024-11-21 23:48:58 +00:00
hagen-danswer	100b4a0d16	Added Slim connector for Jira (#3181 ) * Added Slim connector for Jira * fixed testing * more cleanup of Jira connector * cleanup	2024-11-21 17:00:20 +00:00
pablodanswer	bf291d0c0a	Fix missing json (#3177 ) * initial steps * k * remove logs * k * k	2024-11-20 21:24:43 +00:00
hagen-danswer	2cd1e6be00	gmail refactor + permission syncing (#3021 ) * initial frontend changes and shared google refactoring * gmail connector is reworked * added permission syncing for gmail * tested! * Added tests for gmail connector * fixed tests and mypy * temp fix * testing done! * rename * test fixes maybe? * removed irrelevant tests * anotha one * refactoring changes * refactor finished * maybe these fixes work * dumps * final fixes	2024-11-04 18:06:23 +00:00
pablodanswer	c6e8bf2d28	add multiple formats to tools (#3041 )	2024-11-03 23:54:19 +00:00
Chris Weaver	ecf4923a3a	Fix answer with specified doc ids (#2703 ) * Fix Fix Refactor more more fix refactor Fix circular imports Refactor Move tests around * Add quote support * Testing * More testing * Fix image generation slowness * Remove unused exception * Fix UT * fix stop generating * minor typo * minor logging updates for clarity --------- Co-authored-by: pablodanswer <pablo@danswer.ai>	2024-11-01 19:50:20 +00:00
rkuo-danswer	dc2dfeb5b8	Fix pywikibot droppings (#2924 ) * make pywikibot store its working files in a system provided temp directory * move the config setting around --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-11-01 05:59:12 +00:00
Yuhong Sun	1a7d627949	Disable Mediawiki Tests (#3005 )	2024-10-30 17:27:58 -07:00
Yuhong Sun	5062075b8d	Backport Test 7 (#2971 )	2024-10-27 22:55:35 -07:00
hagen-danswer	802086ee57	Refactored Confluence Connector (#2859 ) * Refactored Confluence Connector * rename metadataconnector to slimconnector Finish rename * danswer->onyx * added rec * typo * refactored doc_sync for confluence * mypy + enable tests * tested and fixed for confluence cloud * fixed all server syncing * fixed connector test * mypy+connector test fixes * addressed richards comments * minor fix	2024-10-21 23:03:40 +00:00
Chris Weaver	33974fc12c	Add support for passthrough auth for custom tool calls (#2824 ) * Add support for passthrough auth for custom tool calls * Fix formatting	2024-10-16 22:50:16 +00:00
pablodanswer	db0779dd02	Session id: int -> UUID (#2814 ) * session id: int -> UUID * nit * validated * validated downgrade + upgrade + all functionality * nit * minor nit * fix test case	2024-10-16 22:18:45 +00:00
rkuo-danswer	efe2e79f27	Rate limiting confluence through redis (#2798 ) * try rate limiting through redis * fix circular import issue * fix bad formatting of family string * Revert "fix bad formatting of family string" This reverts commit be688899e5b4dd189dc13d9fec1d0f3ade07ad4f. * redis usage optional * disable test that doesn't match with new design	2024-10-14 23:51:24 +00:00
rkuo-danswer	dee197570d	Bugfix/mediawiki (#2800 ) * fix formatting * fix poorly structured doc id, fix empty page id, fix family_class_dispatch invalid name (no spaces), fix setting id with int pageid * fix mediawiki test	2024-10-14 22:48:06 +00:00
Chris Weaver	26bdb41e8f	Fix parallel tool calls (#2779 ) * Fix parallel tool calls * remove comments	2024-10-13 03:29:18 +00:00
rkuo-danswer	3404c7eb1d	Feature/background prune 2 (#2583 ) * first cut at redis * some new helper functions for the db * ignore kombu tables in alembic migrations (used by celery) * multiline commands for readability, add vespa_metadata_sync queue to worker * typo fix * fix returning tuple fields * add constants * fix _get_access_for_document * docstrings! * fix double function declaration and typing * fix type hinting * add a global redis pool * Add get_document function * use task_logger in various celery tasks * add celeryconfig.py to simplify configuration. Will be used in a subsequent commit * Add celery redis helper. used in a subsequent PR * kombu warning getting spammy since celery is not self managing its queue in Postgres any more * add last_modified and last_synced to documents * fix task naming convention * use celeryconfig.py * the big one. adds queues and tasks, updates functions to use the queues with priorities, etc * change vespa index log line to debug * mypy fixes * update alembic migration * fix fence ordering, rename to "monitor", fix fetch_versioned_implementation call * mypy * switch to monotonic time * fix startup dependencies on redis * rebase alembic migration * kombu cleanup - fail silently * mypy * add redis_host environment override * update REDIS_HOST env var in docker-compose.dev.yml * update the rest of the docker files * in flight * harden indexing-status endpoint against db changes happening in the background. Needs further improvement but OK for now. * allow no task syncs to run because we create certain objects with no entries but initially marked as out of date * add back writing to vespa on indexing * actually working connector deletion * update contributing guide * backporting fixes from background_deletion * renaming cache to cache_volume * add redis password to various deployments * try setting up pr testing for helm * fix indent * hopefully this release version actually exists * fix command line option to --chart-dirs * fetch-depth 0 * edit values.yaml * try setting ct working directory * bypass testing only on change for now * move files and lint them * update helm testing * some issues suggest using --config works * add vespa repo * add postgresql repo * increase timeout * try amd64 runner * fix redis password reference * add comment to helm chart testing workflow * rename helm testing workflow to disable it * adding clarifying comments * address code review * missed a file * remove commented warning ... just not needed * fix imports * refactor to use update_single * mypy fixes * add vespa test * multiple celery workers * update logs as well and set prefetch multipliers appropriate to the worker intent * add db refresh to connector deletion * add some preliminary locking * organize tasks into separate files * celery auto associates tasks created inside another task, which bloats the result metadata considerably. trail=False prevents this. * code review fixes * move monitor_usergroup_taskset to ee, improve logging * add multi workers to dev_run_background_jobs.py * update supervisord with some recommended settings for celery * name celery workers and shorten dev script prefixing * add configurable sql alchemy engine settings on startup (needed for various intents like API server, different celery workers and tasks, etc) * fix comments * autoscale sqlalchemy pool size to celery concurrency (allow override later?) * supervisord needs the percent symbols escaped * use name as primary check, some minor refactoring and type hinting too. * stash merge (may not function yet) * remove dead code * more cleanup * remove dead file * we shouldn't be checking for deletion attempts in the db any more * print cc_pair_id * print status on status mismatch again * add logging when cc_pair isn't present * don't indexing any ingestion type connectors, and don't pause any connectors that aren't active * add more specific check for deletion completion * remove flaky mediawiki test site * move is_pruning * remove unused code * remove old function --------- Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2024-10-07 18:16:17 +00:00
evan-danswer	089c734f63	disabled llm when skip_gen_ai_answer_question set (#2687 ) * disabled llm when skip_gen_ai_answer_question set * added unit test * typing	2024-10-06 18:10:02 +00:00
Chris Weaver	728a41a35a	Add heartbeat to indexing (#2595 )	2024-09-29 19:26:40 -07:00
Chris Weaver	50dd3c8beb	Add size limit to jira tickets (#2586 )	2024-09-28 12:49:13 -07:00

1 2

97 Commits