danswer

mirror of https://github.com/danswer-ai/danswer.git synced 2025-09-05 17:08:36 +02:00

Author	SHA1	Message	Date
rkuo-danswer	392b87fb4f	Bugfix/limit permission size (#4695 ) * add utility function * add utility functions to DocExternalAccess * refactor db access out of individual celery tasks and put it directly into the heavy task * code review and remove leftovers * fix circular imports --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>	2025-05-13 00:46:31 +00:00
Raunak Bhagat	d744c0dab4	fix: Fix error in which channel names would not have the leading "#" removed (#4664 ) * Fix failing entrypoint into slack connector * Pre-filter channel names upon instantiation of slack connector class * Add decrypt script * Add slack connector tests * Fix mypy errors on decrypt.py * Add property to SlackConnector class * Add some basic tests * Move location of tests * Change name of env token * Add secrets for Slack * Add more parameterized cases * Change env variable name * Change names * Update channel names * Edit tests * Modify tests * Only import type in __main__ * Fix tests to actually test connectors * Pass parameter to fixture directly	2025-05-07 04:55:21 +00:00
Raunak Bhagat	79b981075e	perf: Optimize query history exporting process (#4602 ) * Update mode to be a default parameter in `FileStore.read` * Move query history exporting process to be a background job instead * Move hardcoded report-file-naming to a common utility function * Add type annotations * Update download component * Implement button to re-ping and download CSV file; fix up some backend file-checking logic * De-indent logic (w/ early return) * Return different error codes dependings on the type of task status * Add more resistant failure retrying mechanisms * Remove default parameter in helper function * Use popup for error messaging * Update return code * Update web/src/app/ee/admin/performance/query-history/DownloadAsCSV.tsx Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * Add type to useState call * Update backend/ee/onyx/server/query_history/api.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * Update backend/onyx/file_store/file_store.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * Update backend/ee/onyx/background/celery/apps/primary.py Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com> * Move rerender call to after check * Run formatter * Add type conversions back (smh greptile) * Remove duplicated call to save_file * Move non-fallible logic out of try-except block * Pass date-ranges into API call * Convert to ISO strings before passing it into the API call * Add API to list all tasks * Create new pydantic model to represent tasks to return instead * Change helper to only fetch query-history tasks * Use `shared_tasks` instead of old method * Address more comments from PR; consolidate how task name is generated * Mark task as failed if any exception is raised * Change the task object which is returned back to the FE * Add a table to display previously generated query-history-csv's * Add timestamps to task; delete tasks as soon as file finishes processing * Raise exception if start_time is not present * Convert hard-coded string to constant * Add "Generated At" field to table * Return task list in sorted order (based off of start-time) * Implement pagination * Remove unused props and cleanup tailwind classes * Change the name of kickoff button * Redesign how previous query exports are viewed * Make button a constant width even when contents change * Remove timezone information before comparing * Decrease interval time for re-pinging API * Add timezone to start-time creation * Add a refreshInterval for getting updated task status * Add new background queue * Edit small verbiage and remove error popup when max-retries is hit * Change up heavy worker to recognize new task in new module * Ensure `celery_app` is imported * Change how `celery_app` is imported and defined * Update comment on why `celery_app` must be imported * Add basic skeleton for new beat task to cleanup any dead / failed query-history-export tasks * Move cleanup task to different worker / queue * Implement cleanup task * Add return type * Address comment on PR * Remove delimiter from prefix * Change name of function to be more descriptive * Remove delimiter from prefix constant * Move function invocation closer to usage location * Move imports to top of file * Move variable up a scope due to undefined error * Remove dangling if-statement * Make function more pure-functional * Remove redefinition --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>	2025-05-03 00:16:35 +00:00
Evan Lohn	55e4465782	orphan tag cleanup optimization (#4651 ) * move orphan tag cleanup to final cleanup section of associated tparent tasks * naming	2025-05-02 17:22:59 +00:00
rkuo-danswer	ea1d3c1eda	Feature/db script (#4574 ) * debug script + slight refactor of db class * better comments * move setup logger --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app> Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2025-04-23 20:00:35 +00:00
rkuo-danswer	2111eccf07	Feature/vespa jinja (#4558 ) * tool to generate vespa schema variations for our cloud * extraneous assign * use a real templating system instead of search/replace * fix float * maybe this should be double * remove redundant var * template the other files * try a spawned process * move the wrapper * fix args * increase timeout --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app> Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2025-04-20 22:28:55 +00:00
rkuo-danswer	e5e0944049	tool to generate vespa schema variations for our cloud (#4556 ) * tool to generate vespa schema variations for our cloud * extraneous assign * float, not double * back to double --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>	2025-04-18 20:47:17 +00:00
joachim-danswer	2683207a24	Expanded basic search (#4517 ) * initial working version * ranking profile * modification for keyword/instruction retrieval * mypy fixes * EL comments * added env var (True for now) * flipped default to False * mypy & final EL/CW comments + import issue	2025-04-13 23:13:01 -07:00
pablonyx	65fd8b90a8	add image indexing tests (#4477 ) * address file path * k * update * update * nit- fix typing * k * should path * in a good state * k * k * clean up file * update * update * k * k * k	2025-04-11 22:16:37 +00:00
rkuo-danswer	24184024bb	Bugfix/dependency updates (#4482 ) * bump fastapi and starlette * bumping llama index and nltk and associated deps * bump to fix python-multipart * bump aiohttp * update package lock for examples/widget * bump black * sentencesplitter has changed namespaces * fix reorder import check, fix missing passlib * update package-lock.json * black formatter updated * reformatted again * change to black compatible reorder * change to black compatible reorder-python-imports fork * fix pytest dependency * black format again * we don't need cdk.txt. update packages to be consistent across all packages --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app> Co-authored-by: Richard Kuo <rkuo@rkuo.com>	2025-04-10 08:23:02 +00:00
evan-danswer	17562f9b8f	Id not set in checkpoint2 (#4468 ) * unconditionally set completion * drive connector improvements * fixing broader typing issue * fix tests, CW comments * actual test fix	2025-04-07 17:00:42 -07:00
Richard Kuo (Onyx)	d7063e0a1d	expose acl link feature in onyx_vespa	2025-04-01 16:19:50 -07:00
pablonyx	3a3b2a2f8d	add user files (#4152 )	2025-04-01 16:19:44 -07:00
evan-danswer	56f8ab927b	Contextual Retrieval (#4029 ) * contextual rag implementation * WIP * indexing test fix * workaround for chunking errors, WIP on fixing massive memory cost * mypy and test fixes * reformatting * fixed rebase	2025-03-30 18:49:09 +00:00
rkuo-danswer	aab777f844	Bugfix/acl prefix (#4377 ) * fix acl prefixing * increase timeout a tad * block access to init'ing DocumentAccess directly, fix test to work with ee/MIT * fix env var checks --------- Co-authored-by: Richard Kuo (Onyx) <rkuo@onyx.app>	2025-03-28 05:52:35 +00:00
Weves	61366df34c	Add execute permission	2025-03-18 12:03:32 -07:00
Chris Weaver	1a444245f6	Memory tracking script (#4297 ) * Add simple container-level memory tracking script	2025-03-18 12:00:09 -07:00
joachim-danswer	463340b8a1	Reduce ranking scores for short chunks without actual information (#4098 ) * remove title for slack * initial working code * simplification * improvements * name change to information_content_model * avoid boost_score > 1.0 * nit * EL comments and improvements Improvements: - proper import of information content model from cache or HF - warm up for information content model Other: - EL PR review comments * nit * requirements version update * fixed docker file * new home for model_server configs * default off * small updates * YS comments - pt 1 * renaming to chunk_boost & chunk table def * saving and deleting chunk stats in new table * saving and updating chunk stats * improved dict score update * create columns for individual boost factors * RK comments * Update migration * manual import reordering	2025-03-13 17:35:45 +00:00
pablonyx	ecbd4eb1ad	add basic user invite flow (#4253 )	2025-03-11 19:02:51 +00:00
rkuo-danswer	a7acc07e79	fix usage report pagination (#4183 ) * early work in progress * rename utility script * move actual data seeding to a shareable function * add test * make the test pass with the fix * fix comment --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-03-05 19:13:51 +00:00
pablonyx	20f2b9b2bb	Add image support for search (#4090 ) * add support for image search * quick fix up * k * k * k * k * nit * quick fix for connector tests	2025-03-05 17:44:18 +00:00
Chris Weaver	f25e1e80f6	Add option to not re-index (#4157 ) * Add option to not re-index * Add quantizaton / dimensionality override support * Fix build / ut	2025-03-03 10:54:11 -08:00
pablonyx	a98dcbc7de	Update tenant logic (#4122 ) * k * k * k * quick nit * nit	2025-02-26 03:53:46 +00:00
pablonyx	7d40676398	Heavy task improvements, logging, and validation (#4058 )	2025-02-24 13:48:53 -08:00
rkuo-danswer	61d536c782	tool fixes (#4075 )	2025-02-21 12:30:33 -08:00
pablonyx	47fd4fa233	Strict Tenant ID Enforcement (#3871 ) * strict tenant id enforcement * k * k * nit * merge * nit * k	2025-02-19 00:52:56 +00:00
Chris Weaver	f1fc8ac19b	Connector checkpointing (#3876 ) * wip checkpointing/continue on failure more stuff for checkpointing Basic implementation FE stuff More checkpointing/failure handling rebase rebase initial scaffolding for IT IT to test checkpointing Cleanup cleanup Fix it Rebase Add todo Fix actions IT Test more Pagination + fixes + cleanup Fix IT networking fix it * rebase * Address misc comments * Address comments * Remove unused router * rebase * Fix mypy * Fixes * fix it * Fix tests * Add drop index * Add retries * reset lock timeout * Try hard drop of schema * Add timeout/retries to downgrade * rebase * test * test * test * Close all connections * test closing idle only * Fix it * fix * try using null pool * Test * fix * rebase * log * Fix * apply null pool * Fix other test * Fix quality checks * Test not using the fixture * Fix ordering * fix test * Change pooling behavior	2025-02-16 02:34:39 +00:00
pablonyx	b70db15622	Bugfix Vespa Deletion Script (#3998 )	2025-02-13 17:26:04 -08:00
pablonyx	c6434db7eb	Add delete all for tenants in Vespa (#3970 )	2025-02-13 14:33:49 -08:00
pablodanswer	a202e2bf9d	Improvements to Redis + Vespa debugging	2025-02-06 13:30:06 -08:00
pablonyx	4affc259a6	Password reset tenant (#3895 ) * nots * functional * minor naming cleanup * nit * update constant * k	2025-02-05 03:17:11 +00:00
pablodanswer	125e5eaab1	various mypy improvements	2025-02-04 12:06:10 -08:00
rkuo-danswer	4fe99d05fd	add timings for syncing (#3798 ) * add timings for syncing * add more logging * more debugging * refactor multipass/db check out of VespaIndex * circular imports? * more debugging * add logs * various improvements * additional logs to narrow down issue * use global httpx pool for the main vespa flows in celery. Use in more places eventually. * cleanup debug logging, etc * remove debug logging * this should use the secondary index * mypy * missed some logging * review fixes * refactor get_default_document_index to use search settings * more missed logging * fix circular refs --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app> Co-authored-by: pablodanswer <pablo@danswer.ai>	2025-01-29 23:24:44 +00:00
pablonyx	7f10494bbe	Better vespa interface (#3781 ) * k * much cleaner vespa util class * log * typing * improvement * improve	2025-01-26 21:22:44 +00:00
Yuhong Sun	cbf98c0128	Fix Seeding Link for Support Use Case (#3784 )	2025-01-25 19:39:36 -08:00
pablonyx	2a1bb4ac41	Vespa scripts + Redis script update (#3758 ) * update onyx redis script * looking good * simplify comments * remove unnecessary apps option * iterate * fix typing	2025-01-23 23:46:17 +00:00
pablonyx	ccb16b7484	Indexing latency check fix (#3747 ) * add logs + update dev script * update conig * remove prints * temporarily turn off * va * update * fix * finalize monitoring updates * update	2025-01-23 17:14:26 +00:00
skylares	af953ff8a3	Paginate Query History table (#3592 ) * Add pagination for query history table * Fix method name * Fix mypy	2025-01-17 15:31:42 -08:00
hagen-danswer	b1957737f2	refactored _add_user_filter usage (#3674 ) * refactored db.connector_credential_pair * Rerfactored the db.credentials user filtering * the restr	2025-01-14 23:35:52 +00:00
rkuo-danswer	2ae91f0f2b	Feature/redis prod tool (#3619 ) * prototype tools for handling prod issues * add some commands * add batching and dry run options * custom redis tool * comment * default to app config settings for redis --------- Co-authored-by: Richard Kuo (Danswer) <rkuo@onyx.app>	2025-01-09 21:34:07 +00:00
pablonyx	d7bc32c0ec	Fully remove visit API (#3621 ) * v1 * update indexing logic * update updates * nit * clean up args * update for clarity + best practices * nit + logs * fix * minor clean up * remove logs * quick nit	2025-01-08 13:49:01 -08:00
pablonyx	ddec239fef	Improved indexing (#3594 ) * nit * k * add steps * main util functions * functioning fully * quick nit * k * typing fix * k * address comments	2025-01-05 23:31:53 +00:00
Chris Weaver	f895e5f7d0	Speedup orphan doc cleanup script (#3596 ) * Speedup orphan doc cleanup script * Fix mypy	2025-01-05 14:28:25 +00:00
skylares	c191e23256	Pagination Hook (#3494 ) * Backend changes for pagination hook + Paginated users table * Frontend changes for pagination hook * Fix invited users endpoint * Fix layout shift & add enter to submit user invites * mypy * Cleanup * Resolve PR concerns & remove UserStatus * Fix errors --------- Co-authored-by: hagen-danswer <hagen@danswer.ai>	2025-01-03 14:32:55 -08:00
Richard Kuo	3eb72e5c1d	Revert "More efficient Vespa indexing (#3552 )" This reverts commit `2783216781`.	2024-12-31 09:40:23 -08:00
pablonyx	2783216781	More efficient Vespa indexing (#3552 ) --------- Co-authored-by: Chris Weaver <25087905+Weves@users.noreply.github.com>	2024-12-30 18:51:14 -08:00
Yuhong Sun	814f97c2c7	MT Cloud Monitoring (#3465 )	2024-12-15 16:05:03 -08:00
pablodanswer	21ec5ed795	welcome to onyx	2024-12-13 09:56:10 -08:00
hagen-danswer	ef9942b751	Related permission docs to cc_pair to prevent orphan docs (#3336 ) * Related permission docs to cc_pair to prevent orphan docs * added script * group sync deduping * logging	2024-12-04 21:00:54 +00:00
Weves	b01a1b509a	Add basic loadtest script	2024-12-04 10:53:48 -08:00

143 Commits