335 Commits

Author SHA1 Message Date
Weves
ab905e9fe6 Include channel name in slack bot logs 2023-08-19 22:23:57 -07:00
Chris Weaver
067503bc84
Background logs to stdout (#315) 2023-08-18 19:02:32 -07:00
Chris Weaver
f541a3ee85
Continue on some connector failures (#314) 2023-08-18 17:59:33 -07:00
Weves
70d7ca5c73 Better error for missing allowed_users / allowed_groups 2023-08-18 10:21:27 -07:00
Chris Weaver
bf4b63de19
Linear connector (#312) 2023-08-17 15:17:57 -07:00
Chris Weaver
f37ac76d3c
Stop using untyped dicts to represent quotes (#310) 2023-08-17 14:53:55 -07:00
Sid Ravinutala
81a4934bb8
Google drive shared files fix + shortcuts (#300)
Also fixes foreign key constraint issue when manually wiping postgres + keeps track of accessed folders
2023-08-17 08:54:00 -07:00
Weves
0aa04ad616 Add chunk level logging when indexing 2023-08-15 18:43:46 -07:00
Weves
820f8b7b48 Add document-level logging for each batch of indexed documents 2023-08-15 18:09:39 -07:00
Weves
8fc74a4313 Fix slack pagination 2023-08-15 17:58:36 -07:00
Weves
a6e08b42e2 Improve slack connector logging 2023-08-15 16:19:03 -07:00
Yuhong Sun
620280db92
Fix formatting according to precommit hooks (#303) 2023-08-15 01:32:09 -07:00
Yuhong Sun
b73d19f35f
Fix Azure OpenAI Docker Deployment (#302) 2023-08-15 01:06:23 -07:00
Michał Flak
286445f9ba
Zulip connector (#247)
Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2023-08-14 15:29:34 -07:00
Yuhong Sun
848e5653a9
More permissive quote matching (#295) 2023-08-14 15:03:21 -07:00
Yuhong Sun
59db40cf36
Add Azure OpenAI parameters to background job for Slackbot (#294) 2023-08-14 14:37:02 -07:00
Matthew Holland
204d89a148
Azure OpenAI integration (#293) 2023-08-14 14:30:44 -07:00
Yuhong Sun
bb58dce1c5
Default Empty Timeout Value breaks Docker Compose (#292) 2023-08-14 10:22:31 -07:00
Weves
e0cbd087f7 Fix count of docs for connector failure 2023-08-13 17:31:55 -07:00
Yuhong Sun
be318433e3
Reset the Default GenAI model choice to OpenAI (#288) 2023-08-13 16:23:55 -07:00
jabdoa2
a73ea23e2c
add simple local llm (#202)
A very simple local LLM. Not as good as OpenAI but works as a drop-in replacement for on-premise deployments.

---------

Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2023-08-13 15:54:59 -07:00
James Choncholas
758015baa5
Allow setting QA_TIMEOUT from env var (#258)
Co-authored-by: Yuhong Sun <yuhongsun96@gmail.com>
2023-08-13 10:37:32 -07:00
Yuhong Sun
b1bd0b42e5
Add Blog Link (#286) 2023-08-12 18:29:45 -07:00
Weves
ecb26ddaf7 Add polling range for updates 2023-08-12 18:04:48 -07:00
Yuhong Sun
bcca8daab1
Fix misleading comment about HuggingFace (#284) 2023-08-12 17:40:03 -07:00
Weves
156ccc15a8 Fix fetching of latest index attempt 2023-08-12 17:18:09 -07:00
Chris Weaver
d5bb10b61f
Improve indexing status display (#278)
Adds:
- actual error message in UI for indexing failure
- if a connector is disabled, stops indexing immediately (after the current batch of documents) to allow for deletion
- adds num docs indexed for the current run + a speed
2023-08-12 14:49:04 -07:00
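The "stop after the current batch when a connector is disabled" behavior described in #278 could look roughly like the sketch below. This is illustrative only and is not the code from the commit: `index_in_batches`, `is_disabled`, and `index_batch` are hypothetical names, and the real implementation presumably reads the disabled flag from the database.

```python
from typing import Callable, Iterable, List


def index_in_batches(
    batches: Iterable[List[str]],
    is_disabled: Callable[[], bool],
    index_batch: Callable[[List[str]], None],
) -> int:
    """Index documents batch by batch; return how many were indexed.

    All names here are hypothetical. The disabled check runs only after
    a batch has finished, so an in-flight batch is never cut short.
    """
    docs_indexed = 0
    for batch in batches:
        index_batch(batch)
        docs_indexed += len(batch)
        # Re-check the flag between batches so that disabling the
        # connector takes effect promptly (e.g. to allow deletion).
        if is_disabled():
            break
    return docs_indexed
```

The returned count corresponds to the "num docs indexed for the current run" figure that the commit message mentions showing in the UI.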
Yuhong Sun
bca63e5a76
Do not stream Quote when using freeform prompt (#277) 2023-08-12 14:26:44 -07:00
Weves
54ee323e59 Fix duplicate documents with Slack connector 2023-08-10 10:54:12 -07:00
Yuhong Sun
a03818e6f6
Fix Google Colab Demo (#275) 2023-08-09 01:35:48 -07:00
Chris Weaver
89f71ac335
Support deletion of documents when a connector is deleted (#271) 2023-08-09 00:53:42 -07:00
Yuhong Sun
b6dec6dcdb
Standardize model config naming (#274) 2023-08-08 00:18:13 -07:00
Yuhong Sun
02c3139bc9
Add Request Model Class for Google Colab Demo (#273)
Need to add the blog links later
2023-08-08 00:09:11 -07:00
Sid Ravinutala
ca72027b28
Allow slack channels to be specified (#238)
Adds the capability to specify specific channels to index when using the Slack connector
2023-08-07 22:09:27 -07:00
Yuhong Sun
3bfc72484d
Support for Request accessed GenAI Models (#270) 2023-08-06 18:31:47 -07:00
Pratik Kabra
0e667d3384
Huggingface Inference backend internal models (#265) 2023-08-05 11:33:19 -07:00
Weves
df62648bbf Increase timeout for answer generation for slack bot 2023-08-04 18:14:52 -07:00
Chris Weaver
132a9f750d
Add Github Action to run mypy / reorder-python-imports / black on all PRs (#251)
Also fixes import ordering (previously, local imports weren't grouped together as they should have been)
2023-07-29 16:53:38 -07:00
Yuhong Sun
87fe6f7575
Add ingestion metrics (#256) 2023-07-29 16:37:22 -07:00
Yuhong Sun
fe40e72b5c
Require Semantic Identifier to not be None (#255) 2023-07-29 14:12:30 -07:00
jabdoa2
63780113d3
Add support for openid connect (#206)
This allows using Danswer in typical (non-Google) enterprise environments.

* Access Tokens can be very large. A token without claims is already 1100 bytes for me (larger than allowed in danswer by default). With roles I got a 12kB token. For that reason I changed the field to TEXT in the database.
* Danswer used to swallow most errors when OIDC would fail. Node.js forwards a request to the backend and swallows all errors. Even within the backend we caught all ValueErrors and only returned the last exception with the request. Added full stack trace logging to allow debugging issues with userinfo and other endpoints.
* Allow changing name of the login provider on the login button.
* Changed variables and URLs to generic OAUTH_XX (without google in the name) but kept compatibility with the existing google integration
* Tested against Keycloak with OpenID Connect

Next steps:
* Claim to role mappings
* Auto login/SSO (Login button is just an extra click)
2023-07-29 14:04:32 -07:00
jabdoa2
878d4e367f
prevent crash when semantic_identifier is None (#201)
This is a workaround for intermittent issues where semantic_identifier becomes None for some reason. It usually recovers when documents are rescraped.

Obviously, we do not yet understand the issue and are interested in a better solution.
2023-07-29 12:37:02 -07:00
Yuhong Sun
17e2008027
Add TODOs and minor style changes to web connector (#254) 2023-07-29 12:35:38 -07:00
jabdoa2
0d7d54fddb
Improve Web Connector Output, Add Config Options and add OAuth Backend Flow (#199) 2023-07-29 12:21:23 -07:00
Chris Weaver
3e8f5fa47e
Fix a few bugs with Google Drive polling (#250)
- Adds some offset to the `start` for the Google Drive connector to give time for `modifiedTime` to propagate so we don't miss updates
- Moves fetching folders into a separate call since folder `modifiedTime` doesn't get updated when a file in the folder is updated
- Uses `connector_credential_pair.last_successful_index_time` instead of `updated_at` to determine the `start` for poll connectors
2023-07-28 18:27:32 -07:00
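The polling fix in #250 (start the window from the last successful index time, then move it back by some offset so late-propagating `modifiedTime` values are not missed) might be sketched as follows. This is a minimal illustration and is not taken from the commit: `poll_window` and `POLL_OFFSET` are hypothetical names, and the five-minute offset is an assumed value, since the commit message does not state the actual one.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional, Tuple

EPOCH = datetime.fromtimestamp(0, tz=timezone.utc)

# Assumed safety margin; the commit message does not give the real value.
POLL_OFFSET = timedelta(minutes=5)


def poll_window(
    last_successful_index_time: Optional[datetime],
    now: datetime,
    offset: timedelta = POLL_OFFSET,
) -> Tuple[datetime, datetime]:
    """Return the (start, end) range a poll connector should fetch.

    The start is based on the last *successful* index time rather than
    a generic updated_at timestamp, and is shifted earlier by `offset`.
    """
    if last_successful_index_time is None:
        # Never indexed successfully: fetch everything.
        return EPOCH, now
    start = max(last_successful_index_time - offset, EPOCH)
    return start, now
```

Overlapping windows mean some documents are fetched twice, so this approach assumes re-indexing an unchanged document is harmless.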
Yuhong Sun
55adde5e27
Fix import location and mypy issue (#249) 2023-07-28 16:06:25 -07:00
Yuhong Sun
2a339ec34b
Prevent too many tokens to GPT (#245) 2023-07-28 16:00:26 -07:00
Yuhong Sun
d03ac44744
Guru Connector (#177)
Co-authored-by: Weves <chrisweaver101@gmail.com>
2023-07-28 14:27:02 -07:00
Yuhong Sun
4d0732395d
Standalone Script to Test OpenAI API Key (#243) 2023-07-27 16:33:04 -07:00
Yuhong Sun
2a0d3b38e9
Google Drive Connector Debug Logging (#241) 2023-07-27 09:27:57 -07:00