Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights #17

HarounAns · 2023-09-24T05:31:10Z

Problem

In the existing design, visibility into the fetched documents is limited to instances right after the index has been freshly seeded. This approach was restrictive as it only catered to one website at a time, which posed challenges in comprehensive tracking and management. There was no provision to determine the relevancy of documents outside of this immediate post-seeding phase. With the new design, we have broadened our scope by accommodating multiple websites simultaneously. Furthermore, it provides insights into any document deemed relevant, eliminating the constraint of relying solely on the most recently seeded ones.

Solution

Implemented a relevantDocs section that showcases the documents fetched across various websites. This enhancement provides more transparency, ensuring that the user can see which documents are being retrieved, regardless of the recent actions with the crawler.

screen-recording-2023-09-24-at-13634-am_CYlbkfQ4.mov

^ Notice how the fetched documents come from different websites!

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Infrastructure change (CI configs, etc)
Non-code change (docs, etc)
None of the above: (explain here)

Test Plan

Navigate to the new relevantDocs section.
Ensure that the fetched documents across different websites are being displayed.
Test fetching documents after both loading and not loading the index via the crawler.
Confirm that the displayed documents in the relevantDocs section are consistent with the expected results based on the recent actions with the crawler.

HarounAns · 2023-09-24T16:14:41Z

@rschwabco please let me know your thoughts

HarounAns · 2024-01-17T04:57:20Z

@rschwabco do you feel like this PR has any value. If not I can close it

HarounAns added 2 commits September 24, 2023 01:24

added relevant docs

37f770b

revert prompt

de315a8

HarounAns marked this pull request as draft September 24, 2023 05:31

HarounAns added 5 commits September 24, 2023 01:32

revert prompt

15be48c

revert scroll fix

aa6b0f6

remove extra url

290aa25

remove comments

4e009fd

remove comments

041f15e

HarounAns marked this pull request as ready for review September 24, 2023 05:47

HarounAns changed the title ~~added relevant docs~~ Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights Sep 24, 2023

HarounAns added 2 commits September 24, 2023 02:00

you dont need the chunk

2d02c64

remove unnecessary chunk

6865a69

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights #17

Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights #17

HarounAns commented Sep 24, 2023 •

edited

HarounAns commented Sep 24, 2023

HarounAns commented Jan 17, 2024

Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights #17

Are you sure you want to change the base?

Enhanced Document Tracking: Multi-Website Support and Continuous Relevancy Insights #17

Conversation

HarounAns commented Sep 24, 2023 • edited

Problem

Solution

Type of Change

Test Plan

HarounAns commented Sep 24, 2023

HarounAns commented Jan 17, 2024

HarounAns commented Sep 24, 2023 •

edited