Feature Idea: Ingest from Bookmarks folder #53

Open
tomjorquera opened this issue Jan 27, 2024 · 0 comments

The current setup, which requires patching Firefox and adding an extension, got me thinking about other ways to get data out of the browser.

I know it's relatively easy to read a profile's places.sqlite file (which contains the browsing history and bookmarks), so I'd like to submit the following idea:

The user could create a dedicated bookmark folder, and every bookmark placed in it would be automatically ingested into the document DB.

That way, no special browser setup would be needed to use Memory-Cache.

I implemented a PoC in the ingest-bookmark branch of my privateGPT fork. This branch adds the ingest_bookmarks.py script, which requires the BOOKMARK_FOLDER environment variable to be defined.
It reads the user's default profile to get all the bookmarks in this folder, fetches the page content for each bookmark, and ingests it into the docs database.
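To illustrate the bookmark lookup described above, here is a minimal sketch and not the code from the PoC branch. It assumes the usual places.sqlite layout, where moz_bookmarks rows have type 1 for bookmarks and type 2 for folders, and fk points at a moz_places row holding the URL. The function name `bookmarks_in_folder` is hypothetical, and it only returns bookmarks directly inside the folder, not those in nested subfolders.

```python
import os
import shutil
import sqlite3
import tempfile

# Type codes used in moz_bookmarks.type (assumed standard Firefox schema).
TYPE_BOOKMARK = 1
TYPE_FOLDER = 2


def bookmarks_in_folder(places_path, folder_title):
    """Return (title, url) pairs for bookmarks directly inside the named folder.

    Hypothetical helper for illustration; not taken from ingest_bookmarks.py.
    """
    # Firefox may hold a lock on places.sqlite while it is running, so the
    # query runs against a temporary copy. Changes still sitting in the
    # write-ahead log (places.sqlite-wal) are not included in this copy.
    with tempfile.TemporaryDirectory() as tmp:
        copy_path = os.path.join(tmp, "places.sqlite")
        shutil.copy2(places_path, copy_path)
        conn = sqlite3.connect(copy_path)
        try:
            rows = conn.execute(
                """
                SELECT b.title, p.url
                FROM moz_bookmarks AS b
                JOIN moz_places AS p ON p.id = b.fk
                JOIN moz_bookmarks AS f ON f.id = b.parent
                WHERE b.type = ? AND f.type = ? AND f.title = ?
                ORDER BY b.position
                """,
                (TYPE_BOOKMARK, TYPE_FOLDER, folder_title),
            ).fetchall()
        finally:
            conn.close()
    return rows
```

A caller could pass `os.environ["BOOKMARK_FOLDER"]` as `folder_title` and the path to the profile's places.sqlite as `places_path`. Locating the default profile directory is left out here because it differs per operating system.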

For this PoC, the script currently has to be run manually and re-imports all the bookmarks each time. It could be improved to ingest only new bookmarks, to allow overriding the selected user profile, and so on.
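One possible way to ingest only new bookmarks is to keep a small state file listing the URLs that were already processed. The sketch below is an assumption about how that could work, not something the PoC does; the function names and the JSON state file are hypothetical.

```python
import json
from pathlib import Path


def load_seen(state_file):
    """Return the set of URLs recorded by earlier runs (empty if no state yet)."""
    path = Path(state_file)
    if not path.exists():
        return set()
    return set(json.loads(path.read_text()))


def pending_urls(urls, state_file):
    """Return the URLs that have not been ingested yet, in their original order."""
    seen = load_seen(state_file)
    # dict.fromkeys drops duplicate URLs while preserving order.
    return [u for u in dict.fromkeys(urls) if u not in seen]


def mark_ingested(url, state_file):
    """Record one URL as ingested; call this only after ingestion succeeded."""
    seen = load_seen(state_file)
    seen.add(url)
    Path(state_file).write_text(json.dumps(sorted(seen)))
```

Recording each URL only after it has been ingested means a page that failed to download is retried on the next run instead of being silently skipped.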
