You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current setup needing to patch Firefox and add an extension got me thinking about other ways to get data from the browser.
I knew it's relatively easy to read the places.sqlite file of a profile (containing the browsing history and bookmarks), so I got the following idea I would like to submit:
The user could create a specific bookmark folder, and all the bookmarks put in it would be automatically ingested into the document DB.
That way no special setup would be needed on the browser to use Memory-Cache with it.
I implemented a PoC in the branch ingest-bookmark of my privateGPT fork. This branch adds the ingest_bookmarks.py script, that needs an environment variable BOOKMARK_FOLDER to be defined.
It reads the content of the user default profile to get all the bookmarks of this folder, fetch the page content for each bookmark and ingest them into the docs database.
For this PoC the script currently needs to be manually run, and will re-import all the bookmarks each time. It can be improved to only ingest new bookmarks, to allow overriding the selected user profile and so on.
The text was updated successfully, but these errors were encountered:
The current setup needing to patch Firefox and add an extension got me thinking about other ways to get data from the browser.
I knew it's relatively easy to read the
places.sqlite
file of a profile (containing the browsing history and bookmarks), so I got the following idea I would like to submit:The user could create a specific bookmark folder, and all the bookmarks put in it would be automatically ingested into the document DB.
That way no special setup would be needed on the browser to use Memory-Cache with it.
I implemented a PoC in the branch ingest-bookmark of my privateGPT fork. This branch adds the
ingest_bookmarks.py
script, that needs an environment variableBOOKMARK_FOLDER
to be defined.It reads the content of the user default profile to get all the bookmarks of this folder, fetch the page content for each bookmark and ingest them into the docs database.
For this PoC the script currently needs to be manually run, and will re-import all the bookmarks each time. It can be improved to only ingest new bookmarks, to allow overriding the selected user profile and so on.
The text was updated successfully, but these errors were encountered: