Inference and embeddings directly on web browser with wasm #6055
ngxson
started this conversation in
Show and tell
Replies: 2 comments 3 replies
-
Wow, nice job. I will give it a try. I see in the TODO: Are you working on something similar?
-
Hi everyone,
I've just made a WebAssembly binding for llama.cpp that can run completely in the browser. It already supports high-level APIs such as completions and embeddings, as well as some low-level functions such as sampling and (de)tokenization.
The wrapper code is written in TypeScript and can be embedded inside a ReactJS project (there's an example in my repo).
It would be nice if someone could play around with it and build a complete RAG-in-browser demo :-)
Link to the project: https://github.com/ngxson/wllama
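For anyone who wants to try the RAG-in-browser idea, here is a rough sketch of the retrieval step only: rank pre-embedded text chunks by cosine similarity to a query embedding and keep the best few. The `Embedder` type, `topK`, and `retrieve` are hypothetical names made up for illustration and are not part of the wllama API. The sketch assumes you wire in whatever embedding call the binding actually exposes.

```typescript
// Hypothetical embedder signature: any async function that maps text to a vector.
// This is NOT the wllama API; plug in the binding's real embedding call here.
type Embedder = (text: string) => Promise<number[]>;

interface EmbeddedChunk {
  text: string;
  vector: number[];
}

interface ScoredChunk {
  text: string;
  score: number;
}

// Cosine similarity between two equal-length vectors.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // A zero vector has no direction; treat its similarity as 0.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Score every chunk against the query vector and keep the k best matches.
function topK(queryVec: number[], chunks: EmbeddedChunk[], k: number): ScoredChunk[] {
  return chunks
    .map((c) => ({ text: c.text, score: cosineSimilarity(queryVec, c.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k);
}

// Embed the query with the supplied embedder, then retrieve the top-k chunks.
async function retrieve(
  embed: Embedder,
  query: string,
  chunks: EmbeddedChunk[],
  k: number = 3
): Promise<ScoredChunk[]> {
  const queryVec = await embed(query);
  return topK(queryVec, chunks, k);
}
```

The retrieved chunks would then be pasted into the prompt for a completion call. A linear scan like this is probably fine for a few thousand chunks in a browser tab; anything larger would likely need an approximate index.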