Skip to content

Latest commit

 

History

History
23 lines (20 loc) · 896 Bytes

README.md

File metadata and controls

23 lines (20 loc) · 896 Bytes

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

Text file input tokens per second:

  • Nvidia RTX 3070 with 8GB memory
    • Mistral 7B: ~52 tok/s
    • Mixtral 8x7B: ~28 tok/s
  • Intel I5-6500 with 8GB memory
    • Mistral 7B: ~5 tok/s
    • Mixtral 8x7B: ~2 tok/s

Installation

Linux (x86_64) & macOS (x86_64 & arm64)

If nvidia driver that supports cuda 12.1 exists, it installs cuda version, else cpu version. Replace small with large to install Mixtral 8x7B. It's ~7GB or ~48GB.

curl https://raw.githubusercontent.com/moritztng/fltr/main/install.sh -o install.sh && bash install.sh small && export PATH=$PATH:~/Fltr

Quickstart

Add --large for Mixtral 8x7B.

fltr --file emails.txt --prompt "Is the following email spam? Email:" --batch-size 32

It will output all lines in the file where the answer is yes.