Skip to content

moritztng/fltr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

Text file input tokens per second:

  • Nvidia RTX 3070 with 8GB memory
    • Mistral 7B: ~52 tok/s
    • Mixtral 8x7B: ~28 tok/s
  • Intel I5-6500 with 8GB memory
    • Mistral 7B: ~5 tok/s
    • Mixtral 8x7B: ~2 tok/s

Installation

Linux (x86_64) & macOS (x86_64 & arm64)

If nvidia driver that supports cuda 12.1 exists, it installs cuda version, else cpu version. Replace small with large to install Mixtral 8x7B. It's ~7GB or ~48GB.

curl https://raw.githubusercontent.com/moritztng/fltr/main/install.sh -o install.sh && bash install.sh small && export PATH=$PATH:~/Fltr

Quickstart

Add --large for Mixtral 8x7B.

fltr --file emails.txt --prompt "Is the following email spam? Email:" --batch-size 32

It will output all lines in the file where the answer is yes.