Benchmark data #8

paulpierre · 2023-10-25T17:19:35Z

Any plans to provide benchmarks with top OSS models like Mistral 7B using this, as well as benchmarks against fine-tuned models?

rizerphe · 2024-03-12T09:24:57Z

Thanks! I don't have any plans currently, as I can't think of a way to benchmark this. Fine-tuning, prompt engineering and quantization all affect the results drastically, and this project only constrains the generation; it doesn't affect the overall quality. I'm not aware of any function calling benchmarks either. If you know of any, I'd love to hear about them.

