
Multi-model providers break the universal function calling expectation inside providers #22

Open
swerner opened this issue Apr 29, 2024 · 0 comments
swerner (Contributor) commented Apr 29, 2024

For model providers like Groq, Local, OpenRouter, etc., each model you might want to use through these services can have its own function-calling/tool-calling mechanism and quirks. Specifically, I just tried generating something with Groq + Llama 3, and the XML response came back in a different format.
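For illustration only (these strings are invented, not captured output from Groq or Llama 3), this is the kind of structural divergence in question, and why a parser built for one shape breaks on the other:

```python
import xml.etree.ElementTree as ET

# Hypothetical response shapes showing how two models behind the same
# service might serialize the same tool call differently.
RESPONSE_A = """<function_call>
  <name>get_weather</name>
  <arguments>{"city": "Berlin"}</arguments>
</function_call>"""

RESPONSE_B = """<tool_call name="get_weather">
  <arg key="city">Berlin</arg>
</tool_call>"""

# A parser written against RESPONSE_A's shape silently fails on RESPONSE_B:
for resp in (RESPONSE_A, RESPONSE_B):
    root = ET.fromstring(resp)
    print(root.findtext("name"))  # "get_weather" for A, None for B
```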

For now, we're going to move away from shipping direct solutions for services like Groq and instead mirror what we're doing with output_adapters: you can specify a custom AI provider in your code and implement the specific quirks of the model you're using. Over time we'll find patterns that work for models like Llama 3, Mistral, or Hermes 2 that we can provide as bases.
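As a rough sketch of that idea (the class and method names below are hypothetical, not this library's actual API), a custom provider could own both the service-specific transport and the model-specific parsing, so the rest of the generation code never sees the quirks:

```python
import re

class Llama3GroqProvider:
    """Hypothetical custom provider encapsulating the tool-calling quirks
    of one specific model behind one specific service. None of these names
    come from the library; this only sketches the 'bring your own provider'
    approach described above."""

    TOOL_CALL_RE = re.compile(
        r"<tool_call name=\"(?P<name>[^\"]+)\">(?P<body>.*?)</tool_call>",
        re.DOTALL,
    )

    def call(self, prompt: str) -> dict:
        raw = self._send_to_groq(prompt)   # service-specific transport
        return self._parse_tool_call(raw)  # model-specific parsing

    def _parse_tool_call(self, raw: str) -> dict:
        match = self.TOOL_CALL_RE.search(raw)
        if match is None:
            raise ValueError("no tool call found in model output")
        args = dict(
            re.findall(r"<arg key=\"([^\"]+)\">([^<]*)</arg>", match["body"])
        )
        return {"name": match["name"], "arguments": args}

    def _send_to_groq(self, prompt: str) -> str:
        raise NotImplementedError  # HTTP call to the service goes here
```

Swapping models then means swapping or subclassing the provider while the calling code stays unchanged, mirroring the separation output_adapters already gives you on the output side.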

swerner added this to the 0.1.0 milestone Apr 29, 2024