Is your feature request related to a problem? Please describe.
It would be nice to be able to use structured generation via InferenceEndpointsLLM. This is already possible on the server side for models served with Text Generation Inference (TGI) under the hood.
Describe the solution you'd like
It's currently possible to use grammars via hosted Inference Endpoints LLMs using the huggingface_hub library, i.e. something like:
from pydantic import BaseModel
from huggingface_hub import InferenceClient

class Sentences(BaseModel):
    positive: list[str]
    negative: list[str]

client = InferenceClient("meta-llama/Meta-Llama-3-70B-Instruct")
client.text_generation(
    "Return sentences with positive or negative sentiment. Return as a JSON "
    "object with two keys, positive and negative, each containing a list of "
    "5 sentences.",
    # Constrain decoding to the JSON schema derived from the Pydantic model.
    grammar={"type": "json", "value": Sentences.model_json_schema()},
)
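Because the grammar constrains decoding, the returned string is guaranteed to be valid JSON matching the schema, so it can be parsed directly. A minimal sketch, using a hand-written sample response since no endpoint is called here:

```python
import json

# Hypothetical response string; a real call would return text shaped like
# this because the grammar restricts generation to the JSON schema.
raw = '{"positive": ["Great service."], "negative": ["Too slow."]}'

data = json.loads(raw)
# With pydantic installed, Sentences.model_validate_json(raw) would give a
# typed object instead of a plain dict.
positive, negative = data["positive"], data["negative"]
```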
huggingface_hub docs: https://huggingface.co/docs/huggingface_hub/package_reference/inference_client#huggingface_hub.InferenceClient.text_generation.grammar
Describe alternatives you've considered
It's also possible to subclass the current InferenceEndpointsLLM to get similar behavior.
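The subclassing workaround could look roughly like the sketch below: a subclass that injects a fixed JSON-schema grammar into every generation call. distilabel's actual InferenceEndpointsLLM API may differ, so the base class here is a network-free stand-in for huggingface_hub.InferenceClient, and all names are illustrative:

```python
class GrammarClient:
    """Stand-in for huggingface_hub.InferenceClient (no network calls)."""
    def __init__(self, model: str):
        self.model = model

    def text_generation(self, prompt: str, **kwargs):
        # A real client would hit the endpoint; here we echo the kwargs so
        # the injected grammar is visible.
        return kwargs

class StructuredLLM(GrammarClient):
    """Hypothetical subclass that always passes a JSON-schema grammar."""
    def __init__(self, model: str, schema: dict):
        super().__init__(model)
        self.grammar = {"type": "json", "value": schema}

    def text_generation(self, prompt: str, **kwargs):
        # Inject the grammar unless the caller supplied one explicitly.
        kwargs.setdefault("grammar", self.grammar)
        return super().text_generation(prompt, **kwargs)

# Hand-written schema mirroring the Sentences model above.
schema = {
    "type": "object",
    "properties": {
        "positive": {"type": "array", "items": {"type": "string"}},
        "negative": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["positive", "negative"],
}
llm = StructuredLLM("meta-llama/Meta-Llama-3-70B-Instruct", schema)
sent = llm.text_generation("List positive and negative sentences.")
```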