Deploy the Defog sqlcoder2 LLM on Modal (https://modal.com) using Hugging Face Text Generation Inference (TGI)

dcalaprice/modal-sqlcoder
modal-sqlcoder

Deploy Defog sqlcoder2 on Modal using Text Generation Inference (TGI).

Adapted from:

Set up environment

Create a Modal Secret named huggingface and store your Hugging Face token in it under the key HUGGING_FACE_HUB_TOKEN.
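
Modal exposes the keys of an attached Secret as environment variables inside the container, so a missing or misnamed key usually only shows up when the model download fails. The following is a minimal sketch of a fail-fast check. The helper name `require_hf_token` is hypothetical and not part of this repository; only the variable name HUGGING_FACE_HUB_TOKEN comes from the instructions above.

```python
import os


def require_hf_token(env=None):
    """Return the Hugging Face token, or raise if it is missing.

    Hypothetical helper (not part of this repo). Inside a Modal container
    the secret named "huggingface" is expected to surface its keys as
    environment variables, so we read the token from there.
    """
    env = os.environ if env is None else env
    token = env.get("HUGGING_FACE_HUB_TOKEN", "").strip()
    if not token:
        raise RuntimeError(
            "HUGGING_FACE_HUB_TOKEN is not set; check that the Modal Secret "
            "'huggingface' exists and defines this key."
        )
    return token
```

Calling such a helper at container start-up, before any weights are fetched, turns a confusing download error into an explicit message about the secret.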

Serve the model

Deploy the model with:

$ modal deploy sql_generation_inference.py

Invoke the model from other apps

Once the model is deployed, any other app can invoke inference on it. All callers share the same pool of GPU containers.

$ python
>>> import modal
>>> f = modal.Function.lookup("example-tgi-sqlcoder2", "Model.generate")
>>> result = f.remote("How many salespeople are there?", metadata="(Replace with your own metadata)")
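
For use from application code rather than the REPL, the lookup and the remote call can be wrapped in one function. This is a sketch, not the repository's own client: the name `ask_sqlcoder` and the `lookup` parameter are hypothetical, and the app and function names are copied from the session above. The `lookup` parameter exists only so the wrapper can be exercised with a stand-in when Modal is not installed. Check the documentation for your installed Modal version, since the lookup API may differ between releases.

```python
APP_NAME = "example-tgi-sqlcoder2"   # app name from the REPL session above
FUNCTION_TAG = "Model.generate"      # function tag from the REPL session above


def ask_sqlcoder(question, metadata, lookup=None):
    """Send a question and metadata to the deployed model, return its reply.

    Hypothetical wrapper. `lookup` defaults to modal.Function.lookup, the
    call used in the session above; pass a stand-in to test without Modal.
    """
    if lookup is None:
        import modal  # imported lazily so the module loads without Modal

        lookup = modal.Function.lookup
    generate = lookup(APP_NAME, FUNCTION_TAG)
    # Same call shape as the REPL example: positional question, keyword metadata.
    return generate.remote(question, metadata=metadata)
```

A call would then look like `ask_sqlcoder("How many salespeople are there?", metadata=my_metadata)`, where `my_metadata` is whatever metadata string your deployment expects.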
