
dify 0.6.6 fails to deploy local xinference model #4074

Open
4 tasks done
lk1983823 opened this issue May 4, 2024 · 2 comments
Labels
🐞 bug Something isn't working

Comments

@lk1983823

Self Checks

  • This is only for bug reports; if you would like to ask a question, please head to Discussions.
  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (我已阅读并同意 Language Policy [I have read and agree to the Language Policy]).
  • Please do not modify this template :) and fill in all the required fields.

Dify version

0.6.6

Cloud or Self Hosted

Self Hosted (Docker)

Steps to reproduce

I built the updated version of Dify (0.6.6) with Docker. I followed the instructions to deploy a local qwen-vl-7b model under Xinference, using my computer's IP address and Xinference's default port 9997, but it failed.
My OS is Ubuntu 22.04.

(Screenshots attached: dify_failed, dify_failed2, dify_failed3)

✔️ Expected Behavior

I am expecting to deploy the model successfully.

❌ Actual Behavior

It shows:
Xinference credentials validate failed: get xinference model extra parameter failed, url: http://192.168.31.191:9997/v1/models/qwen-vl-chat, error: HTTPConnectionPool(host='192.168.31.191', port=9997): Max retries exceeded with url: /v1/models/qwen-vl-chat (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7fe983bb11e0>: Failed to establish a new connection: [Errno 111] Connection refused'))

@dosubot dosubot bot added the 🐞 bug Something isn't working label May 4, 2024

dosubot bot commented May 4, 2024

The error you're encountering indicates a connection issue between the Dify container and the Xinference service. To resolve this, follow these steps:

  1. Verify Xinference Service: Ensure Xinference is actively running on your machine. If not, start it with the appropriate command, ensuring it listens on 0.0.0.0 to accept connections from any IP, which is crucial for Docker setups.

  2. Correct IP Address in Dify Configuration: In your Dify configuration, use your machine's external IP address instead of localhost or 127.0.0.1. This ensures the Dify container can reach the Xinference service running on your host.

  3. Check Firewall Settings: Make sure your firewall allows incoming connections on port 9997. Adjust your firewall settings if necessary to permit these connections.

  4. Test Connectivity: Confirm there are no network issues preventing connectivity to the Xinference service. Testing can be done from another machine or within the Docker container using tools like curl.
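The steps above can be sketched as shell commands. This is a minimal, environment-specific sketch: the host IP 192.168.31.191 is taken from the error message in this issue, and the `ufw` firewall step assumes a default Ubuntu 22.04 setup.

```shell
# Step 1: start Xinference listening on all interfaces (not just 127.0.0.1),
# so the Dify container can reach it across the Docker network boundary.
xinference-local --host 0.0.0.0 --port 9997

# Step 3: on Ubuntu with ufw enabled, allow inbound connections on port 9997.
sudo ufw allow 9997/tcp

# Step 4: test connectivity from the host; a JSON model list in the
# response means the Xinference API is up and reachable.
curl http://192.168.31.191:9997/v1/models
```

If the `curl` from the host succeeds but Dify still cannot connect, the problem is most likely inside the Docker network (wrong IP in the Dify configuration, or the service bound only to localhost).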

By ensuring Xinference is running, using the correct IP address in your Dify configuration, adjusting firewall settings, and confirming network connectivity, you should be able to successfully deploy your model.


Collaborator

takatost commented May 7, 2024

You can directly curl the xinference API address inside the container to check if it's reachable. It might be a firewall issue.
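That check can be run like this. The container name `docker-api-1` is an assumption based on a typical Dify docker-compose deployment; check `docker ps` for the actual name on your setup.

```shell
# Run curl from inside the Dify API container (container name is an
# assumption -- verify with `docker ps`). A "Connection refused" here,
# but not from the host, points to a Docker networking or firewall issue.
docker exec -it docker-api-1 curl -v http://192.168.31.191:9997/v1/models
```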

3 participants
@takatost @lk1983823 and others