Is your feature request related to a problem? Please describe.
A real assistant would not only converse by text but could also speak and work with video and images.
Describe the solution you'd like
Support text, images, audio, and video.
Describe alternatives you've considered
Use OpenAI's ChatGPT.
Additional context
I know that multimodal AIs are still a challenge for FOSS tooling.
See https://huggingface.co/vonjack/Hermes-2-Pro-BakLLaVA-Mistral-7B
Did you check https://github.com/Chainlit/cookbook/tree/main/audio-assistant ?