multimodal conversation support #998

fire · 2024-05-15T16:35:27Z

Is your feature request related to a problem? Please describe.

A real assistant would not only converse by text but can speak and use video / images.

Describe the solution you'd like
A clear and concise description of what you want to happen.

Support text, images, audio and vidoe.

Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.

Use OpenAI's chatgpt.

Additional context
Add any other context or screenshots about the feature request here

I know that multimodal ais are still a challenge for FOSS tooling.

fire · 2024-05-15T16:36:21Z

willydouhard · 2024-05-29T08:24:05Z

fire added the needs-triage label May 15, 2024

fire changed the title ~~multimodal converation support~~ multimodal conversation support May 15, 2024

Provide feedback