LLava Models on PDF #2265
MichaelFomenko
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have a Quaestiones about PDF and Dokuments that Contains Images. If I use a Vision Model like LLava like Models, why does it not understand the Pictures contained in PDF and Dokuments? Is it possible to enable LLava models to use the Images in PDFs or Dokuments? Or why do we need any LLava models if we can use simply an AI Model that just describe the Image to Text and use this Text like an Text Dokument? This would make all LLava Models irrelevant. We can use for Example the GIT (Generative Image-to-text Transformer) from Microsoft.
And what about the Microsoft AI Tools like:
Local Speech Model:
Beta Was this translation helpful? Give feedback.
All reactions