Replies: 3 comments 1 reply
-
CogVLM, CogAgent, and possibly InternLM-XComposer2 seem to be the strongest models right now, but they are better at generating natural language captions than lists of tags.
What model are you using, and how exactly does your prompt not work? The best prompt depends on the model and what kind of captions you're looking for. |
Beta Was this translation helpful? Give feedback.
-
Using the prompt I mentioned above sometimes writes in the sorts of:
Which has excess punctuation. Currently I came to this prompt: This one works much better. One can also add context to this prompt, for example: |
Beta Was this translation helpful? Give feedback.
-
I don't know if this is possible to do, but blocking the LLM from using any excess punctuation except |
Beta Was this translation helpful? Give feedback.
-
I'd like to hear if anyone has done a comparative study of best model. As well as the best prompt. Currently my "Write visual tags for the current image, comma-separated" does not always work as intended.
Beta Was this translation helpful? Give feedback.
All reactions