How are models that use in-context examples handled? #90

Open
sachit-menon opened this issue Feb 7, 2024 · 4 comments

Comments

@sachit-menon

Hi, thanks for this great work! I was wondering how models which can use in-context examples are evaluated. Is everything in this benchmark kit zero-shot?

@kennymckormick
Member

Hi @sachit-menon,
Everything is currently zero-shot. We do not officially support few-shot evaluation, since only a few models support it.
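
For readers unfamiliar with the distinction, here is a minimal sketch of how a zero-shot prompt differs from a few-shot (in-context) one for a vision-language model. It is illustrative only and is not VLMEvalKit's interface: `build_prompt`, the `Segment` layout, and the file names are all hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical segment layout: {"type": "image" | "text", "value": ...}
Segment = Dict[str, str]
# Hypothetical in-context example: (image path, question, answer)
Shot = Tuple[str, str, str]


def build_prompt(query_image: str, question: str,
                 shots: Sequence[Shot] = ()) -> List[Segment]:
    """Build an interleaved image/text prompt.

    With no shots the prompt is zero-shot: just the query image and question.
    Each shot prepends one solved (image, question, answer) example.
    """
    prompt: List[Segment] = []
    for image, shot_question, shot_answer in shots:
        prompt.append({"type": "image", "value": image})
        prompt.append({"type": "text",
                       "value": f"Question: {shot_question}\nAnswer: {shot_answer}"})
    # The query itself always comes last, with the answer left open.
    prompt.append({"type": "image", "value": query_image})
    prompt.append({"type": "text", "value": f"Question: {question}\nAnswer:"})
    return prompt


zero_shot = build_prompt("query.jpg", "How many cats are there?")
few_shot = build_prompt(
    "query.jpg",
    "How many cats are there?",
    shots=[("ex1.jpg", "How many dogs are there?", "2"),
           ("ex2.jpg", "How many birds are there?", "5")],
)
```

The zero-shot prompt holds a single image, while the few-shot prompt interleaves several, so a model has to accept multi-image input to be evaluated that way.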

@ys-zong

ys-zong commented Apr 12, 2024

Hi, we developed an ICL benchmark for VLLMs here: https://github.com/ys-zong/VL-ICL. Feel free to try it out.

@kennymckormick Do you have any plans to integrate VL-ICL into this very useful repo? I'm also happy to contribute :)

@sachit-menon
Author

sachit-menon commented Apr 12, 2024 via email

@kennymckormick
Member

Definitely, yes! You are welcome to implement your new dataset, VL-ICL, in VLMEvalKit. Once it is implemented, we can help with the evaluation of API models.
BTW, please check our latest refactor PR #140 and make sure you are developing against the new interface.
