Hi there,
Thanks for the effort in putting together this survey on LLM evaluation.
I'd like to suggest adding our work, SpyGame, a framework for evaluating the intelligence of language models. We propose using word guessing games to assess the language and theory-of-mind intelligence of LLMs.
Paper: Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
GitHub: https://github.com/Skytliang/SpyGame