Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Llama 2 as model evaluated? #15

Open
tiansiyuan opened this issue Aug 10, 2023 · 3 comments
Open

Add Llama 2 as model evaluated? #15

tiansiyuan opened this issue Aug 10, 2023 · 3 comments

Comments

@tiansiyuan
Copy link

No description provided.

@jindongwang
Copy link
Collaborator

Could you please be more specific? Where should we add this model?

@tiansiyuan
Copy link
Author

In the paper, Llama is mentioned twice, both on page 6.

The first one is from a paper (Saparov et al., 2023), so just keep it.

The second one,

"Moreover, LLaMA-65B is the most robust open-source LLMs to date, which per- forms closely to code-davinci-002."

could be replaced by

"Moreover, LLAMA 2 70B is the most robust open-source LLMs to date, which performs very closely to GPT-3.5 and PaLM. But there is still a large gap in performance between LLAMA 2 70B and GPT-4 and PaLM-2-L.(Touvron et al., 2023)"

As code-davinci-002 is a code generation model derived from GPT-3, I think it is not appropriate to compare it with a pretrained model such as LLaMA. Just for your consideration.

Also, I'd suggest to add the following paper as reference.

Llama 2: Open Foundation and Fine-Tuned Chat Models

@jindongwang
Copy link
Collaborator

Thanks for the detailed suggestion! We'll update the paper accordingly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants