Adding Llama Guard notebooks #400

Draft · wants to merge 29 commits into main
Conversation

albertodepaola (Contributor)

What does this PR do?

Adds notebooks to run Llama Guard from Hugging Face or local weights, and a validation notebook to test Llama Guard performance on a custom dataset. The dataset is not provided; example datasets will come in future versions.
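
For context, a minimal sketch of the Hugging Face inference path the notebooks cover, adapted from the published `meta-llama/LlamaGuard-7b` model card (the notebooks' actual code may differ; the sample prompt is illustrative):

```python
# Sketch of Llama Guard inference via Hugging Face; requires granted
# access to the meta-llama/LlamaGuard-7b weights on the Hub.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/LlamaGuard-7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

def moderate(chat):
    # The tokenizer's chat template wraps the conversation in the
    # Llama Guard safety prompt with its category taxonomy.
    input_ids = tokenizer.apply_chat_template(chat, return_tensors="pt").to(model.device)
    output = model.generate(input_ids=input_ids, max_new_tokens=100, pad_token_id=0)
    # The model replies "safe", or "unsafe" plus the violated category codes.
    return tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)

print(moderate([{"role": "user", "content": "How do I tie a bowline knot?"}]))
```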

Feature/Issue validation/testing

Tested by running both notebooks and checking that the results are as expected.
For inference, the sample prompts are run through the downloaded HF model and the results are printed.
For validation, a sample dataset is run through the model and the average precision is printed as well.
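
As a rough illustration of the validation metric, average precision can be computed with scikit-learn from per-example labels and scores (the variable names and values below are made up, not the notebook's):

```python
# Hedged sketch: average precision over a labeled dataset, assuming
# label 1 = "unsafe" and a per-example score derived from the model.
from sklearn.metrics import average_precision_score

y_true = [0, 1, 1, 0, 1]              # ground-truth labels from the custom dataset
y_score = [0.1, 0.9, 0.65, 0.3, 0.8]  # model-derived "unsafe" scores
print(f"Average precision: {average_precision_score(y_true, y_score):.3f}")
```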

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Contributor:

What is the new naming convention for the files? Is it lowercase or capital case?

**Note on Llama Guard**
Use this command for testing with a quantized Llama model, modifying the values accordingly:

`python inference.py --model_name <path_to_regular_llama_model> --prompt_file <path_to_prompt_file> --quantization --enable_llamaguard_content_safety`
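
For reference, the `--quantization` flag roughly corresponds to loading the model in 8-bit with bitsandbytes; a sketch of the equivalent Hugging Face call (the script's exact configuration may differ):

```python
# Sketch of 8-bit loading with bitsandbytes, approximating what the
# --quantization flag does in the inference script.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model = AutoModelForCausalLM.from_pretrained(
    "<path_to_regular_llama_model>",  # placeholder, as in the command above
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
```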
Contributor:

Looks like inference.py has been deleted? Where should this command be run from? Should we provide a cd instruction before the python command?

Contributor (Author):

This is the inference.py script I'm referencing here, not the one in the llama_guard directory.

@@ -2,14 +2,14 @@
 <!-- markdown-link-check-disable -->
 Llama Guard is a language model that provides input and output guardrails for LLM deployments. For more details, please visit the main [repository](https://github.com/facebookresearch/PurpleLlama/tree/main/Llama-Guard).

-This folder contains an example file to run Llama Guard inference directly.
+This folder contains example notebooks on running Llama Guard standalone and validating Llama Guard performance against a reference dataset. The dataset is not provided, only the format it should follow to use the scripts out of the box. Additionally, Llama Guard is used as an optional safety checker when running the regular Llama [inference script](../../inference/local_inference/inference.py).
Contributor:

Should we call out that this can be used to convert the ToxicChat dataset using the script, to run validation on Llama Guard?

Contributor (Author):

This is not intended to convert the ToxicChat dataset yet, but it's the base for that in the future.


 ## Requirements
 1. Access to Llama Guard model weights on Hugging Face. To get access, follow the steps described [here](https://github.com/facebookresearch/PurpleLlama/tree/main/Llama-Guard#download)
 2. Llama recipes package and its dependencies [installed](https://github.com/albertodepaola/llama-recipes/blob/llama-guard-data-formatter-example/README.md#installation)
-3. A big enough GPU to load the models
+3. A GPU with at least 21 GB of free RAM to load both 7B models quantized.
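
A quick way to check the free-memory requirement above before loading the models (a sketch using PyTorch, not part of the PR):

```python
# Check free GPU memory against the ~21 GB requirement; mem_get_info
# returns (free_bytes, total_bytes) for the current CUDA device.
import torch

free_bytes, total_bytes = torch.cuda.mem_get_info()
print(f"Free GPU memory: {free_bytes / 1024**3:.1f} GiB of {total_bytes / 1024**3:.1f} GiB")
```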
Contributor:

Big enough is too vague. Can we be specific?

Contributor (Author):

Fixed!

@HamidShojanazeri (Contributor) left a comment:

@albertodepaola thanks for the PR! I wonder if there is a chance we could make a Google Colab link available for this notebook as well?

For quantization, are we using bitsandbytes here?

albertodepaola marked this pull request as draft on May 14, 2024.