RLHF for classification tasks #291

vinodrajendran001 · 2024-05-13T08:54:39Z

I am trying to apply RLHF on a text classification task. You can imagine the text classification model i.e. policy model here is emotion classification. The pretrained model can output class numbers ranging between 1 and 10. The reward model should train with the dataset labelled with correct class numbers (assuming it is available). Finally, I want to optimize the policy model with reward model using PPO.

Can this be done with this library? If so, please help by illustrating the steps.

Thanks

The text was updated successfully, but these errors were encountered:

hijkzzz · 2024-05-13T09:28:37Z

This should require quite a lot of modifications.
I suggest you read the train_ppo.py code carefully.

openllmai0 · 2024-05-21T17:50:51Z

Here are two suggestions: 1. Text classification may not require complex RL algorithms. 2. If using RLHF, consider changing the output to label text instead of label number.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RLHF for classification tasks #291

RLHF for classification tasks #291

vinodrajendran001 commented May 13, 2024

hijkzzz commented May 13, 2024

openllmai0 commented May 21, 2024

RLHF for classification tasks #291

RLHF for classification tasks #291

Comments

vinodrajendran001 commented May 13, 2024

hijkzzz commented May 13, 2024

openllmai0 commented May 21, 2024