LLM-Tuning-Safety

Follow

🎯

Focusing

LLM-Tuning-Safety LLM-Tuning-Safety

🎯

Focusing

Follow

7 followers · 1 following

Achievements

BetaSend feedback

Achievements

BetaSend feedback

Block or Report

Block or report LLM-Tuning-Safety

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories

LLMs-Finetuning-Safety LLMs-Finetuning-Safety Public

We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.

Python 189 17
LLM-Tuning-Safety.github.io LLM-Tuning-Safety.github.io Public

CSS 1 2
test.github.io test.github.io Public

CSS