Let Raccoon sample the unknown, safeguarding your AI's home.
-
Updated
Mar 17, 2023 - Python
Let Raccoon sample the unknown, safeguarding your AI's home.
Build production ready apps for GPT using Node.js & TypeScript
A serverless set of functions for evaluating whether incoming messages to an LLM system seem to contain instances of prompt injection; uses cascading cosine similarity and ROUGLE-L calculation against known good and bad prompts
ChatGPT Adversarial Attack for The Pitt Challenge 2023
Happy Prompt is a unique tool designed to interject positive emotions into text prompts, allowing users to communicate joyful, uplifting, and enthusiastic expressions. It utilizes a series of cheerful emojis, symbols, and text representations to infuse the text with a sense of happiness, love, dancing, partying, and other upbeat themes.
Prompt Engineering Tool for AI Models with cli prompt or api usage
A new kind of MLOps platform purpose built for production generative ai apps
My inputs for the LLM Gandalf made by Lakera
Turning Gandalf against itself. Use LLMs to automate playing Lakera Gandalf challenge without needing to set up an account with a platform provider.
This repo focus on how to deal with prompt injection problem faced by LLMs
LLM prompt injection detection
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
prompt attack-defense, prompt Injection, reverse engineering notes and examples | 提示词对抗、破解例子与笔记
automatically tests prompt injection attacks on ChatGPT instances
This project investigates the security of large language models by performing binary classification of a set of input prompts to discover malicious prompts. Several approaches have been analyzed using classical ML algorithms, a trained LLM model, and a fine-tuned LLM model.
Vulnerable LLM Application
The Security Toolkit for LLM Interactions (TS version)
The Security Toolkit for managing Generative AI(especially LLMs) and Supervised Learning processes(Learning and Inference).
MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. MER benchmarks language models for manipulative expressions, fostering development of transparency and safety in AI. It also supports manipulation victims by detecting manipulative patterns in human communication.
Add a description, image, and links to the prompt-injection topic page so that developers can more easily learn about it.
To associate your repository with the prompt-injection topic, visit your repo's landing page and select "manage topics."