llm-security

Evaluation of Google's Instruction Tuned Gemma-2B, an open-source Large Language Model (LLM). Aimed at understanding the breadth of the model's knowledge, its reasoning capabilities, and adherence to ethical guardrails, this project presents a systematic assessment across a diverse array of domains.

gemma responsible-ai huggingface-transformers llm llms llmops genai llm-security llm-inference genai-usecase largelanguagemodels gemma-2b

Updated Feb 26, 2024
Jupyter Notebook

douyipu / LMpi

Star

LMpi (Language Model Prompt Injector) is a tool designed to test and analyze various language models, including both API-based models and local models like those from Hugging Face.

prompt-injection llm-security

Updated Jul 3, 2024
Python

genia-dev / vibraniumdome-docs

Star

LLM Security Platform Docs

security openai prompts llm prompt-engineering chatgpt llmops large-language-model prompt-injection llm-serving adverarial-attacks llm-agent llm-security llm-inference llm-eval llm-framework prompt-injection-tool llm-evaluation llm-firewall

Updated Apr 9, 2024
MDX

Drlordbasil / Ultimate-Jail-Break-Conversion-Prompter

Star

Jailbreak common AI censors. Working on this for educational purposes and to earn some money from bounties.

ai jailbreak prompt llm prompting llm-security

Updated Sep 13, 2024
Python

lastlayer / last-layer-vercel

Star

Example of running last_layer with FastAPI on vercel

llm-security llm-privacy llm-guard llm-guardrails

Updated Apr 5, 2024
Python

TrustAI-laboratory / LLM-Security-CTF

Star

Learn LLM/AI Security through a series of vulnerable LLM CTF challenges. No sign ups, all fees, everything on the website.

security ctf llm llm-security prompt-hacking

Updated Aug 19, 2024

rohilrg / CatchPromptInjection

Star

This repo focus on how to deal with prompt injection problem faced by LLMs

openai-api transformers-models llm langchain prompt-injection llm-security

Updated Oct 19, 2023
Python

mtuann / llm-updated-papers

Star

Papers related to Large Language Models in all top venues

large-language-models llmops llm-security llm-training llm-inference llm-framework llm-evaluation

Updated Oct 5, 2024

minuva / fast-prompt-attack-detect

Star

User prompt attack detection system

nlp api jailbreak security-tools fastapi llm-security llm-local llm-vulnerabilities llm-guardrails

Updated May 31, 2024
Python

CommissarSilver / TraWiC

Star

Trained Without My Consent (TraWiC): Detecting Code Inclusion In Language Models Trained on Code

intellectual-property llm-security llm-training code-llms llm-evaluation

Updated Jun 20, 2024
Python

jiangnanboy / llm_security

Star

利用分类法和敏感词检测法对生成式大模型的输入和输出内容进行安全检测，尽早识别风险内容。The input and output contents of generative large model are checked by classification method and sensitive word detection method to identify content risk as early as possible.

llm llm-security

Updated Sep 9, 2024
Java

AiShieldsOrg / AiShieldsWeb

Star

AiShields is an open-source Artificial Intelligence Data Input and Output Sanitizer

ai application-security appsec sensitive-data-security data-security ai-security aisec applicationsecurity llm prompt-engineering aisecurity llm-security llmsecurity llmsec prompt-injection-remediation model-denial-of-service-remediation insecure-output-handling-remediation overreliance-remediation prompt-engineering-security artificial-intelligence-security

Updated Jun 5, 2024
Python

wangywUST / OutputJailbreak

Star

Repository for our paper "Frustratingly Easy Jailbreak of Large Language Models via Output Prefix Attacks". https://www.researchsquare.com/article/rs-4385503/latest

nlp jailbreak llm llm-security

Updated Jun 19, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the llm-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-security topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-security

Here are 57 public repositories matching this topic...

matthernet / LLM-security-check

kym6464 / gemini-spii-sample

JosephTLucas / its_thorn

nagababumo / Red-Teaming-LLM-Applications

awesome-software / llm-attacks

awesome-software / llm-guard

nodite / llm-guard-ts

mickymultani / TestingGemma2B

douyipu / LMpi

genia-dev / vibraniumdome-docs

Drlordbasil / Ultimate-Jail-Break-Conversion-Prompter

lastlayer / last-layer-vercel

TrustAI-laboratory / LLM-Security-CTF

rohilrg / CatchPromptInjection

mtuann / llm-updated-papers

minuva / fast-prompt-attack-detect

CommissarSilver / TraWiC

jiangnanboy / llm_security

AiShieldsOrg / AiShieldsWeb

wangywUST / OutputJailbreak

Improve this page

Add this topic to your repo