Prompt attack datasets gathered from Gandalf (https://gandalf.lakera.ai/), including the datasets from 'Gandalf the Red' (https://arxiv.org/abs/250).
Lakera (company, verified)
AI & ML interests: AI Safety, Computer Vision, NLP, Responsible AI, AI Fairness, Model validation
A collection of datasets and papers discussed during our "Lessons Learned from Crowdsourced LLM Threat Intelligence" webinar.
- Lakera/gandalf_ignore_instructions (Viewer • Updated • 1k • 353 • 33)
- Lakera/gandalf_summarization (Viewer • Updated • 140 • 194 • 7)
- Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition (Paper • 2311.16119 • Published • 2)
- hackaprompt/hackaprompt-dataset (Viewer • Updated • 602k • 388 • 79)
models (5)
- Lakera/autotrain-cancer-lakera-50807121085 (Image Classification • Updated • 7)
- Lakera/autotrain-cancer-lakera-50807121082 (Image Classification • Updated • 6)
- Lakera/autotrain-cancer-lakera-50807121084 (Image Classification • Updated • 6)
- Lakera/autotrain-cancer-lakera-50807121083 (Image Classification • Updated • 7)
- Lakera/autotrain-cancer-lakera-50807121081 (Image Classification • Updated • 5)
datasets (11)
- Lakera/b3-agent-security-benchmark-weak (Viewer • Updated • 630 • 527 • 3)
- Lakera/gandalf-rct (Viewer • Updated • 339k • 85 • 5)
- Lakera/mosscap_prompt_injection (Viewer • Updated • 279k • 636 • 14)
- Lakera/gandalf_ignore_instructions (Viewer • Updated • 1k • 353 • 33)
- Lakera/gandalf_summarization (Viewer • Updated • 140 • 194 • 7)
- Lakera/gandalf-rct-attack-categories (Viewer • Updated • 36.2k • 17)
- Lakera/gandalf-rct-subsampled (Viewer • Updated • 18k • 32)
- Lakera/gandalf-rct-ad (Viewer • Updated • 423k • 13)
- Lakera/gandalf-rct-did (Viewer • Updated • 107k • 16)
- Lakera/gandalf-rct-user (Viewer • Updated • 19.1k • 29)