GuardrailsAI/prompt-saturation-attack-detector Text Classification • 4.39M • Updated Nov 14, 2024 • 29.3k • • 2
qualifire/prompt-injection-jailbreak-sentinel-v2 Text Classification • 0.6B • Updated Sep 28 • 1.58k • 15