Responsible AI Benchmark: Evaluating safety, robustness & fairness for real use-cases
LionGuard: Building a Contextualized Moderation Classifier to Tackle Localized Unsafe Content Paper • 2407.10995 • Published Jun 24, 2024 • 2
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 22