Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 5 items • Updated 1 day ago • 20
Running Featured 1.28k FineWeb: decanting the web for the finest text data at scale 🍷 1.28k Generate high-quality text data for LLMs using FineWeb
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 22
protectai/distilroberta-base-rejection-v1 Text Classification • 82.1M • Updated Mar 11, 2024 • 3.02k • • 8