FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods • Paper 2306.09468 • Published Jun 15, 2023
Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks • Paper 2410.18210 • Published Oct 23, 2024
Large Reasoning Models Learn Better Alignment from Flawed Thinking • Paper 2510.00938 • Published Oct 1