RiddleBench: A New Generative Reasoning Benchmark for LLMs • Paper 2510.24932 • Published Oct 28