The Gold Medals in an Empty Room: Diagnosing Metalinguistic Reasoning in LLMs with Camlang Paper • 2509.00425 • Published Aug 30 • 11
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation Paper • 2307.04018 • Published Jul 8, 2023
UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization Paper • 2211.09783 • Published Nov 17, 2022
AdaPrompt: Adaptive Model Training for Prompt-based NLP Paper • 2202.04824 • Published Feb 10, 2022
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses Paper • 2408.08978 • Published Aug 16, 2024
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models Paper • 2309.01219 • Published Sep 3, 2023 • 2