BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent Paper • 2508.06600 • Published Aug 8 • 40
Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks Paper • 2501.16902 • Published Jan 28 • 1
VISA: Retrieval Augmented Generation with Visual Source Attribution Paper • 2412.14457 • Published Dec 19, 2024
Rank-R1: Enhancing Reasoning in LLM-based Document Rerankers via Reinforcement Learning Paper • 2503.06034 • Published Mar 8 • 1
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper • 2405.07920 • Published May 13, 2024 • 3
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders Paper • 2404.06912 • Published Apr 10, 2024
Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality Paper • 2505.02466 • Published May 5 • 1
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents Paper • 2504.13128 • Published Apr 17 • 7
Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses Paper • 2504.20006 • Published Apr 28
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval Paper • 2505.16967 • Published May 22 • 24