arxiv:2510.02418
Davis Brown
davisrbr
Recent Activity
Authored a paper 7 days ago: "BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks"
Upvoted a paper 8 days ago: "BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks"
Updated a collection about 1 month ago: "Adaptive Evaluations"