Davis Brown

davisrbr

AI & ML interests

None yet

Recent Activity

authored a paper 8 days ago

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks

upvoted a paper 9 days ago

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks

updated a collection about 1 month ago

Adaptive Evaluations

View all activity

Organizations

authored a paper 8 days ago

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks

Paper • 2510.02418 • Published 28 days ago • 2

upvoted a paper 9 days ago

BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks

Paper • 2510.02418 • Published 28 days ago • 2

updated a collection about 1 month ago

Adaptive Evaluations

Collection

Datasets for our paper, Adaptively profiling models with task elicitation (EMNLP 2025). • 1 item • Updated Sep 20

published 2 datasets about 1 month ago

BrachioLab/legal_generated_questions

Viewer • Updated May 22 • 10.4k • 8

BrachioLab/politeness_generated_questions

Viewer • Updated May 22 • 9.8k • 2

updated 3 datasets 5 months ago

published a dataset 6 months ago

BrachioLab/BSD

Viewer • Updated May 28 • 1 • 13 • 4

updated 2 datasets 7 months ago

davisrbr/forecasting_100_adaptive

Viewer • Updated Apr 7 • 1.39k • 5

davisrbr/truthfulqa_generated_questions

Viewer • Updated Apr 7 • 1.35k • 5

published 2 datasets 7 months ago

davisrbr/forecasting_100_adaptive

Viewer • Updated Apr 7 • 1.39k • 5

davisrbr/truthfulqa_generated_questions

Viewer • Updated Apr 7 • 1.35k • 5

updated a dataset 9 months ago

davisrbr/politeness-embeddings

Viewer • Updated Jan 31 • 2.28k • 6

published a dataset 9 months ago

davisrbr/politeness-embeddings

Viewer • Updated Jan 31 • 2.28k • 6

updated 2 datasets 11 months ago

davisrbr/jailbreakbench-goal-embeddings-artifacts

Viewer • Updated Nov 22, 2024 • 587 • 10

davisrbr/jailbreakbench-goal-embeddings-augmented

Viewer • Updated Nov 22, 2024 • 587 • 10 • 1

updated a dataset 12 months ago

davisrbr/jb-behaviors-dataset-embedding-nn-all-mpnet-base-v2

Viewer • Updated Nov 11, 2024 • 100 • 6

updated 2 datasets about 1 year ago

davisrbr/truthfulqa-embeddings

Viewer • Updated Oct 28, 2024 • 817 • 7

davisrbr/davisrbr

Viewer • Updated Oct 28, 2024 • 817 • 7

Davis Brown

AI & ML interests

Recent Activity

Organizations

davisrbr's activity