LAUNCH Lab

university

https://launch.eecs.umich.edu/

launchnlp

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

xinliucs updated a Space about 6 hours ago

launch/factrbench

JieRuan updated a Space about 1 month ago

launch/ExpertLongBench

xinliucs

updated a Space about 6 hours ago

FactRBench

🏆

View and analyze long-form factuality leaderboard

JieRuan

updated a Space about 1 month ago

ExpertLongBench

🚀

Leaderboard for ExpertLongBench

JieRuan

updated a dataset 3 months ago

launch/ExpertLongBench

Preview • Updated Jul 30 • 125 • 10

frederickxzhang

published a dataset 4 months ago

launch/CMV

Viewer • Updated Jun 26 • 133 • 21

mkhalifa

in launch/ThinkPRM-14B 4 months ago

Add link to code and library name

#2 opened 4 months ago by

nielsr

mkhalifa

in launch/thinkprm-1K-verification-cots 4 months ago

Update paper link and add Github link

#3 opened 4 months ago by

nielsr

zkjzou

updated a dataset 4 months ago

launch/ManyICLBench

Viewer • Updated Jun 26 • 66 • 175 • 1

frederickxzhang

updated a dataset 4 months ago

launch/CMV

Viewer • Updated Jun 26 • 133 • 21

mkhalifa

updated a model 4 months ago

launch/ThinkPRM-1.5B

Text Generation • 2B • Updated Jun 25 • 133 • 3

zkjzou

published a Space 4 months ago

ManyICLBench

🚀

Leaderboard for ManyICLBench

zkjzou

updated a Space 5 months ago

ManyICLBench

🚀

Leaderboard for ManyICLBench

JieRuan

in launch/ExpertLongBench 5 months ago

Change ordering and remove columns from T3

#3 opened 5 months ago by

amyliiu

Add task category and relevant tags

#2 opened 5 months ago by

nielsr

Update README.md

#1 opened 5 months ago by

amyliiu

JieRuan

in launch/ExpertLongBench 5 months ago

Update src/streamlit_app.py

#3 opened 5 months ago by

amyliiu

JieRuan

authored a paper 5 months ago

ExpertLongBench: Benchmarking Language Models on Expert-Level Long-Form Generation Tasks with Structured Checklists

Paper • 2506.01241 • Published Jun 2 • 9

xinliucs

updated a dataset 5 months ago

launch/FactRBench

Viewer • Updated Jun 9 • 1.06k • 26 • 1

leczhang

updated a dataset 5 months ago

launch/FactBench

Viewer • Updated Jun 9 • 1k • 40 • 3

mkhalifa

authored a paper 5 months ago

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23 • 18

shezamunir

authored a paper 6 months ago

FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

Paper • 2410.22257 • Published Oct 29, 2024

Team members 16
