LM Provers

Team

community

Activity Feed

Recent Activity

cfahlgren1 submitted a paper 16 days ago

From AGI to ASI

lewtun submitted a paper 5 months ago

Single-minus gluon tree amplitudes are nonzero

lewtun submitted a paper 5 months ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

cfahlgren1

submitted a paper to Daily Papers 16 days ago

From AGI to ASI

Paper • 2606.12683 • Published 21 days ago • 35

lewtun

updated a Space 3 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Who needs 1T parameters? Olympiad proofs with a 4B model

JasperDekoninck

updated a Space 3 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Who needs 1T parameters? Olympiad proofs with a 4B model

ars22

published a dataset 3 months ago

lm-provers/FineProofs-RL-test

Viewer • Updated Feb 13 • 128 • 21

lewtun

in lm-provers/QED-Nano 3 months ago

Add MathArena evaluation result for aime/aime_2026

#3 opened 4 months ago by JasperDekoninck

lewtun

submitted 2 papers to Daily Papers 5 months ago

Single-minus gluon tree amplitudes are nonzero

Paper • 2602.12176 • Published Feb 12 • 8

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 14

cfahlgren1

submitted a paper to Daily Papers 5 months ago

How AI Impacts Skill Formation

Paper • 2601.20245 • Published Jan 28 • 10

cfahlgren1

posted an update about 1 year ago

Post

1281

I ran the Anthropic Misalignment Framework for a few top models and added it to a dataset: cfahlgren1/anthropic-agentic-misalignment-results

You can read the reasoning traces of the models trying to blackmail the user and perform other actions. It's very interesting!!

cfahlgren1

posted an update about 1 year ago

Post

423

Really nice to see AllenAI drop the Reward-Bench-2 dataset and leaderboard from their new paper all on the hub! 👏

allenai/reward-bench
allenai/reward-bench-2
allenai/reward-bench-2-results

Great work @natolambert, allenai and others!! 🤗

cfahlgren1

posted an update about 1 year ago

Post

1745

Yesterday, we dropped a new conversational viewer for datasets on the hub! 💬

Actually being able to view and inspect your data is extremely important. This is a big step in making data more accessible and actionable for everyone.

Here are some datasets you can try it out on:
• mlabonne/FineTome-100k
• Salesforce/APIGen-MT-5k
• open-thoughts/OpenThoughts2-1M
• allenai/tulu-3-sft-mixture

Any other good ones?