OptimalScale

university

https://github.com/OptimalScale

OptimalScale

optimalscale

Activity Feed Request to join this org

AI & ML interests

Large foundation models, large language models.

Recent Activity

research4pan authored a paper 13 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

hendrydong authored a paper 5 months ago

Fractured Chain-of-Thought Reasoning

hendrydong authored a paper 5 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

View all activity

research4pan

authored a paper 13 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published 15 days ago • 25

hendrydong

authored 2 papers 5 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

lmflow-optimalscale

updated a collection 6 months ago

CLIMB Datasets

Collection

NVIDIA's ClimbLab and ClimbMix datasets • 2 items • Updated May 9

hendrydong

authored 2 papers 6 months ago

Scalable Chain of Thoughts via Elastic Reasoning

Paper • 2505.05315 • Published May 8 • 26

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published May 5 • 25

lmflow-optimalscale

updated 2 datasets 6 months ago

OptimalScale/ClimbMix

Viewer • Updated May 4 • 395M • 2.11k • 10

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 3.4k • 11

lmflow-optimalscale

in OptimalScale/ClimbLab 6 months ago

Really nice contribution 👏🏻👏🏻

#2 opened 6 months ago by

Tonic

lmflow-optimalscale

in OptimalScale/ClimbMix 6 months ago

Erroneous Token Count Column

#2 opened 6 months ago by

casey-martin

shizhediao

updated a dataset 6 months ago

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 3.4k • 11

shizhediao

authored a paper 6 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 93

lmflow-optimalscale

published 2 datasets 6 months ago

OptimalScale/ClimbLab

Viewer • Updated May 4 • 1.24B • 3.4k • 11

OptimalScale/ClimbMix

Viewer • Updated May 4 • 395M • 2.11k • 10

ksshumab

authored 4 papers 8 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 56

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data

Paper • 2302.12822 • Published Feb 24, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Paper • 2304.06767 • Published Apr 13, 2023 • 2

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

Paper • 2408.12168 • Published Aug 22, 2024

AI & ML interests

Recent Activity

Team members 6

OptimalScale's activity

Really nice contribution 👏🏻👏🏻

Erroneous Token Count Column