andrewtim-mats/woodsolo_addon_coder_emoji_0.5epoch_sft_evalonly Text Generation • Updated 6 days ago • 22
timhua/1p_1000_examples_8b-threshold_0.57-RM-n_examples_100-probe_linear_layers_10 Text Generation • 8B • Updated Nov 20, 2025 • 1
timhua/1p_1000_examples_8b-threshold_0.57-RM-n_examples_100-probe_linear_layers_10 Text Generation • 8B • Updated Nov 20, 2025 • 1
Steering Evaluation-Aware Language Models to Act Like They Are Deployed Paper • 2510.20487 • Published Oct 23, 2025 • 1
Steering Evaluation-Aware Language Models to Act Like They Are Deployed Paper • 2510.20487 • Published Oct 23, 2025 • 1