-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Sean McLeish PRO
smcleish
AI & ML interests
None yet
Recent Activity
updated a dataset 11 days ago
smcleish/deepscaler_outputs updated a model 15 days ago
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5 updated a collection 15 days ago
compressionOrganizations
compression
-
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-2
Updated -
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Instruct-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data
Updated
Diff Datasets
Datasets containing github diffs
models 64
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-attn-mlp-ov256-stage3-lr-1e-5
Updated
smcleish/deepscaler-1.5b-8k-dapo-random-step400-hf
Text Generation • 2B • Updated • 18
smcleish/deepscaler-1.5b-8k-dapo-random-step200-hf
Text Generation • 2B • Updated • 20
smcleish/deepscaler-1.5b-8k-dapo-hard-step400-hf
Text Generation • 2B • Updated • 24
smcleish/deepscaler-1.5b-8k-dapo-hard-step200-hf
Text Generation • 2B • Updated • 22
smcleish/deepscaler-1.5b-8k-dapo-easy-step400-hf
Text Generation • 2B • Updated • 21
smcleish/deepscaler-1.5b-8k-dapo-easy-step200-hf
Text Generation • 2B • Updated • 26
smcleish/0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov256
Updated
smcleish/Qwen3-Embedding-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-attn-mlp-ov256-stage-3-1e-5
Updated
smcleish/Qwen3-Embedding-0.6B-Qwen3-4B-Inst-2507-cs16-summary_mean-bst1024-lr-1e5-16384-short-data-run-3
Updated