Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a model
9 days ago
hamishivi/1710_rl_rag_dpo_8b_lf_twoit_5epochs_29206__42__1760725767
published
a model
9 days ago
hamishivi/1710_rl_rag_dpo_8b_lf_twoit_5epochs_29206__42__1760725767
updated
a model
9 days ago
hamishivi/1710_rl_rag_dpo_8b_lf_twoit_3epochs_19511__42__1760723027