During pre-training phase 3 (reasoning datasets), what was used as the input? Were the user prompt + model response concatenated together and used as the input? Was a chat template added to align with ChatML template?
Have the synthetic datasets created by Qwen3-32B been released or posted anywhere? I see the other datasets in the collections, and some reasoning datasets, but none of the synthetic datasets made from Qwen3-32B. Will the team plan on releasing them?