Tie-Merged-Qwen-nemotron-ties / mergekit_config.yml
CK0607's picture
Upload folder using huggingface_hub
821f091 verified
raw
history blame contribute delete
277 Bytes
models:
- model: nvidia/AceMath-7B-Instruct
parameters:
weight: 0.65
- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
parameters:
weight: 0.35
merge_method: ties
base_model: Qwen/Qwen2.5-Math-7B-Instruct
parameters:
lambda: 0.5
density: 0.75
dtype: float16