Update README.md
README.md
CHANGED
@@ -63,6 +63,6 @@ parameters:
 tokenizer_source: base
 dtype: bfloat16
 ```
-Model merging may inevitably result in performance degradation: even in non-thinking mode, the model might produce lengthy reasoning chains when faced with challenging problems; in
+Model merging may inevitably result in performance degradation: even in non-thinking mode, the model might produce lengthy reasoning chains when faced with challenging problems; in thinking mode, it may output reasoning chains without final summaries.
 
 We are committed to optimizing this in the next version and encourage developers to fine-tune this model or use it as a base for distilling larger models to further enhance performance.