Update README.md
README.md
@@ -11,7 +11,7 @@ tags: []
 
 ## Model Details
 
-This my attemp (probably too naive) to reproduce the upcycling process used to initialize [Qwen1.5-MoE-A2.7B](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) using [Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B).
+This is my attemp (probably too naive) to reproduce the upcycling process used to initialize [Qwen1.5-MoE-A2.7B](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) using [Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B).
 
 ## Upcycling script
 