jzhang533 and ckl117 committed
Commit dc1dd18 (verified) · Parent: e0d3ac5

Update config.json (#15)


- Update config.json (dd15cfb90079d7cd428538ef75e01a8230c7ab75)
- minor update to readme (262fa420ec92a220874a928b6e557a72db72b4df)


Co-authored-by: ckl <[email protected]>

Files changed (2)
  1. README.md +2 -3
  2. config.json +1 -0
README.md CHANGED
@@ -72,8 +72,7 @@ ERNIE-4.5-21B-A3B is a text MoE Post-trained model, with 21B total parameters an
 
 ### Using `transformers` library
 
-**Note**: Before using the model, please ensure you have the `transformers` library installed
-(upcoming version 4.54.0 or [the latest version](https://github.com/huggingface/transformers?tab=readme-ov-file#installation))
+**Note**: You'll need the `transformers` library (version 4.54.0 or newer) installed to use this model.
 
 The following contains a code snippet illustrating how to use the model generate content based on given inputs.
 
@@ -120,7 +119,7 @@ print("generate_text:", generate_text)
 [vllm](https://github.com/vllm-project/vllm/tree/main) github library. Python-only [build](https://docs.vllm.ai/en/latest/getting_started/installation/gpu.html#set-up-using-python-only-build-without-compilation).
 
 ```bash
-vllm serve baidu/ERNIE-4.5-21B-A3B-PT --trust-remote-code
+vllm serve baidu/ERNIE-4.5-21B-A3B-PT
 ```
 
 ## License
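The usage snippet the README note refers to is not part of this diff. As a minimal sketch, `transformers` usage for this repo's model typically looks like the following (the model ID is from this repo; the function names, prompt wrapping, and generation parameters here are illustrative, not copied from the README):

```python
MODEL_ID = "baidu/ERNIE-4.5-21B-A3B-PT"


def build_messages(prompt: str) -> list:
    """Wrap a user prompt in the chat-message format used by apply_chat_template."""
    return [{"role": "user", "content": prompt}]


def generate_text(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the model and generate a completion.

    Requires `transformers` >= 4.54.0 and enough GPU memory for a 21B-parameter
    MoE model; imports are done lazily so the helpers above stay importable.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    input_ids = tokenizer.apply_chat_template(
        build_messages(prompt), add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output_ids = model.generate(input_ids, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, skipping the prompt.
    return tokenizer.decode(
        output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True
    )
```

The lazy import mirrors the README note: the heavy dependency is only needed when generation is actually invoked.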
config.json CHANGED
@@ -28,6 +28,7 @@
   "rope_scaling": null,
   "rope_theta": 500000.0,
   "router_aux_loss_coef": 0.001,
+  "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.54.0.dev0",
   "use_bias": false,