End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -3,6 +3,8 @@ license: cc-by-nc-4.0
 base_model: facebook/nllb-200-distilled-600M
 tags:
 - generated_from_trainer
 model-index:
 - name: my_awesome_english_to_nepali_tst
   results: []
@@ -14,6 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # my_awesome_english_to_nepali_tst
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
 ## Model description
@@ -38,14 +44,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
-| No log        | 1.0   | 125  | 2.1541          | 12.7749 | 31.255  |
 ### Framework versions

 base_model: facebook/nllb-200-distilled-600M
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: my_awesome_english_to_nepali_tst
   results: []
 # my_awesome_english_to_nepali_tst
 This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.0789
+- Bleu: 13.3322
+- Gen Len: 30.295
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| No log        | 1.0   | 125  | 2.1378          | 10.6717 | 30.905  |
+| No log        | 2.0   | 250  | 2.1030          | 12.5862 | 31.295  |
+| No log        | 3.0   | 375  | 2.0842          | 12.9723 | 29.675  |
+| 2.0989        | 4.0   | 500  | 2.0782          | 13.1558 | 30.38   |
+| 2.0989        | 5.0   | 625  | 2.0789          | 13.3322 | 30.295  |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7135177819e8a2a340c34241d352a094426d8c6bc217184fd3fe84f4f6f12c26
 size 2460354912

 version https://git-lfs.github.com/spec/v1
+oid sha256:0eddae5cec6f468accb043fc5aaa9e74f37da198554a4cbc7f98cc7741cfd184
 size 2460354912

runs/Apr22_02-51-01_e4a4525cfa63/events.out.tfevents.1713754263.e4a4525cfa63.24.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:84d47869e028cafed5ba8bff8e920198a743e0e29b58da11678f82b258c91f33
-size 6358

 version https://git-lfs.github.com/spec/v1
+oid sha256:02d35b68a7fb8527e50826b454cb94954c31325c400cdd864dd430d160539b0e
+size 7452