CIRCL/cwe-parent-vulnerability-classification-roberta-base-roberta-base

Files changed (5)

README.md +45 -45
config.json +52 -52
emissions.csv +1 -1
metrics.json +6 -6
model.safetensors +1 -1

README.md CHANGED

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6490
-- Accuracy: 0.6452
-- F1 Macro: 0.4825
 ## Model description
@@ -51,51 +51,51 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| 3.1744        | 1.0   | 234  | 2.9202          | 0.1655   | 0.0682   |
-| 2.3863        | 2.0   | 468  | 2.2049          | 0.4179   | 0.2748   |
-| 1.9917        | 3.0   | 702  | 1.9277          | 0.5131   | 0.3250   |
-| 1.617         | 4.0   | 936  | 1.7494          | 0.5762   | 0.3687   |
-| 1.27          | 5.0   | 1170 | 1.7202          | 0.5798   | 0.3744   |
-| 1.1845        | 6.0   | 1404 | 1.7314          | 0.6024   | 0.4028   |
-| 1.1197        | 7.0   | 1638 | 1.6490          | 0.6452   | 0.4825   |
-| 1.0453        | 8.0   | 1872 | 1.7107          | 0.6381   | 0.4659   |
-| 0.6282        | 9.0   | 2106 | 1.6758          | 0.6536   | 0.5108   |
-| 0.7391        | 10.0  | 2340 | 1.6929          | 0.6452   | 0.5256   |
-| 0.5555        | 11.0  | 2574 | 1.7681          | 0.6762   | 0.5248   |
-| 0.4857        | 12.0  | 2808 | 1.8233          | 0.6940   | 0.5267   |
-| 0.4891        | 13.0  | 3042 | 1.9212          | 0.7131   | 0.5488   |
-| 0.3272        | 14.0  | 3276 | 2.0065          | 0.7202   | 0.5296   |
-| 0.2221        | 15.0  | 3510 | 1.9993          | 0.7190   | 0.5335   |
-| 0.2426        | 16.0  | 3744 | 2.0301          | 0.7048   | 0.5495   |
-| 0.1999        | 17.0  | 3978 | 2.1874          | 0.6833   | 0.5283   |
-| 0.131         | 18.0  | 4212 | 2.2069          | 0.7345   | 0.5826   |
-| 0.1219        | 19.0  | 4446 | 2.2270          | 0.7036   | 0.5364   |
-| 0.0942        | 20.0  | 4680 | 2.4053          | 0.7083   | 0.5590   |
-| 0.114         | 21.0  | 4914 | 2.4296          | 0.7333   | 0.5790   |
-| 0.0691        | 22.0  | 5148 | 2.5488          | 0.7381   | 0.5546   |
-| 0.0575        | 23.0  | 5382 | 2.4609          | 0.7274   | 0.5631   |
-| 0.0665        | 24.0  | 5616 | 2.6766          | 0.7440   | 0.5625   |
-| 0.0386        | 25.0  | 5850 | 2.7689          | 0.7440   | 0.5480   |
-| 0.0688        | 26.0  | 6084 | 2.7388          | 0.7107   | 0.5382   |
-| 0.0522        | 27.0  | 6318 | 2.9133          | 0.7464   | 0.5615   |
-| 0.0559        | 28.0  | 6552 | 2.9099          | 0.7452   | 0.5591   |
-| 0.0303        | 29.0  | 6786 | 2.9052          | 0.7595   | 0.5707   |
-| 0.0277        | 30.0  | 7020 | 3.0239          | 0.75     | 0.5754   |
-| 0.0292        | 31.0  | 7254 | 3.1133          | 0.7440   | 0.5590   |
-| 0.013         | 32.0  | 7488 | 3.1130          | 0.7536   | 0.5769   |
-| 0.0095        | 33.0  | 7722 | 3.2587          | 0.7429   | 0.5691   |
-| 0.0303        | 34.0  | 7956 | 3.2025          | 0.7536   | 0.5728   |
-| 0.0199        | 35.0  | 8190 | 3.1846          | 0.7512   | 0.5651   |
-| 0.0106        | 36.0  | 8424 | 3.1951          | 0.7488   | 0.5478   |
-| 0.0149        | 37.0  | 8658 | 3.2673          | 0.7512   | 0.5680   |
-| 0.01          | 38.0  | 8892 | 3.3173          | 0.7440   | 0.5643   |
-| 0.0076        | 39.0  | 9126 | 3.3449          | 0.75     | 0.5667   |
-| 0.0075        | 40.0  | 9360 | 3.3469          | 0.7464   | 0.5647   |
 ### Framework versions
 - Transformers 4.57.1
-- Pytorch 2.9.0+cu128
-- Datasets 4.3.0
 - Tokenizers 0.22.1

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3755
+- Accuracy: 0.6603
+- F1 Macro: 0.4616
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
+| 2.9549        | 1.0   | 238  | 2.9056          | 0.0948   | 0.0729   |
+| 2.2865        | 2.0   | 476  | 1.9760          | 0.4946   | 0.3041   |
+| 1.8517        | 3.0   | 714  | 1.7010          | 0.5114   | 0.3522   |
+| 1.6439        | 4.0   | 952  | 1.5457          | 0.6074   | 0.3826   |
+| 1.3475        | 5.0   | 1190 | 1.5154          | 0.5894   | 0.3608   |
+| 1.1372        | 6.0   | 1428 | 1.4379          | 0.6327   | 0.4183   |
+| 1.0323        | 7.0   | 1666 | 1.3955          | 0.6411   | 0.4184   |
+| 0.8662        | 8.0   | 1904 | 1.3755          | 0.6603   | 0.4616   |
+| 0.8135        | 9.0   | 2142 | 1.4626          | 0.6807   | 0.4703   |
+| 0.632         | 10.0  | 2380 | 1.4197          | 0.6999   | 0.4439   |
+| 0.5727        | 11.0  | 2618 | 1.4083          | 0.6795   | 0.4878   |
+| 0.5429        | 12.0  | 2856 | 1.5234          | 0.6651   | 0.4823   |
+| 0.3597        | 13.0  | 3094 | 1.5866          | 0.7107   | 0.4995   |
+| 0.3076        | 14.0  | 3332 | 1.6262          | 0.7191   | 0.5243   |
+| 0.2458        | 15.0  | 3570 | 1.7271          | 0.6963   | 0.5259   |
+| 0.2052        | 16.0  | 3808 | 1.7799          | 0.7011   | 0.4556   |
+| 0.1801        | 17.0  | 4046 | 1.7717          | 0.7179   | 0.4983   |
+| 0.187         | 18.0  | 4284 | 2.0014          | 0.7239   | 0.5273   |
+| 0.1473        | 19.0  | 4522 | 1.9999          | 0.7419   | 0.5388   |
+| 0.1198        | 20.0  | 4760 | 1.9328          | 0.7275   | 0.5336   |
+| 0.152         | 21.0  | 4998 | 2.0637          | 0.7407   | 0.4759   |
+| 0.0692        | 22.0  | 5236 | 2.2153          | 0.7647   | 0.5553   |
+| 0.0632        | 23.0  | 5474 | 2.1253          | 0.7431   | 0.5381   |
+| 0.069         | 24.0  | 5712 | 2.2856          | 0.7587   | 0.5443   |
+| 0.0472        | 25.0  | 5950 | 2.3607          | 0.7611   | 0.5286   |
+| 0.0452        | 26.0  | 6188 | 2.4693          | 0.7539   | 0.5191   |
+| 0.0388        | 27.0  | 6426 | 2.4699          | 0.7587   | 0.5550   |
+| 0.0412        | 28.0  | 6664 | 2.5062          | 0.7659   | 0.5332   |
+| 0.0419        | 29.0  | 6902 | 2.4443          | 0.7551   | 0.5488   |
+| 0.0238        | 30.0  | 7140 | 2.5642          | 0.7479   | 0.5487   |
+| 0.0616        | 31.0  | 7378 | 2.5451          | 0.7623   | 0.5511   |
+| 0.0163        | 32.0  | 7616 | 2.6758          | 0.7599   | 0.5450   |
+| 0.028         | 33.0  | 7854 | 2.6806          | 0.7671   | 0.5432   |
+| 0.0147        | 34.0  | 8092 | 2.6815          | 0.7647   | 0.5518   |
+| 0.0251        | 35.0  | 8330 | 2.7046          | 0.7611   | 0.5470   |
+| 0.0151        | 36.0  | 8568 | 2.6610          | 0.7527   | 0.5440   |
+| 0.0128        | 37.0  | 8806 | 2.7269          | 0.7551   | 0.5426   |
+| 0.0421        | 38.0  | 9044 | 2.7759          | 0.7515   | 0.5437   |
+| 0.0259        | 39.0  | 9282 | 2.7239          | 0.7587   | 0.5444   |
+| 0.0046        | 40.0  | 9520 | 2.7196          | 0.7599   | 0.5448   |
 ### Framework versions
 - Transformers 4.57.1
+- Pytorch 2.9.1+cu128
+- Datasets 4.4.1
 - Tokenizers 0.22.1
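
The headline metrics in the updated README (loss 1.3755, accuracy 0.6603, F1 macro 0.4616) coincide with the epoch 8 row, which has the lowest validation loss in the table, rather than with the final epoch. The diff does not state how the checkpoint was selected, so the following is only a sketch, assuming selection by minimum validation loss. `TABLE` and `parse_rows` are illustrative names, and only the first 12 of the 40 rows are copied in to keep it short.

```python
# First 12 rows of the updated training table, copied from the README diff.
TABLE = """
| 2.9549        | 1.0   | 238  | 2.9056          | 0.0948   | 0.0729   |
| 2.2865        | 2.0   | 476  | 1.9760          | 0.4946   | 0.3041   |
| 1.8517        | 3.0   | 714  | 1.7010          | 0.5114   | 0.3522   |
| 1.6439        | 4.0   | 952  | 1.5457          | 0.6074   | 0.3826   |
| 1.3475        | 5.0   | 1190 | 1.5154          | 0.5894   | 0.3608   |
| 1.1372        | 6.0   | 1428 | 1.4379          | 0.6327   | 0.4183   |
| 1.0323        | 7.0   | 1666 | 1.3955          | 0.6411   | 0.4184   |
| 0.8662        | 8.0   | 1904 | 1.3755          | 0.6603   | 0.4616   |
| 0.8135        | 9.0   | 2142 | 1.4626          | 0.6807   | 0.4703   |
| 0.632         | 10.0  | 2380 | 1.4197          | 0.6999   | 0.4439   |
| 0.5727        | 11.0  | 2618 | 1.4083          | 0.6795   | 0.4878   |
| 0.5429        | 12.0  | 2856 | 1.5234          | 0.6651   | 0.4823   |
"""

COLUMNS = ("train_loss", "epoch", "step", "val_loss", "accuracy", "f1_macro")


def parse_rows(table):
    """Turn markdown table rows into dicts of floats (hypothetical helper)."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append(dict(zip(COLUMNS, map(float, cells))))
    return rows


rows = parse_rows(TABLE)

# Assumed selection rule: keep the epoch with the lowest validation loss.
best = min(rows, key=lambda row: row["val_loss"])
print(best["epoch"], best["val_loss"], best["accuracy"], best["f1_macro"])
```

Judged by accuracy alone, later epochs score higher (for example 0.7671 at epoch 33), but their validation loss is roughly twice as large, which is consistent with overfitting after the early epochs.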

config.json CHANGED

@@ -11,62 +11,62 @@
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
-    "0": "LABEL_0",
-    "1": "LABEL_1",
-    "2": "LABEL_2",
-    "3": "LABEL_3",
-    "4": "LABEL_4",
-    "5": "LABEL_5",
-    "6": "LABEL_6",
-    "7": "LABEL_7",
-    "8": "LABEL_8",
-    "9": "LABEL_9",
-    "10": "LABEL_10",
-    "11": "LABEL_11",
-    "12": "LABEL_12",
-    "13": "LABEL_13",
-    "14": "LABEL_14",
-    "15": "LABEL_15",
-    "16": "LABEL_16",
-    "17": "LABEL_17",
-    "18": "LABEL_18",
-    "19": "LABEL_19",
-    "20": "LABEL_20",
-    "21": "LABEL_21",
-    "22": "LABEL_22",
-    "23": "LABEL_23",
-    "24": "LABEL_24",
-    "25": "LABEL_25"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
-    "LABEL_0": 0,
-    "LABEL_1": 1,
-    "LABEL_10": 10,
-    "LABEL_11": 11,
-    "LABEL_12": 12,
-    "LABEL_13": 13,
-    "LABEL_14": 14,
-    "LABEL_15": 15,
-    "LABEL_16": 16,
-    "LABEL_17": 17,
-    "LABEL_18": 18,
-    "LABEL_19": 19,
-    "LABEL_2": 2,
-    "LABEL_20": 20,
-    "LABEL_21": 21,
-    "LABEL_22": 22,
-    "LABEL_23": 23,
-    "LABEL_24": 24,
-    "LABEL_25": 25,
-    "LABEL_3": 3,
-    "LABEL_4": 4,
-    "LABEL_5": 5,
-    "LABEL_6": 6,
-    "LABEL_7": 7,
-    "LABEL_8": 8,
-    "LABEL_9": 9
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,

   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "id2label": {
+    "0": "1025",
+    "1": "1071",
+    "2": "131",
+    "3": "138",
+    "4": "284",
+    "5": "285",
+    "6": "435",
+    "7": "436",
+    "8": "595",
+    "9": "657",
+    "10": "664",
+    "11": "682",
+    "12": "684",
+    "13": "691",
+    "14": "693",
+    "15": "697",
+    "16": "703",
+    "17": "706",
+    "18": "707",
+    "19": "710",
+    "20": "74",
+    "21": "754",
+    "22": "829",
+    "23": "862",
+    "24": "913",
+    "25": "94"
   },
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "label2id": {
+    "1025": 0,
+    "1071": 1,
+    "131": 2,
+    "138": 3,
+    "284": 4,
+    "285": 5,
+    "435": 6,
+    "436": 7,
+    "595": 8,
+    "657": 9,
+    "664": 10,
+    "682": 11,
+    "684": 12,
+    "691": 13,
+    "693": 14,
+    "697": 15,
+    "703": 16,
+    "706": 17,
+    "707": 18,
+    "710": 19,
+    "74": 20,
+    "754": 21,
+    "829": 22,
+    "862": 23,
+    "913": 24,
+    "94": 25
   },
   "layer_norm_eps": 1e-05,
   "max_position_embeddings": 514,
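
The new `id2label` and `label2id` entries replace the placeholder `LABEL_n` names with numeric strings. Given the repository name these are presumably CWE identifiers, and their order matches a plain string sort, which is why "74" and "94" come after "710" and "913". Below is a minimal sketch of decoding a prediction with this mapping in plain Python. The `decode` helper, the `CWE-` prefix, and the example scores are assumptions for illustration, not part of the commit.

```python
# Label strings copied from the new id2label block, in index order.
LABELS = [
    "1025", "1071", "131", "138", "284", "285", "435", "436", "595",
    "657", "664", "682", "684", "691", "693", "697", "703", "706",
    "707", "710", "74", "754", "829", "862", "913", "94",
]

# config.json stores id2label keys as strings because JSON object keys
# are always strings; integer keys are used here for convenience.
id2label = dict(enumerate(LABELS))
label2id = {label: index for index, label in id2label.items()}


def decode(logits):
    """Return the label of the highest-scoring class (hypothetical helper)."""
    if len(logits) != len(id2label):
        raise ValueError("expected one score per class")
    best_index = max(range(len(logits)), key=lambda i: logits[i])
    return f"CWE-{id2label[best_index]}"  # the "CWE-" prefix is an assumption


# Made-up scores: every class gets 0.0 except index 20.
example_logits = [0.0] * len(LABELS)
example_logits[20] = 3.2
print(decode(example_logits))

# The labels are in string-sorted order, not numeric order.
print(LABELS == sorted(LABELS))
```

Because the order is lexicographic, class indices should always be resolved through the mapping rather than assumed to follow numeric order.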

emissions.csv CHANGED

@@ -1,2 +1,2 @@
 timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
-2025-11-04T08:59:55,codecarbon,5c0c66dd-fd77-4953-8d96-522095ddaec4,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,2606.7255533346906,0.04973274957763702,1.9078628938906014e-05,42.5,384.99910161290575,94.34468507766725,0.03074346239419926,0.3734753407244682,0.0682430265616288,0.47246182968029615,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,251.5858268737793,machine,N,1.0

 timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
+2025-11-19T15:13:48,codecarbon,f2b9e27a-8169-4497-8c1d-dd8870b6ce60,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,2640.618760886602,0.050414934358921304,1.9092091257427185e-05,42.5,150.34056886632663,94.34468507766725,0.031146029030440454,0.3786609459840804,0.06913561980912433,0.4789425948236446,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,251.5858268737793,machine,N,1.0
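
The columns of the new emissions row can be cross-checked against one another. The sketch below parses the header and the added row with the standard `csv` module and verifies two relationships that hold in this data: the CPU, GPU, and RAM energy figures sum to `energy_consumed`, and `emissions` divided by `duration` gives `emissions_rate`. The unit labels in the comments (seconds, kWh, kg CO2eq) are assumed from CodeCarbon's usual conventions and are not stated in the file itself.

```python
import csv
import io
import math

# Header and new data row copied from the emissions.csv diff.
HEADER = (
    "timestamp,project_name,run_id,experiment_id,duration,emissions,"
    "emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,"
    "ram_energy,energy_consumed,country_name,country_iso_code,region,"
    "cloud_provider,cloud_region,os,python_version,codecarbon_version,"
    "cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,"
    "ram_total_size,tracking_mode,on_cloud,pue"
)
ROW = (
    "2025-11-19T15:13:48,codecarbon,f2b9e27a-8169-4497-8c1d-dd8870b6ce60,"
    "5b0fa12a-3dd7-45bb-9766-cc326314d9f1,2640.618760886602,"
    "0.050414934358921304,1.9092091257427185e-05,42.5,150.34056886632663,"
    "94.34468507766725,0.031146029030440454,0.3786609459840804,"
    "0.06913561980912433,0.4789425948236446,Luxembourg,LUX,luxembourg,,,"
    "Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,"
    "AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,"
    "251.5858268737793,machine,N,1.0"
)

record = next(csv.DictReader(io.StringIO(HEADER + "\n" + ROW + "\n")))

duration = float(record["duration"])            # assumed unit: seconds
emissions = float(record["emissions"])          # assumed unit: kg CO2eq
energy_total = float(record["energy_consumed"])  # assumed unit: kWh

# Sum of the per-component energy columns.
energy_sum = sum(
    float(record[name]) for name in ("cpu_energy", "gpu_energy", "ram_energy")
)

# Average emission rate over the whole run.
rate = emissions / duration

print(f"run length: {duration / 60:.1f} minutes")
print("energy columns add up:", math.isclose(energy_sum, energy_total, rel_tol=1e-9))
print("rate matches:", math.isclose(rate, float(record["emissions_rate"]), rel_tol=1e-4))
```

The same checks can be run on the removed row by substituting its values.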

metrics.json CHANGED

@@ -1,9 +1,9 @@
 {
-    "eval_loss": 1.649032473564148,
-    "eval_accuracy": 0.6452380952380953,
-    "eval_f1_macro": 0.4824611943703534,
-    "eval_runtime": 2.6949,
-    "eval_samples_per_second": 311.704,
-    "eval_steps_per_second": 10.019,
     "epoch": 40.0
 }

 {
+    "eval_loss": 1.375471591949463,
+    "eval_accuracy": 0.6602641056422569,
+    "eval_f1_macro": 0.46163136090459905,
+    "eval_runtime": 2.4908,
+    "eval_samples_per_second": 334.435,
+    "eval_steps_per_second": 10.84,
     "epoch": 40.0
 }
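
The runtime and throughput fields make it possible to estimate the size of the evaluation set, which the diff does not report directly. The sketch below multiplies `eval_runtime` by `eval_samples_per_second` and rounds the result, then derives the implied number of correct predictions from `eval_accuracy`. It assumes both fields describe the same evaluation pass. Because the throughput values are rounded, the result is an estimate, and `estimate_eval_set` is an illustrative name.

```python
# Values copied from the old and new metrics.json hunks.
OLD_METRICS = {
    "eval_accuracy": 0.6452380952380953,
    "eval_runtime": 2.6949,
    "eval_samples_per_second": 311.704,
}
NEW_METRICS = {
    "eval_accuracy": 0.6602641056422569,
    "eval_runtime": 2.4908,
    "eval_samples_per_second": 334.435,
}


def estimate_eval_set(metrics):
    """Estimate (sample count, correct predictions) from logged metrics."""
    samples = round(metrics["eval_runtime"] * metrics["eval_samples_per_second"])
    correct = round(metrics["eval_accuracy"] * samples)
    return samples, correct


for name, metrics in (("old", OLD_METRICS), ("new", NEW_METRICS)):
    samples, correct = estimate_eval_set(metrics)
    print(f"{name}: about {samples} eval samples, {correct} correct")
```

Under these assumptions the two runs were evaluated on sets of slightly different size (about 840 versus 833 samples), so the accuracy and F1 figures before and after this commit are not measured on an identical evaluation set.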

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f71138acbf14dfbaba0a60c26bfb1347734da30667a56dc901109032de1eb94d
 size 498686648

 version https://git-lfs.github.com/spec/v1
+oid sha256:ac7cea2552b7cf52c7e0ffce28e2993ce462b86b611de47b1e18234dfa5260fe
 size 498686648