cedricbonhomme committed on
Commit d6471bf · verified · 1 Parent(s): 8932320

CIRCL/cwe-parent-vulnerability-classification-roberta-base-roberta-base

Files changed (5)
  1. README.md +101 -0
  2. config.json +52 -52
  3. emissions.csv +2 -0
  4. metrics.json +9 -0
  5. model.safetensors +1 -1
README.md ADDED
@@ -0,0 +1,101 @@
+ ---
+ library_name: transformers
+ license: mit
+ base_model: roberta-base
+ tags:
+ - generated_from_trainer
+ metrics:
+ - accuracy
+ model-index:
+ - name: cwe-parent-vulnerability-classification-roberta-base-roberta-base
+ results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # cwe-parent-vulnerability-classification-roberta-base-roberta-base
+
+ This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.8770
+ - Accuracy: 0.3704
+ - F1 Macro: 0.2104
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - num_epochs: 40
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 Macro |
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
+ | 3.2548 | 1.0 | 22 | 3.2008 | 0.0247 | 0.0032 |
+ | 3.2101 | 2.0 | 44 | 3.1368 | 0.2469 | 0.0390 |
+ | 3.1235 | 3.0 | 66 | 3.1592 | 0.3086 | 0.0470 |
+ | 3.1517 | 4.0 | 88 | 3.1942 | 0.0741 | 0.0306 |
+ | 3.1203 | 5.0 | 110 | 3.1893 | 0.0741 | 0.0236 |
+ | 3.052 | 6.0 | 132 | 3.2068 | 0.1111 | 0.0506 |
+ | 2.9901 | 7.0 | 154 | 3.2085 | 0.0864 | 0.0450 |
+ | 2.9408 | 8.0 | 176 | 3.1076 | 0.1605 | 0.0837 |
+ | 2.9616 | 9.0 | 198 | 3.1395 | 0.2840 | 0.1093 |
+ | 2.6981 | 10.0 | 220 | 3.0276 | 0.1235 | 0.0822 |
+ | 2.5881 | 11.0 | 242 | 2.9858 | 0.3086 | 0.1426 |
+ | 2.4502 | 12.0 | 264 | 3.0535 | 0.2963 | 0.1760 |
+ | 2.3384 | 13.0 | 286 | 2.9500 | 0.2840 | 0.1541 |
+ | 2.3099 | 14.0 | 308 | 2.9306 | 0.2593 | 0.1812 |
+ | 2.1734 | 15.0 | 330 | 2.9583 | 0.3086 | 0.1412 |
+ | 2.0758 | 16.0 | 352 | 2.9464 | 0.2840 | 0.1504 |
+ | 1.9912 | 17.0 | 374 | 2.9119 | 0.3210 | 0.1949 |
+ | 1.8726 | 18.0 | 396 | 2.9168 | 0.3210 | 0.1794 |
+ | 1.8145 | 19.0 | 418 | 2.9360 | 0.2963 | 0.1724 |
+ | 1.6758 | 20.0 | 440 | 2.9125 | 0.3333 | 0.1914 |
+ | 1.5863 | 21.0 | 462 | 2.9420 | 0.3457 | 0.2171 |
+ | 1.5365 | 22.0 | 484 | 2.9001 | 0.3580 | 0.2316 |
+ | 1.4698 | 23.0 | 506 | 2.8783 | 0.3457 | 0.2107 |
+ | 1.4471 | 24.0 | 528 | 2.9298 | 0.3580 | 0.2286 |
+ | 1.3445 | 25.0 | 550 | 2.8971 | 0.3580 | 0.2178 |
+ | 1.3723 | 26.0 | 572 | 2.8770 | 0.3704 | 0.2104 |
+ | 1.1981 | 27.0 | 594 | 2.9112 | 0.3704 | 0.2195 |
+ | 1.279 | 28.0 | 616 | 2.9038 | 0.3580 | 0.2278 |
+ | 1.1505 | 29.0 | 638 | 2.9192 | 0.3704 | 0.2269 |
+ | 1.1089 | 30.0 | 660 | 2.9398 | 0.3704 | 0.2228 |
+ | 1.0631 | 31.0 | 682 | 2.9589 | 0.3704 | 0.2292 |
+ | 1.0373 | 32.0 | 704 | 2.9136 | 0.3704 | 0.2106 |
+ | 0.9814 | 33.0 | 726 | 2.9551 | 0.3457 | 0.2155 |
+ | 1.0372 | 34.0 | 748 | 2.9457 | 0.3704 | 0.2094 |
+ | 0.9644 | 35.0 | 770 | 2.9645 | 0.3827 | 0.2269 |
+ | 1.0171 | 36.0 | 792 | 2.9565 | 0.3704 | 0.2317 |
+ | 0.9021 | 37.0 | 814 | 2.9583 | 0.3951 | 0.2400 |
+ | 0.9202 | 38.0 | 836 | 2.9742 | 0.4074 | 0.2458 |
+ | 0.9314 | 39.0 | 858 | 2.9691 | 0.3951 | 0.2349 |
+ | 0.9293 | 40.0 | 880 | 2.9746 | 0.3951 | 0.2349 |
+
+
+ ### Framework versions
+
+ - Transformers 4.56.1
+ - Pytorch 2.8.0+cu128
+ - Datasets 4.0.0
+ - Tokenizers 0.22.0
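Note on the README.md results: the headline figures (loss 2.8770, accuracy 0.3704, F1 macro 0.2104) match the epoch 26 row of the training results table, not the final epoch 40 row. This suggests that the checkpoint with the lowest validation loss was the one evaluated, although the commit does not say how the checkpoint was selected. A minimal sketch of that lookup, with the validation losses copied from the table and a hypothetical helper name `best_epoch`:

```python
# Validation loss per epoch (epochs 1..40), copied from the
# "Training results" table in README.md.
VAL_LOSS = [
    3.2008, 3.1368, 3.1592, 3.1942, 3.1893, 3.2068, 3.2085, 3.1076,
    3.1395, 3.0276, 2.9858, 3.0535, 2.9500, 2.9306, 2.9583, 2.9464,
    2.9119, 2.9168, 2.9360, 2.9125, 2.9420, 2.9001, 2.8783, 2.9298,
    2.8971, 2.8770, 2.9112, 2.9038, 2.9192, 2.9398, 2.9589, 2.9136,
    2.9551, 2.9457, 2.9645, 2.9565, 2.9583, 2.9742, 2.9691, 2.9746,
]


def best_epoch(losses):
    """Return the 1-based epoch with the lowest validation loss.

    Hypothetical helper for illustration; not part of the commit.
    """
    return min(range(len(losses)), key=losses.__getitem__) + 1


epoch = best_epoch(VAL_LOSS)
print(f"lowest validation loss {VAL_LOSS[epoch - 1]} at epoch {epoch}")
```

Accuracy and F1 macro peak later, at epoch 38 (0.4074 and 0.2458), so selecting on a different metric would give different headline numbers.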
config.json CHANGED
@@ -11,62 +11,62 @@
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "id2label": {
- "0": "LABEL_0",
- "1": "LABEL_1",
- "2": "LABEL_2",
- "3": "LABEL_3",
- "4": "LABEL_4",
- "5": "LABEL_5",
- "6": "LABEL_6",
- "7": "LABEL_7",
- "8": "LABEL_8",
- "9": "LABEL_9",
- "10": "LABEL_10",
- "11": "LABEL_11",
- "12": "LABEL_12",
- "13": "LABEL_13",
- "14": "LABEL_14",
- "15": "LABEL_15",
- "16": "LABEL_16",
- "17": "LABEL_17",
- "18": "LABEL_18",
- "19": "LABEL_19",
- "20": "LABEL_20",
- "21": "LABEL_21",
- "22": "LABEL_22",
- "23": "LABEL_23",
- "24": "LABEL_24",
- "25": "LABEL_25"
+ "0": "1025",
+ "1": "1071",
+ "2": "131",
+ "3": "138",
+ "4": "284",
+ "5": "285",
+ "6": "435",
+ "7": "436",
+ "8": "595",
+ "9": "657",
+ "10": "664",
+ "11": "682",
+ "12": "684",
+ "13": "691",
+ "14": "693",
+ "15": "697",
+ "16": "703",
+ "17": "706",
+ "18": "707",
+ "19": "710",
+ "20": "74",
+ "21": "754",
+ "22": "829",
+ "23": "862",
+ "24": "913",
+ "25": "94"
  },
  "initializer_range": 0.02,
  "intermediate_size": 3072,
  "label2id": {
- "LABEL_0": 0,
- "LABEL_1": 1,
- "LABEL_10": 10,
- "LABEL_11": 11,
- "LABEL_12": 12,
- "LABEL_13": 13,
- "LABEL_14": 14,
- "LABEL_15": 15,
- "LABEL_16": 16,
- "LABEL_17": 17,
- "LABEL_18": 18,
- "LABEL_19": 19,
- "LABEL_2": 2,
- "LABEL_20": 20,
- "LABEL_21": 21,
- "LABEL_22": 22,
- "LABEL_23": 23,
- "LABEL_24": 24,
- "LABEL_25": 25,
- "LABEL_3": 3,
- "LABEL_4": 4,
- "LABEL_5": 5,
- "LABEL_6": 6,
- "LABEL_7": 7,
- "LABEL_8": 8,
- "LABEL_9": 9
+ "1025": 0,
+ "1071": 1,
+ "131": 2,
+ "138": 3,
+ "284": 4,
+ "285": 5,
+ "435": 6,
+ "436": 7,
+ "595": 8,
+ "657": 9,
+ "664": 10,
+ "682": 11,
+ "684": 12,
+ "691": 13,
+ "693": 14,
+ "697": 15,
+ "703": 16,
+ "706": 17,
+ "707": 18,
+ "710": 19,
+ "74": 20,
+ "754": 21,
+ "829": 22,
+ "862": 23,
+ "913": 24,
+ "94": 25
  },
  "layer_norm_eps": 1e-05,
  "max_position_embeddings": 514,
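Note on the config.json change: the placeholder names `LABEL_0` to `LABEL_25` are replaced by 26 numeric strings, ordered lexicographically as strings (so "74" sorts after "710"). The sketch below shows how a class index maps back to a label under that mapping. Treating the strings as CWE identifiers and adding a `CWE-` prefix is an assumption based on the repository name, and `predicted_label` is a hypothetical helper, not part of the commit.

```python
# Class labels in index order, copied from the new id2label block in config.json.
LABELS = [
    "1025", "1071", "131", "138", "284", "285", "435", "436", "595", "657",
    "664", "682", "684", "691", "693", "697", "703", "706", "707", "710",
    "74", "754", "829", "862", "913", "94",
]

# Rebuild both directions of the mapping stored in config.json.
id2label = {i: label for i, label in enumerate(LABELS)}
label2id = {label: i for i, label in id2label.items()}


def predicted_label(logits):
    """Map 26 class scores to a label string via argmax.

    The "CWE-" prefix is an assumption based on the repository name.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} scores, got {len(logits)}")
    best = max(range(len(logits)), key=logits.__getitem__)
    return f"CWE-{id2label[best]}"


# Example: a score vector whose maximum is at index 20.
scores = [0.0] * 26
scores[20] = 1.0
print(predicted_label(scores))  # prints CWE-74
```

Because the ordering is lexicographic rather than numeric, code that consumes the model output should look labels up through `id2label` instead of assuming that a larger index means a larger identifier.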
emissions.csv ADDED
@@ -0,0 +1,2 @@
+ timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
+ 2025-10-15T12:36:16,codecarbon,2e463b2d-fb6f-42cd-9687-32c8f179a8f1,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,354.22733898554,0.005596118459999381,1.5798098690027484e-05,42.5,219.5918093228074,94.34468507766725,0.004177519621494381,0.039712911214735414,0.009272774278571993,0.05316320511480179,Luxembourg,LUX,luxembourg,,,Linux-6.8.0-71-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,64,AMD EPYC 9124 16-Core Processor,2,2 x NVIDIA L40S,6.1294,49.6113,251.5858268737793,machine,N,1.0
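Note on emissions.csv: the row is internally consistent, which can be checked with a few lines of arithmetic. The sketch below copies the relevant fields from the row above. The units in the comments (kWh, kg CO2eq, seconds) follow CodeCarbon's documented output format as I understand it and should be treated as an assumption.

```python
import math

# Fields copied from the single data row of emissions.csv.
row = {
    "duration": 354.22733898554,               # seconds (assumed unit)
    "emissions": 0.005596118459999381,         # kg CO2eq (assumed unit)
    "emissions_rate": 1.5798098690027484e-05,  # kg CO2eq per second (assumed unit)
    "cpu_energy": 0.004177519621494381,        # kWh (assumed unit)
    "gpu_energy": 0.039712911214735414,
    "ram_energy": 0.009272774278571993,
    "energy_consumed": 0.05316320511480179,
}

# Total energy should equal the sum of the per-component energies.
component_sum = row["cpu_energy"] + row["gpu_energy"] + row["ram_energy"]
energy_ok = math.isclose(component_sum, row["energy_consumed"], rel_tol=1e-9)

# The emissions rate should equal emissions divided by duration.
rate = row["emissions"] / row["duration"]
rate_ok = math.isclose(rate, row["emissions_rate"], rel_tol=1e-6)

# Carbon intensity implied by the row (emissions per unit of energy).
intensity = row["emissions"] / row["energy_consumed"]

print(f"energy consistent: {energy_ok}, rate consistent: {rate_ok}")
print(f"implied intensity: {intensity:.4f} kg CO2eq per kWh")
```

The GPUs account for roughly three quarters of the recorded energy, as expected for a fine-tuning run of about six minutes on two NVIDIA L40S cards.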
metrics.json ADDED
@@ -0,0 +1,9 @@
+ {
+ "eval_loss": 2.877013683319092,
+ "eval_accuracy": 0.37037037037037035,
+ "eval_f1_macro": 0.2103685138772858,
+ "eval_runtime": 0.2673,
+ "eval_samples_per_second": 303.013,
+ "eval_steps_per_second": 11.223,
+ "epoch": 40.0
+ }
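Note on metrics.json: the file does not state the size of the evaluation set, but the throughput figures let us estimate it. The sketch below is an inference from the reported numbers, not something the commit states. It multiplies the rates by the runtime and cross-checks the result against the accuracy value and the `eval_batch_size` of 32 from README.md.

```python
import math
from fractions import Fraction

# Values copied from metrics.json.
eval_accuracy = 0.37037037037037035
eval_runtime = 0.2673
eval_samples_per_second = 303.013
eval_steps_per_second = 11.223

# eval_batch_size as reported in README.md.
eval_batch_size = 32

# Rate multiplied by runtime recovers the counts, up to rounding
# in the reported values.
n_samples = round(eval_samples_per_second * eval_runtime)
n_steps = round(eval_steps_per_second * eval_runtime)

# The accuracy should be a ratio whose denominator divides n_samples.
accuracy_fraction = Fraction(eval_accuracy).limit_denominator(1000)
n_correct = round(eval_accuracy * n_samples)

print(f"estimated eval samples: {n_samples}, eval steps: {n_steps}")
print(f"accuracy as a fraction: {accuracy_fraction}, correct: {n_correct}")
print(f"batches expected at batch size {eval_batch_size}: "
      f"{math.ceil(n_samples / eval_batch_size)}")
```

Under these assumptions the evaluation set has 81 examples, so each example moves accuracy by about 1.2 percentage points. With 26 classes, that is roughly three examples per class on average, which makes the accuracy and F1 macro figures noisy.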
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4638c4a9f3346a2b18ced96811e6efb764e57f001a75976ba6b77d38aef5061a
+ oid sha256:1a5b85bd2250d4f504ce9f124f616c4b514aff3c989f0cb1fc3defec258102cb
  size 498686648