hw42
/

Huggingface_hw

Model card Files Files and versions

hw42 commited on Oct 9, 2023

Commit

44e8787

·

1 Parent(s): d29b69c

Upload model

Files changed (3) hide show

README.md +12 -0
adapter_config.json +2 -2
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -201,6 +201,18 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 ## Training procedure
 ### Framework versions

 ## Training procedure
+The following `bitsandbytes` quantization config was used during training:
+- quant_method: bitsandbytes
+- load_in_8bit: True
+- load_in_4bit: False
+- llm_int8_threshold: 6.0
+- llm_int8_skip_modules: None
+- llm_int8_enable_fp32_cpu_offload: False
+- llm_int8_has_fp16_weight: False
+- bnb_4bit_quant_type: fp4
+- bnb_4bit_use_double_quant: False
+- bnb_4bit_compute_dtype: float32
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -16,8 +16,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q",
-    "v"
   ],
   "task_type": "SEQ_2_SEQ_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v",
+    "q"
   ],
   "task_type": "SEQ_2_SEQ_LM"
 }

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b120d54fc8ce458aa90768fa0e56d03b4061b606b51ec8b8be8c17906094a570
 size 18980429

 version https://git-lfs.github.com/spec/v1
+oid sha256:c40863f6e0cc7bb9332f9f5b7b5a4e03855c436f9d44061c98c0e4a5b0270226
 size 18980429