---
license: mit
datasets:
- conll2003
language:
- en
metrics:
- f1
library_name: peft
pipeline_tag: token-classification
tags:
- unsloth
- llama-2
---

[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="150"/>](https://github.com/unslothai/unsloth)

At the time of writing, the 🤗 transformers library doesn't have a Llama implementation for token classification ([although there is an open PR](https://github.com/huggingface/transformers/pull/29878)).

This model is based on an [implementation](https://github.com/huggingface/transformers/issues/26521#issuecomment-1868284434) by community member [@KoichiYasuoka](https://github.com/KoichiYasuoka).

* Base model: `unsloth/llama-2-13b-bnb-4bit`
* LoRA adaptation with rank 4 and alpha 32; the remaining adapter settings can be found in [`adapter_config.json`](https://huggingface.co/SauravMaheshkar/unsloth-llama-2-7b-bnb-4bit-conll2003-rank-4/blob/main/adapter_config.json)

This model was trained for only a single epoch; however, a notebook is available for those who want to train on other datasets or for longer.