Update README.md
Browse files
README.md
CHANGED
|
@@ -1,4 +1,8 @@
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
library_name: transformers
|
| 3 |
tags:
|
| 4 |
- 4-bit
|
|
@@ -6,6 +10,50 @@ tags:
|
|
| 6 |
- text-generation
|
| 7 |
- autotrain_compatible
|
| 8 |
- endpoints_compatible
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
pipeline_tag: text-generation
|
| 10 |
inference: false
|
| 11 |
quantized_by: Suparious
|
|
@@ -15,7 +63,15 @@ quantized_by: Suparious
|
|
| 15 |
- Model creator: [jondurbin](https://huggingface.co/jondurbin)
|
| 16 |
- Original model: [bagel-8b-v1.0](https://huggingface.co/jondurbin/bagel-8b-v1.0)
|
| 17 |
|
|
|
|
| 18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
|
| 20 |
## How to use
|
| 21 |
|
|
|
|
| 1 |
---
|
| 2 |
+
license: other
|
| 3 |
+
license_name: llama3
|
| 4 |
+
license_link: https://huggingface.co/meta-llama/Meta-Llama-3-8B/blob/main/LICENSE
|
| 5 |
+
base_model: meta-llama/Meta-Llama-3-8B
|
| 6 |
library_name: transformers
|
| 7 |
tags:
|
| 8 |
- 4-bit
|
|
|
|
| 10 |
- text-generation
|
| 11 |
- autotrain_compatible
|
| 12 |
- endpoints_compatible
|
| 13 |
+
- llama-3
|
| 14 |
+
- bagel
|
| 15 |
+
datasets:
|
| 16 |
+
- ai2_arc
|
| 17 |
+
- allenai/ultrafeedback_binarized_cleaned
|
| 18 |
+
- argilla/distilabel-intel-orca-dpo-pairs
|
| 19 |
+
- jondurbin/airoboros-3.2
|
| 20 |
+
- codeparrot/apps
|
| 21 |
+
- facebook/belebele
|
| 22 |
+
- bluemoon-fandom-1-1-rp-cleaned
|
| 23 |
+
- boolq
|
| 24 |
+
- camel-ai/biology
|
| 25 |
+
- camel-ai/chemistry
|
| 26 |
+
- camel-ai/math
|
| 27 |
+
- camel-ai/physics
|
| 28 |
+
- jondurbin/contextual-dpo-v0.1
|
| 29 |
+
- jondurbin/gutenberg-dpo-v0.1
|
| 30 |
+
- jondurbin/py-dpo-v0.1
|
| 31 |
+
- jondurbin/truthy-dpo-v0.1
|
| 32 |
+
- LDJnr/Capybara
|
| 33 |
+
- jondurbin/cinematika-v0.1
|
| 34 |
+
- WizardLM/WizardLM_evol_instruct_70k
|
| 35 |
+
- glaiveai/glaive-function-calling-v2
|
| 36 |
+
- jondurbin/gutenberg-dpo-v0.1
|
| 37 |
+
- grimulkan/LimaRP-augmented
|
| 38 |
+
- lmsys/lmsys-chat-1m
|
| 39 |
+
- ParisNeo/lollms_aware_dataset
|
| 40 |
+
- TIGER-Lab/MathInstruct
|
| 41 |
+
- Muennighoff/natural-instructions
|
| 42 |
+
- openbookqa
|
| 43 |
+
- kingbri/PIPPA-shareGPT
|
| 44 |
+
- piqa
|
| 45 |
+
- Vezora/Tested-22k-Python-Alpaca
|
| 46 |
+
- ropes
|
| 47 |
+
- cakiki/rosetta-code
|
| 48 |
+
- Open-Orca/SlimOrca
|
| 49 |
+
- b-mc2/sql-create-context
|
| 50 |
+
- squad_v2
|
| 51 |
+
- mattpscott/airoboros-summarization
|
| 52 |
+
- migtissera/Synthia-v1.3
|
| 53 |
+
- unalignment/toxic-dpo-v0.2
|
| 54 |
+
- WhiteRabbitNeo/WRN-Chapter-1
|
| 55 |
+
- WhiteRabbitNeo/WRN-Chapter-2
|
| 56 |
+
- winogrande
|
| 57 |
pipeline_tag: text-generation
|
| 58 |
inference: false
|
| 59 |
quantized_by: Suparious
|
|
|
|
| 63 |
- Model creator: [jondurbin](https://huggingface.co/jondurbin)
|
| 64 |
- Original model: [bagel-8b-v1.0](https://huggingface.co/jondurbin/bagel-8b-v1.0)
|
| 65 |
|
| 66 |
+

|
| 67 |
|
| 68 |
+
## Model Summary
|
| 69 |
+
|
| 70 |
+
The name of this model is "llama-3-bagel-8b-v1.0" and it was built with llama-3 from Meta.
|
| 71 |
+
|
| 72 |
+
This is a fine-tune of llama-3-8b using the bagel dataset, but instead of 4 prompt formats it's standardized on a single format - llama-3 instruct.
|
| 73 |
+
|
| 74 |
+
See [bagel](https://github.com/jondurbin/bagel) for additional details on the datasets.
|
| 75 |
|
| 76 |
## How to use
|
| 77 |
|