SAnocha committed on
Commit
80ecc85
·
verified ·
1 Parent(s): 4bea3c6

Create README

  1. README.md +120 -0
README.md ADDED
@@ -0,0 +1,120 @@
+ ---
+ library_name: transformers
+ pipeline_tag: text-generation
+ base_model:
+ - aisingapore/Llama-SEA-LION-v3-70B-IT
+ language:
+ - en
+ - zh
+ - vi
+ - id
+ - th
+ - fil
+ - ta
+ - ms
+ - km
+ - lo
+ - my
+ - jv
+ - su
+ license: llama3.1
+ ---
+
+ <div>
+ <img src="llama_sea_lion_3.5_70b_r_banner.png"/>
+ </div>
+
+ # Llama-SEA-LION-v3.5-70B-R-FP8-Dynamic
+
+ Last updated: 2025-09-01
+
+ [**SEA-LION**](https://arxiv.org/abs/2504.05747) is a collection of Large Language Models (LLMs) that have been pretrained and instruct-tuned
+ for the Southeast Asia (SEA) region.
+
+
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ SEA-LION stands for *Southeast Asian Languages In One Network*.
+
+ Quantization was performed on Llama-SEA-LION-v3.5-70B-R to produce optimized variants that reduce memory requirements
+ while maintaining model quality. These quantized models support inference on a range of consumer-grade GPUs
+ and are compatible with various inference engines.
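
To give a rough sense of the memory saving, the sketch below estimates weight storage only, at 16-bit versus 8-bit precision. The parameter count is a round-number assumption taken from the "70B" in the model name, and the 16-bit baseline is also an assumption. The estimate ignores the KV cache, activations, quantization scales, and any layers left unquantized, so real usage will be higher.

```python
# Back-of-the-envelope estimate of weight memory; not a measurement.
# ASSUMPTION: ~70e9 parameters (rounded from the "70B" in the model name).
PARAMS = 70e9

# ASSUMPTION: the unquantized baseline stores weights in 16 bits.
BYTES_PER_PARAM = {
    "16-bit": 2,  # two bytes per weight
    "fp8": 1,     # one byte per weight
}


def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return approximate weight storage in GiB (weights only)."""
    return num_params * bytes_per_param / 2**30


estimates = {
    name: weight_memory_gib(PARAMS, size)
    for name, size in BYTES_PER_PARAM.items()
}

for name, gib in estimates.items():
    print(f"{name}: ~{gib:.0f} GiB")
```

Under these assumptions, 8-bit weights need about half the storage of 16-bit weights, which is the main source of the reduced memory requirement described above.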
+
+
+ For tokenization, the model uses the default tokenizer of Llama 3.1-70B-Instruct.
+
+
+ - **Developed by:** Products Pillar, AI Singapore
+ - **Funded by:** Singapore NRF
+ - **Model type:** Decoder
+ - **Context length:** 128k tokens
+ - **Language(s):** Burmese, Chinese, English, Filipino, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tamil, Thai, Vietnamese
+ - **License:** [Llama 3.1 Community License](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct/blob/main/LICENSE)
+ - **Quantized from model:** Llama-SEA-LION-v3.5-70B-R
+
+ This repo contains FP8-Dynamic quantized model files for aisingapore/Llama-SEA-LION-v3.5-70B-R.
+
+ Model weights included in this repository:
+ - [Llama-SEA-LION-v3.5-70B-R-FP8-Dynamic](https://huggingface.co/aisingapore/Llama-SEA-LION-v3.5-70B-R-FP8-Dynamic)
+
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Performance Test Results
+
+ For details on Llama-SEA-LION-v3.5-70B-R performance, please refer to the [SEA-HELM leaderboard](https://leaderboard.sea-lion.ai/).
+
+
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ The model has not been aligned for safety. Developers and users should perform their own safety
+ fine-tuning and related security measures. In no event shall the authors be held liable for any claims, damages, or other liabilities arising from the use of the released weights and code.
+
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ *The model was not tested for robustness against adversarial prompting.* Users should be aware that the model has limitations that warrant consideration.
+ Like many LLMs, the model can hallucinate and occasionally generates irrelevant content,
+ introducing fictional elements that are not grounded in the provided context.
+ Users should also exercise caution in interpreting and validating the model's responses
+ because of potential inconsistencies.
+
+
+
+ ## More Information
+
+ This is the repository for the commercial instruction-tuned model.
+
+ AI Singapore is a national programme supported by the National Research Foundation, Singapore and hosted by the National University of Singapore.
+ Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and
+ do not reflect the views of the National Research Foundation or the National University of Singapore.
+
+ [Link to SEA-LION's GitHub repository](https://github.com/aisingapore/sealion)
+
+ For more information, please contact us at [email protected]
+
+
+ ## Team
+
+ Antonyrex Sajeban, Chan Adwin, Cheng Nicholas, Choa Esther, Huang Yuli, Hulagadri Adithya Venkatadri, Lau Wayne, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi, Liew Rachel, Limkonchotiwat Peerat, Liu Bing Jie Darius,
+ Montalan Jann Railey, Ng Boon Cheong Raymond, Ngui Jian Gang, Nguyen Thanh Ngan, Ong Brandon, Ong Tat-Wee David,
+ Ong Zhi Hao, Rengarajan Hamsawardhini, Siow Bryan, Susanto Yosephine, Tai Ngee Chia, Tan Choon Meng, Teng Walter,
+ Teo Eng Sipp Leslie, Teo Wei Yi, Tjhi William, Yeo Yeow Tong, Yong Xianbin
+
+
+ ## Contact
+