Update README.md
README.md
CHANGED
|
@@ -16,7 +16,15 @@ Multilingual Pre-trained Language Model, such as mBERT, XLM-R, provide multiling
|
|
| 16 |
We have seen rapid progress on building multilingual PLMs in recent years.
|
| 17 |
However, there has been little work on building PLMs for Chinese minority languages, which hinders researchers from building powerful NLP systems.
|
| 18 |
|
| 19 |
-
To address the absence of Chinese minority PLMs, the Joint Laboratory of HIT and iFLYTEK Research (HFL) proposes CINO (Chinese-miNOrity pre-trained language model), which is built on XLM-R with additional pre-training on Chinese minority language corpora, such as
|
| 20 |
|
| 21 |
Please read our GitHub repository for more details (Chinese): https://github.com/ymcui/Chinese-Minority-PLM
|
| 22 |
|
|
|
|
| 16 |
We have seen rapid progress on building multilingual PLMs in recent years.
|
| 17 |
However, there has been little work on building PLMs for Chinese minority languages, which hinders researchers from building powerful NLP systems.
|
| 18 |
|
| 19 |
+
To address the absence of Chinese minority PLMs, the Joint Laboratory of HIT and iFLYTEK Research (HFL) proposes CINO (Chinese-miNOrity pre-trained language model), which is built on XLM-R with additional pre-training on Chinese minority language corpora, such as:
|
| 20 |
+
- Chinese,中文(zh)
|
| 21 |
+
- Tibetan,藏语(bo)
|
| 22 |
+
- Mongolian (Uighur form),蒙语(mn)
|
| 23 |
+
- Uyghur,维吾尔语(ug)
|
| 24 |
+
- Kazakh (Arabic form),哈萨克语(kk)
|
| 25 |
+
- Korean,朝鲜语(ko)
|
| 26 |
+
- Zhuang,壮语
|
| 27 |
+
- Cantonese,粤语(yue)
|
| 28 |
|
| 29 |
Please read our GitHub repository for more details (Chinese): https://github.com/ymcui/Chinese-Minority-PLM
|
| 30 |
|