File size: 845 Bytes
026e590
 
 
 
 
 
 
 
 
f15c9f7
 
58e5714
f15c9f7
 
 
 
 
51faba2
f15c9f7
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
title: README
emoji: πŸ“Š
colorFrom: purple
colorTo: indigo
sdk: static
pinned: false
---

Welcome to the **MiniLingua-AI** Hub

This organization is a space curated by [Anna Aksenova](https://www.linkedin.com/in/annaaksenova/) and [Boris Zverkov](https://www.linkedin.com/in/boriszverkov/) as part of their thesis project focused on the development of a multilingual small language model.

πŸ“š **Project Highlights**:
- Development of a 1B parameter multilingual LLM
- Custom tokenizer supporting 13 languages and code
- Instruction fine-tuning and multilingual evaluation
- [Repo with training and data cleaning configs](https://github.com/MiniLingua-ai/training_artifacts)

This hub includes both model checkpoints and datasets related to **MiniLingua**, aiming to support research in inclusive language modeling for European languages.