AI & ML interests
SLM/LLM/NMT
Recent Activity
			Organization Card
		
		Tbilisi AI Lab
Open, Georgian-first Generative AI.
We are a non-profit on a mission to build capable, affordable, and open Georgian language models.
Georgian is a low-resource language with ~4M speakers. We believe Georgian speakers deserve their own ChatGPT-like, productivity-boosting AI — built with and for our community.
🔔 What’s new (Oct 2025)
We have open-sourced all our models and datasets — from pretraining corpora to every stage of fine-tuning (instruction/SFT, function-calling, and preference/DPO). Explore everything on our Hugging Face org:
Examples of what’s now public:
- Models
- Datasets (selection)- Instruction (SFT) mix (2.61M pairs): https://huggingface.co/datasets/tbilisi-ai-lab/kona-sft-mix-2.6M
- Function-calling SFT (EN 115k / KA 93k):
- Preference (DPO) mix (387k): https://huggingface.co/datasets/tbilisi-ai-lab/kona-dpo-mix-387k
- Evaluation/Knowledge sets in Georgian (e.g., SuperGLUE-KA, BoolQ-KA, CommonsenseQA-KA, code-instruct-KA, wiki-QA-KA, human-translated EN↔KA, etc.):  - Browse all: https://huggingface.co/tbilisi-ai-lab/datasets
 
 
Why we exist
- Language equity: Great AI shouldn’t be limited to high-resource languages.
- Local impact: Better Georgian NLP improves education, services, accessibility, and economic opportunity.
- Open science: We share models, data, and recipes so others can reproduce and build on our work.
Quickstart (12B)
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline
model_id = "tbilisi-ai-lab/kona2-12B"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype="auto")
chat = pipeline("text-generation", model=model, tokenizer=tok)
prompt = "გთხოვ, შეაჯამე ეს ტექსტი მოკლედ (ქართული): ქართული ენა უნიკალურია..."
out = chat(prompt, max_new_tokens=256, do_sample=True, temperature=0.7)
print(out[0]["generated_text"])
			datasets
			19
		
			
	
	
	
	
	tbilisi-ai-lab/kona-sft-mix-2.6M
			Viewer
			• 
	
				Updated
					
				• 
			
			2.61M
	
				• 
					
					72
				
				• 
					
					1
				
tbilisi-ai-lab/kona-dpo-mix-387k
			Viewer
			• 
	
				Updated
					
				• 
			
			387k
	
				• 
					
					25
				
				
				
tbilisi-ai-lab/math-instruct-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			32.4k
	
				• 
					
					16
				
				
				
tbilisi-ai-lab/en-ka-human-translated
			Viewer
			• 
	
				Updated
					
				• 
			
			5k
	
				• 
					
					14
				
				
				
tbilisi-ai-lab/learnlm-chat-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			5.86k
	
				• 
					
					15
				
				
				
tbilisi-ai-lab/wiki-ka-QA
			Viewer
			• 
	
				Updated
					
				• 
			
			42.6k
	
				• 
					
					16
				
				
				
tbilisi-ai-lab/code-instruct-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			61.3k
	
				• 
					
					13
				
				
				
tbilisi-ai-lab/ai2_arc-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			1.68k
	
				• 
					
					19
				
				
				
tbilisi-ai-lab/boolq-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			3.27k
	
				• 
					
					14
				
				
				
tbilisi-ai-lab/commonsense_qa-ka
			Viewer
			• 
	
				Updated
					
				• 
			
			1.22k
	
				• 
					
					14
				
				
				
