 matlok
			's Collections
			matlok
			's Collections
			
			
		Non-English Embeddings and Models
		
	updated
			
 
				
				
 - BLOOM: A 176B-Parameter Open-Access Multilingual Language Model- 
			Paper
			 •- 
			2211.05100
			 •
			Published
				
			•- 
				34
			 
 - Contrastive Language-Image Pre-training for the Italian Language- 
			Paper
			 •- 
			2108.08688
			 •
			Published
				
			•- 
				2
			 
 - IT5: Large-scale Text-to-text Pretraining for Italian Language
  Understanding and Generation- 
			Paper
			 •- 
			2203.03759
			 •
			Published
				
			•- 
				5
			 
 - Spanish Pre-trained BERT Model and Evaluation Data- 
			Paper
			 •- 
			2308.02976
			 •
			Published
				
			•- 
				3
			 
 - German FinBERT: A German Pre-trained Language Model- 
			Paper
			 •- 
			2311.08793
			 •
			Published
				
			•- 
				3
			 
 - German Text Embedding Clustering Benchmark- 
			Paper
			 •- 
			2401.02709
			 •
			Published
				
			•- 
				6
			 
 - AfroDigits: A Community-Driven Spoken Digit Dataset for African
  Languages- 
			Paper
			 •- 
			2303.12582
			 •
			Published
				
			•- 
				20
			 
   - SeaLLMs/SeaLLM-7B-v2- 
			Text Generation
			 • 
		
				7B
			• 
	
				Updated
					
				
				•- 
					8.61k
				
	
				 •- 
					68
				 
   - gsarti/it5-base- 
		
	
				Updated
					
				
				• 
					- 
					220
				
	
				 •- 
					24
				 
 
 - Aya Model: An Instruction Finetuned Open-Access Multilingual Language
  Model- 
			Paper
			 •- 
			2402.07827
			 •
			Published
				
			•- 
				48
			 
 - 
			- 
			Viewer
			 • 
	
				Updated
					
				• 
			
			206k
	
				•- 
					2.11k
				
				 •- 
					326
				 
 
   - CohereLabs/c4ai-command-r-v01- 
			Text Generation
			 • 
		
				35B
			• 
	
				Updated
					
				
				•- 
					11.5k
				
	
				 •- 
					1.1k