Aletheia-ng/pidgin-corpus-synth
Viewer
•
Updated
•
1.69k
•
3
Aletheia-ng/nigerian-pidgin-corpus-synth
Aletheia-ng/pretrain_data10
Viewer
•
Updated
•
40.9M
•
43
Aletheia-ng/low_resource_languages_pretrain_data4
Viewer
•
Updated
•
469M
•
914
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
Viewer
•
Updated
•
79.1M
•
63
Aletheia-ng/pretrain_data5
Viewer
•
Updated
•
9.43M
•
239
Aletheia-ng/pretrain_data4
Viewer
•
Updated
•
124M
•
645
Aletheia-ng/pretrain_data7
Viewer
•
Updated
•
13M
•
49
Aletheia-ng/pretrain_data3
Viewer
•
Updated
•
143M
•
663
Viewer
•
Updated
•
136
•
56
Aletheia-ng/pretrain_data
Viewer
•
Updated
•
109M
•
417
Aletheia-ng/pretrain_data2
Viewer
•
Updated
•
18.2M
•
294
Aletheia-ng/low_resource_languages_pretrain
Viewer
•
Updated
•
202M
•
970
•
1
Aletheia-ng/masakhaner_eval
Aletheia-ng/noisy_dataset
Viewer
•
Updated
•
84k
•
83
Viewer
•
Updated
•
84k
•
81
Aletheia-ng/personal_finance_v0.2
Viewer
•
Updated
•
56.6k
•
41
•
1
Aletheia-ng/bloomberg-news-articles-pretraining-dataset
Viewer
•
Updated
•
437k
•
89
•
5
Aletheia-ng/ChatML-aya_dataset
Viewer
•
Updated
•
202k
•
21
Aletheia-ng/yo_wiki_processed
Viewer
•
Updated
•
43.5k
•
20
Viewer
•
Updated
•
270k
•
32
Viewer
•
Updated
•
4.4k
•
18
Viewer
•
Updated
•
43.5k
•
18
Viewer
•
Updated
•
288
•
33
Viewer
•
Updated
•
1.01k
•
116
Viewer
•
Updated
•
3.67k
•
213