jasonrichdarmawan/nllb-primary-datasets-public-data-embedding • Updated Sep 24, 2025 • 10.7M • 68 • 1
ontocord/MixtureVitae-fineweb-permissive-multilingual-2m • Updated Apr 14, 2025 • 2.23M • 19 • 2
GlotLID (Language Identification) • Running • 29 • Identify the language of a sentence with confidence scores