OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 25 items • Updated Mar 2 • 135
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs Pclanglais • Mar 20, 2024 • 33
LFM 1.2B SOTA- 400-700 T/S - Enhanced/Fine Tunes // Distills Collection Fine tunes of LFM 1.2B (SOTA) models via Unsloth to enhance performance, including reasoning/thinking distills and "reasoning/thinking" replacements. • 14 items • Updated 8 days ago • 9