Pre-trained models for Matmulfree LM.
Rui-Jie Zhu
ridger
AI & ML interests
None yet
Recent Activity
new activity about 11 hours ago
ByteDance/Ouro-1.4B-Thinking: Fix default RoPE init function reference in OuroRotaryEmbedding
upvoted a paper about 1 month ago
How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models
upvoted a paper about 1 month ago
Large Language Models Explore by Latent Distilling