Open-Models
updated
Text Generation
• 120B • Updated • 4.67M
• • 4.83k
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper
• 2512.20605
• Published • 63
Nested Browser-Use Learning for Agentic Information Seeking
Paper
• 2512.23647
• Published • 19
TimeBill: Time-Budgeted Inference for Large Language Models
Paper
• 2512.21859
• Published • 25
ResembleAI/chatterbox-turbo
Text-to-Speech
• Updated • • 650
mHC: Manifold-Constrained Hyper-Connections
Paper
• 2512.24880
• Published • 328
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
Paper
• 2512.15560
• Published • 25
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone
Paper
• 2512.22615
• Published • 51
Text-to-3D
• Updated • 459
• 410
Image-to-Video
• Updated • 585k
• • 1.73k
LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR
Paper
• 2601.14251
• Published • 29
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
Paper
• 2601.22153
• Published • 75
tencent/Youtu-VL-4B-Instruct
Image-Text-to-Text
• 5B • Updated • 541
• 156
Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation
Paper
• 2601.21406
• Published • 6
Reinforcement Learning via Self-Distillation
Paper
• 2601.20802
• Published • 47
DeepSeek-OCR 2: Visual Causal Flow
Paper
• 2601.20552
• Published • 70
Image-Text-to-Text
• 1B • Updated • 5.1M
• • 1.79k
unsloth/Qwen3-Coder-Next-FP8-Dynamic
Text Generation
• 80B • Updated • 8.65k
• 42
Text Generation
• 80B • Updated • 889k
• • 1.41k
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
Paper
• 2602.12099
• Published • 62
Image-to-Video
• Updated • 2.28M
• 1.29k
Updated • 97
• 157
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper
• 2603.13398
• Published • 155
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF
Image-Text-to-Text
• 4B • Updated • 18.7k
• 126
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Paper
• 2603.20278
• Published • 98
YTan2000/Qwen3.5-27B-TQ3_1S
Image-Text-to-Text
• 27B • Updated • 246
• 37
bartowski/arcee-ai_Trinity-Large-Thinking-GGUF
Text Generation
• 399B • Updated • 664
• 11
Text Generation
• 8B • Updated • 2.17k
• 179
mudler/Qwen3.5-35B-A3B-APEX-GGUF
Text Generation
• 35B • Updated • 34.8k
• 93
Jackrong/Qwopus3.5-27B-v3
Image-Text-to-Text
• 27B • Updated • 1.1k
• 240
Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
Paper
• 2604.00528
• Published • 12
Text Generation
• 21B • Updated • 1.22k
• 96
Image-Text-to-Text
• 5B • Updated • 1.71M
• 363
lightonai/LightOnOCR-2-1B
Image-Text-to-Text
• 1B • Updated • 353k
• 688
selimaktas/MiniMax-M2.75-460B-A20B
Text Generation
• 453B • Updated • 93
• 26
Image-Text-to-Text
• 28B • Updated • 5.06M
• • 1.54k
310B • Updated • 89.1k
• 730
Token Classification
• 1B • Updated • 306k
• 1.57k
concavity-ai/superlinear-exp-v0.1
Text Generation
• 32B • Updated • 29
• 22
openbmb/InfLLM-V2-Long-Sparse-Base
8B • Updated • 19
• 6
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
• 685B • Updated • 191k
• • 993
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention
Paper
• 2603.28458
• Published • 44
oongaboongahacker/Gemini-Nano
Updated • 40
Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings
Paper
• 2605.22391
• Published • 36
Feature Extraction
• Updated • 564
• 26