Large Language Models
Rethinking the Role of Efficient Attention in Hybrid Architectures
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction