PhotoBench Family Collection: Current and future releases for the paper "PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval" • 4 items • Updated 3 days ago
Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization Paper • 2604.09574 • Published Feb 24 • 30
StepORLM: A Self-Evolving Framework With Generative Process Supervision For Operations Research Language Models Paper • 2509.22558 • Published Sep 26, 2025 • 4
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Paper • 2604.08224 • Published 18 days ago • 50
PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval Paper • 2603.01493 • Published Mar 2 • 20