Sleeping Agents Template Final Assignment 🕵 Run evaluation agent to answer and submit benchmark questions