Post
912
Glass-Box Agent for Build Small Hackathon.
A tiny ReAct-style agent where the trace is the interface: click a thought, retry a branch, label weak/useful nodes, and export preference pairs for DPO/RL-style training.
Space: build-small-hackathon/glass-box-agent
Demo: included in the Space at assets/glass-box-agent-demo.mp4
Track: An Adventure in Thousand Token Wood
#BuildSmallHackathon #Gradio #SmallModels
A tiny ReAct-style agent where the trace is the interface: click a thought, retry a branch, label weak/useful nodes, and export preference pairs for DPO/RL-style training.
Space: build-small-hackathon/glass-box-agent
Demo: included in the Space at assets/glass-box-agent-demo.mp4
Track: An Adventure in Thousand Token Wood
#BuildSmallHackathon #Gradio #SmallModels