Models and data for the paper: Bradley-Terry Policy Optimization for Generative Preference Modeling
Shengyu Feng
shengyuf
·
AI & ML interests
None yet
Recent Activity
updated a collection 5 days ago
BTPO updated a model 5 days ago
shengyuf/Qwen2.5-3B-math-grm updated a collection 5 days ago
BTPO