林優奈
smoore2024
AI & ML interests
None yet
Recent Activity
upvoted a paper about 12 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
liked a dataset about 15 hours ago
Emmyc2/psp
liked a model 1 day ago
zhaohq/PureRL-1.5B-v7-s2-l2-kl-w3-b0
Organizations
None yet