Toward Autonomous and Faithful Claim Verification via Online Reinforcement Learning
H0key
H0key
·
AI & ML interests
None yet
Organizations
models 10
H0key/Veri-R1_Llama3.2-3B-Instruct-OfflineRL
4B • Updated • 1
H0key/Veri-R1_Llama3.2-3B-Instruct-OnlineRL
4B • Updated • 1
H0key/Veri-R1_Qwen-3B-Instruct-OfflineRL
3B • Updated • 2
H0key/Veri-R1_Qwen2.5-3B-Instruct-OnlineRL
3B • Updated • 2
H0key/qwen2.5-3b-max1step30max3
3B • Updated • 1
H0key/qwen2.5-3b-correctmax1
3B • Updated • 1
H0key/qwen2.5-1.5b-ins
2B • Updated • 1
H0key/qwen2.5-1.5b-4kdata
Updated
H0key/qwen2.5-3b-nolength
3B • Updated • 1
H0key/qwen2.5-3b-ins
3B • Updated • 1