PaTaRM
PaTaRM is a Generative Reward Model (GRM) for RLHF alignment.
AIJian/PaTaRM-8B • Text Generation • 0.5B • Updated about 1 month ago • 123
AIJian/PaTaRM-data • Preview • Updated Apr 1 • 28
AIJian/PaTaRM-14B • Text Generation • 0.5B • Updated Apr 1 • 105
AIJian/PaTaRM • Updated Apr 1