林思雨

winsign

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model about 20 hours ago

MafeLeon9/ppo-Huggy

Reinforcement Learning • Updated about 20 hours ago • 1

upvoted a paper about 22 hours ago

Unsupervised Process Reward Models

Paper • 2605.10158 • Published 13 days ago • 23

upvoted a paper 1 day ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 4 days ago • 190

upvoted a paper 2 days ago

A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook

Paper • 2605.20266 • Published 6 days ago • 56

liked a dataset 2 days ago

nodogoro/cell2_20260521_hossam_coffee_shop_setting20260521_223750

Viewer • Updated 2 days ago • 2.48k • 21 • 1

liked a dataset 3 days ago

tiiuae/falcon-refinedweb

Viewer • Updated Jun 20, 2023 • 968M • 21.8k • 914

liked a dataset 6 days ago

pythonformer/Trajectory-Stitching-Test-Small

Viewer • Updated 6 days ago • 128k • 34 • 1

liked a dataset 10 days ago

m-a-p/COIG-CQIA

Viewer • Updated Apr 18, 2024 • 44.7k • 10.6k • 728

liked a model 10 days ago

jackxinning/Leanly_AI

Question Answering • 15B • Updated 20 days ago • 6.21k • 120

liked a dataset 13 days ago

Gibrail765/Nexus_Ulaweng

Viewer • Updated about 1 hour ago • 1 • 1.37k • 2

liked a dataset 17 days ago

uonlp/CulturaX

Viewer • Updated Dec 16, 2024 • 7.18B • 34.9k • 624

upvoted a paper 17 days ago

Leveraging Verifier-Based Reinforcement Learning in Image Editing

Paper • 2604.27505 • Published 24 days ago • 57

liked a model 23 days ago

chloeli/qwen-3-14b-rules-spec-msm

Updated 23 days ago • 23 • 1

upvoted a paper 23 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 24 days ago • 218

liked a dataset about 1 month ago

b00l26/VietPET-RoI

Updated about 1 month ago • 23 • 1

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 858 • 906

liked a dataset about 1 month ago

DCAgent2/swebench_verified_random_100_folders_nemotron_terminal_software_engineering__Qw16972da0

Viewer • Updated Apr 14 • 300 • 8

upvoted a paper about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 246

liked a model about 1 month ago

snoovn20267/dFHJnYsfeX2RH3vz

Updated 26 days ago • 1

upvoted a paper about 1 month ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 235
