15 16

Suzuki Haruki

wildbow17

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

ksolovev/FineNews

upvoted a paper 3 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

liked a model 3 days ago

tencent/Hy-MT2-1.8B

View all activity

Organizations

None yet

liked a dataset 1 day ago

ksolovev/FineNews

Updated Mar 23 • 1.58M • 5

upvoted a paper 3 days ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published 10 days ago • 33

liked a model 3 days ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated 3 days ago • 4.53k • • 585

upvoted a paper 3 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 11 days ago • 142

liked a dataset 3 days ago

yahma/alpaca-cleaned

Viewer • Updated Apr 10, 2023 • 51.8k • 31.8k • 827

liked a dataset 6 days ago

PeakStars/Math-Instruct

Viewer • Updated Apr 20 • 30 • 392k • 3

liked a model 10 days ago

turtle170/NetTinyANN

Updated 8 minutes ago • 3

liked a dataset 13 days ago

OpenAssistant/oasst1

Viewer • Updated May 2, 2023 • 88.8k • 24.4k • 1.52k

upvoted a paper 18 days ago

ViPO: Visual Preference Optimization at Scale

Paper • 2604.24953 • Published 26 days ago • 3

liked a model 24 days ago

jackf857/qwen3-8b-base-new-dpo-hh-helpful-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1

Text Generation • 8B • Updated 24 days ago • 17 • 1

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 843 • 906

liked a dataset about 1 month ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18, 2025 • 450k • 39.6k • 751

liked a model about 1 month ago

Jackrong/Qwopus3.5-0.8B-v3

Image-Text-to-Text • 0.9B • Updated Apr 12 • 268 • 4

upvoted a paper about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

liked a model about 2 months ago

Bane343434353567789/Geminni

Updated Apr 8 • 1

upvoted 3 papers about 2 months ago

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

Paper • 2604.01569 • Published Apr 2 • 13

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

All Roads Lead to Rome: Incentivizing Divergent Thinking in Vision-Language Models

Paper • 2604.00479 • Published Apr 1 • 69

liked a model about 2 months ago

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 710k • • 12.9k

upvoted a paper about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

Suzuki Haruki

AI & ML interests

Recent Activity

Organizations

wildbow17's activity