shuo yu

fishsure

fishsure

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning

upvoted a paper 4 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

upvoted a paper 7 months ago

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

View all activity

Organizations

upvoted a paper about 5 hours ago

StepPO: Step-Aligned Policy Optimization for Agentic Reinforcement Learning

Paper • 2604.18401 • Published 11 days ago • 3

upvoted a paper 4 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 5 days ago • 132

upvoted a paper 7 months ago

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Paper • 2511.14460 • Published Nov 18, 2025 • 22

upvoted a paper 9 months ago

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 92

updated a model about 1 year ago

fishsure/v4

11B • Updated May 12, 2025 • 4

published a model about 1 year ago

fishsure/v4

11B • Updated May 12, 2025 • 4

updated 2 models about 1 year ago

fishsure/v3

11B • Updated May 11, 2025 • 1

fishsure/v1

11B • Updated May 11, 2025 • 2

published a model about 1 year ago

fishsure/v3

11B • Updated May 11, 2025 • 1

updated a model about 1 year ago

fishsure/v2

11B • Updated May 10, 2025 • 1

published 2 models about 1 year ago

fishsure/v2

11B • Updated May 10, 2025 • 1

fishsure/v1

11B • Updated May 11, 2025 • 2

updated a model about 1 year ago

fishsure/internvl3-1b-lora-sft-domain

0.9B • Updated Apr 30, 2025 • 1

published 2 models about 1 year ago

fishsure/internvl3-1b-lora-sft-domain

0.9B • Updated Apr 30, 2025 • 1

fishsure/team-aicrowd-my-model

Updated Apr 18, 2025

updated a model about 1 year ago

fishsure/bge-m3-router

Updated Apr 8, 2025

published 2 models about 1 year ago

fishsure/bge-m3-router

Updated Apr 8, 2025

fishsure/test_model1

Image-Text-to-Text • 11B • Updated Mar 28, 2025

updated a model about 1 year ago

fishsure/test_model1

Image-Text-to-Text • 11B • Updated Mar 28, 2025

updated a dataset almost 2 years ago

fishsure/RM3QA

Updated Sep 3, 2024 • 8

shuo yu

AI & ML interests

Recent Activity

Organizations

fishsure's activity