Dong

Yi72

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 2 days ago • 71

upvoted a paper 2 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 3 days ago • 114

liked a dataset 3 months ago

nvidia/Nemotron-Research-GooseReason-0.7M

Viewer • Updated Mar 1 • 673k • 258 • 30

published a dataset 3 months ago

nvidia/Nemotron-Research-GooseReason-0.7M

Viewer • Updated Mar 1 • 673k • 258 • 30

published a model 3 months ago

nvidia/Nemotron-Research-GooseReason-4B-Instruct

Text Generation • 4B • Updated Mar 1 • 67 • 8

New activity in nvidia/Nemotron-Research-GooseReason-4B-Instruct 3 months ago

Update README.md

#2 opened 3 months ago by Ximing

New activity in nvidia/Nemotron-Research-GooseReason-0.7M 3 months ago

Update README.md

#3 opened 3 months ago by Ximing

updated a dataset 3 months ago

nvidia/Nemotron-Research-GooseReason-0.7M

Viewer • Updated Mar 1 • 673k • 258 • 30

updated a model 3 months ago

nvidia/Nemotron-Research-GooseReason-4B-Instruct

Text Generation • 4B • Updated Mar 1 • 67 • 8

upvoted 3 papers 4 months ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published Feb 11 • 55

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published Jan 14 • 55

upvoted a paper 6 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 128

published an article 7 months ago

Can Your LLM Think Like a Professional? Introducing ProfBench

Article • nvidia • Oct 28, 2025 • 21

upvoted a paper 8 months ago

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1, 2025 • 20

liked a model 12 months ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Nov 21, 2025 • 2.42k • 241

upvoted a paper 12 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 146

liked a dataset over 2 years ago

nvidia/HelpSteer

Viewer • Updated Dec 18, 2024 • 37.1k • 2.52k • 248
