Taiwei Shi
MaksimSTW
AI & ML interests
reinforcement learning, alignment, human-AI collaboration, and computational social science
Recent Activity
liked a dataset about 9 hours ago
lime-nlp/OS-Blind upvoted a paper about 10 hours ago
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents authored a paper 24 days ago
Video-Based Reward Modeling for Computer-Use Agents