Анна Соколова (Anna Sokolova)'s picture

Анна Соколова (Anna Sokolova)

anna-96

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

liked a dataset 6 days ago

liked a dataset 7 days ago

JudSacr/squad_v2_french_translated

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 16 days ago • 192

upvoted 2 papers 10 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 15 days ago • 59

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published 15 days ago • 218

upvoted a paper 13 days ago

EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents

Paper • 2605.13941 • Published 15 days ago • 24

upvoted a paper 14 days ago

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

Paper • 2605.09608 • Published 18 days ago • 52

upvoted a paper 21 days ago

Lightning Unified Video Editing via In-Context Sparse Attention

Paper • 2605.04569 • Published 22 days ago • 18

upvoted a paper 27 days ago

Video Analysis and Generation via a Semantic Progress Function

Paper • 2604.22554 • Published Apr 24 • 63

upvoted 4 papers about 2 months ago

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation

Paper • 2511.16428 • Published Apr 8 • 2

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 504

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published Apr 6 • 47

upvoted 2 papers 2 months ago

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Paper • 2603.19235 • Published Mar 19 • 95

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

upvoted 7 papers 3 months ago

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Paper • 2603.04743 • Published Mar 5 • 53

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 524

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 245

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 210