Ming Yang
Puitar619
AI & ML interests
RAG, Post Training
Recent Activity
upvoted a paper about 2 months ago
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off published a dataset 3 months ago
Puitar619/DeepSlide-Domain-PaperOrganizations
None yet