solved classic rl environments
Nitish Pandey
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
published a model 3 days ago
nitishpandey04/nanochat-d20 upvoted an article 4 months ago
Deriving the PPO Loss from First Principles updated a collection 4 months ago
Classic Reinforcement Learning