walkinglabs/hands-on-modern-rl
🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced agentic systems.
GitHub repository with 2,683 stars and 160 forks.
Language: Python
Topics: agent, agentic, agentic-ai, agentic-rl, dpo, grpo, llm, llm-alignment, pytorch, reinforcemen