yingyingxia666/awesome-agentic
A curated reading list of large-language-model RL papers, organized by four research directions: Reasoning RL, Agentic RL, OPD (Off-Policy / On-Policy Distillation / Drift), and Multi-Agent*
GitHub repository with 16 stars and 2 forks.