KhanCold/spader
Reinforcement learning framework for tool-augmented multi-answer question answering with step-wise peer advantage and diversity-aware exploration rewards.
GitHub repository with 33 stars and 3 forks.
Language: Python
Topics: agent, reinforcement-learning