chrisliu298/awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

GitHub repository with 320 stars and 7 forks.

Topics: on-policy-distillation, awesome, llm, rl, opd, awesome-list, distillation, gkd, knowledge-distillation, llm-distillation

Open provider repository

24h trend summary

Trending score 1.93, activity score 0.05, stars gained +11, forks gained +1.

Latest metric snapshot

2026-06-13: 320 stars and 7 forks.

Similar repositories

1. chrisliu298/awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

GitHub repository with 320 stars and 7 forks.

Trending score: 1.93; stars gained: +11; forks gained: +1.

Topics: on-policy-distillation, awesome, llm, rl, opd, awesome-list
2. thunlp/OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

GitHub repository with 542 stars and 28 forks.

Trending score: 0.98; stars gained: +8; forks gained: +1.

Language: Python

Topics: llms, mechanism, on-policy-distillation
3. THU-BPM/RLCSD

Source code of paper "RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation"

GitHub repository with 22 stars and 1 forks.

Trending score: 0.61; stars gained: +3; forks gained: +0.

Language: Python

Topics: large-language-models, llm, on-policy-distillation, opsd, reinforcement-learning, self-distillation

Trending topic: on-policy-distillation

1. chrisliu298/awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

GitHub repository with 320 stars and 7 forks.

Trending score: 1.93; stars gained: +11; forks gained: +1.

Topics: on-policy-distillation, awesome, llm, rl, opd, awesome-list
2. thunlp/OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

GitHub repository with 542 stars and 28 forks.

Trending score: 0.98; stars gained: +8; forks gained: +1.

Language: Python

Topics: llms, mechanism, on-policy-distillation
3. THU-BPM/RLCSD

Source code of paper "RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation"

GitHub repository with 22 stars and 1 forks.

Trending score: 0.61; stars gained: +3; forks gained: +0.

Language: Python

Topics: large-language-models, llm, on-policy-distillation, opsd, reinforcement-learning, self-distillation