avnlp/llm-finetuning

Advanced LLM fine-tuning techniques: SFT (LoRA, QLoRA, DoRA, P-/Prefix-Tuning), GRPO, DPO, ORPO, KTO & PPO; composable correctness/format rewards + LLM-as-a-Judge evals (DeepEval, Evidently AI) across math, multi-hop, medical & general QA on Llama 3, Mistral, Phi-4, Gemma & Qwen3. Built on TRL, PEFT & Unsloth.

GitHub repository with 7 stars and 3 forks.

Language: Python

Topics: dpo, fine-tuning, grpo, kto, lora, orpo, p-tuning, peft, ppo, qlora

Latest metric snapshot

2026-06-05: 7 stars and 3 forks.

Similar repositories

1. MakazhanAlpamys/Soup

Soup turns the pain of LLM fine-tuning into a simple workflow. One config, one command, done.

GitHub repository with 68 stars and 16 forks.

Trending score: 0.21; stars gained: +0; forks gained: +1.

Language: Python

Topics: artificial-intelligence, cli, dpo, fine-tuning, finetuning, gguf

2. gzhzk/alignsql

Qwen3-8B NL2SQL post-training from SFT to RL

GitHub repository with 5 stars and 0 forks.

Trending score: 0.04; stars gained: +0; forks gained: +0.

Language: Python

Topics: dpo, fine-tuning, llm, lora, nl2sql, qwen

Trending in Python

1. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 182,734 stars and 31,330 forks.

Trending score: 5.95; stars gained: +1,867; forks gained: +361.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

2. Imbad0202/academic-research-skills

Academic Research Skills for Claude Code: research → write → review → revise → finalize

GitHub repository with 27,643 stars and 2,276 forks.

Trending score: 5.52; stars gained: +1,079; forks gained: +89.

Language: Python

Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

3. rohitg00/ai-engineering-from-scratch

Learn it. Build it. Ship it for others.

GitHub repository with 28,771 stars and 4,705 forks.

Trending score: 5.32; stars gained: +1,261; forks gained: +238.

Language: Python

Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

4. vinta/awesome-python

An opinionated list of Python frameworks, libraries, tools, and resources

GitHub repository with 301,435 stars and 28,046 forks.

Trending score: 4.60; stars gained: +518; forks gained: +24.

Language: Python

Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools

5. Alishahryar1/free-claude-code

Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

GitHub repository with 32,540 stars and 4,942 forks.

Trending score: 4.56; stars gained: +467; forks gained: +82.

Language: Python

6. langchain-ai/langchain

The agent engineering platform.

GitHub repository with 138,601 stars and 22,962 forks.

Trending score: 4.53; stars gained: +171; forks gained: +31.

Language: Python

Topics: ai, anthropic, gemini, langchain, llm, openai
