modelscope/ms-swift
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).
GitHub repository with 14,407 stars and 1,458 forks.
Language: Python
Topics: deepseek-r1, embedding, grpo, internvl, liger, llama, llama4, llm, lora, megatron