Tencent/AngelSlim
A model compression toolkit designed for usability, comprehensiveness, and efficiency.
GitHub repository with 1,310 stars and 151 forks.
Language: Python
Topics: llm, llm-compression, quantization, speculative-decoding, diffusion, vlm, hunyuan, deepseek, qwen, fp4