qwen repositories

Discover trending repositories tagged qwen, ranked by recent growth and activity.

1. decolua/9router

Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RTK -40% tokens, never hit limits.

GitHub repository with 16,362 stars and 2,458 forks.

Trending score: 5.17; stars gained: +581; forks gained: +85.

Language: JavaScript

Topics: claude-code, cursor, ai-agents, ai-gateway, anthropic, chatgpt
2. ollama/ollama

Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

GitHub repository with 173,212 stars and 16,447 forks.

Trending score: 3.96; stars gained: +222; forks gained: +40.

Language: Go

Topics: deepseek, gemma, gemma3, glm, go, golang
3. vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

GitHub repository with 81,978 stars and 17,668 forks.

Trending score: 3.75; stars gained: +79; forks gained: +46.

Language: Python

Topics: amd, blackwell, cuda, deepseek, deepseek-v3, gpt
4. huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

GitHub repository with 161,297 stars and 33,416 forks.

Trending score: 3.69; stars gained: +78; forks gained: +27.

Language: Python

Topics: nlp, natural-language-processing, pytorch, pytorch-transformers, transformer, model-hub
5. gpustack/gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

GitHub repository with 5,103 stars and 541 forks.

Trending score: 2.51; stars gained: +11; forks gained: +1.

Language: Python

Topics: ascend, cuda, deepseek, distributed-inference, genai, high-performance-inference
6. Luce-Org/lucebox-hub

Fast LLM speculative inference server for consumer hardware.

GitHub repository with 2,330 stars and 217 forks.

Trending score: 2.31; stars gained: +17; forks gained: +3.

Language: C++

Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090
7. dyad-sh/dyad

Local, open-source AI app builder for power users ✨ v0 / Lovable / Replit / Bolt alternative 🌟 Star if you like it!

GitHub repository with 20,518 stars and 2,435 forks.

Trending score: 2.05; stars gained: +8; forks gained: +7.

Language: TypeScript

Topics: ai-app-builder, anthropic, artificial-intelligence, bolt, deepseek, gemini
8. lightseekorg/tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

GitHub repository with 1,366 stars and 141 forks.

Trending score: 1.86; stars gained: +6; forks gained: +2.

Language: Python

Topics: blackwell, deepseek, gpt-oss, kimi, lightseek, llm
9. tailcallhq/forgecode

AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models

GitHub repository with 7,388 stars and 1,439 forks.

Trending score: 1.76; stars gained: +10; forks gained: +2.

Language: Rust

Topics: ai-pair-programming, ai-workflows, artifical-intelligense, claude-3-7-sonnet, claude-4, claude-4-sonnet
10. diegosouzapw/OmniRoute

Never stop coding. Free AI gateway: one endpoint, 160+ providers (50+ free), connect Claude Code, Codex, Cursor, Cline & Copilot to FREE Claude/GPT/Gemini. RTK+Caveman stacked compression saves 15-95% tokens, smart auto-fallback, MCP/A2A, multimodal APIs, Desktop/PWA.

GitHub repository with 5,742 stars and 993 forks.

Trending score: 1.72; stars gained: +65; forks gained: +15.

Language: TypeScript

Topics: a2a, ai-agents, ai-gateway, anthropic, claude, claude-code
11. sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

GitHub repository with 28,889 stars and 6,343 forks.

Trending score: 1.72; stars gained: -55; forks gained: +18.

Language: Python

Topics: attention, blackwell, cuda, deepseek, diffusion, glm
12. LiangSu8899/FlashRT

FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GROOT N1.6, and Pi0-FAST. Also support llm e.g, qwen3.6-27B

GitHub repository with 281 stars and 32 forks.

Trending score: 1.65; stars gained: +7; forks gained: +0.

Language: C++

Topics: cuda, cuda-kernels, realtime-inference, realtime-vla, gr00t, gr00t-n1-6-3b
13. labring/FastGPT

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

GitHub repository with 28,268 stars and 7,128 forks.

Trending score: 1.55; stars gained: +24; forks gained: +4.

Language: TypeScript

Topics: agent, claude, deepseek, llm, mcp, nextjs
14. lemonade-sdk/lemonade

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

GitHub repository with 4,212 stars and 331 forks.

Trending score: 1.34; stars gained: +13; forks gained: +1.

Language: C++

Topics: amd, llama, llm, llm-inference, local-server, mistral
15. hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

GitHub repository with 71,891 stars and 8,787 forks.

Trending score: 1.30; stars gained: +22; forks gained: +2.

Language: Python

Topics: fine-tuning, llama, llm, peft, transformers, rlhf
16. WangRongsheng/awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

GitHub repository with 8,482 stars and 887 forks.

Trending score: 1.12; stars gained: +10; forks gained: +2.

Topics: awesome-list, book, course, large-language-models, llama, llm
17. helixml/helix

♾️ Private Agent Fleet with Spec Coding. Each agent gets their own GPU-accelerated desktop. Run Claude, Codex, Gemini and open models on a full private AI Stack ♾️

GitHub repository with 779 stars and 75 forks.

Trending score: 1.06; stars gained: +2; forks gained: +0.

Language: Go

Topics: golang, llm, openai, self-hosted, api, llm-agent
18. modelstudioai/cli

Official Model Studio CLI（阿里云百炼 CLI）built for AI Agent frameworks, exposing models, search, multimodal, and workflow capabilities as structured tool calls.

GitHub repository with 194 stars and 10 forks.

Trending score: 1.04; stars gained: +10; forks gained: +1.

Language: TypeScript

Topics: agent, aliyun, cli, happyhorse, modelstudio, qwen
19. shyftlabs/continuum

Continuum — the agent runtime by ShyftLabs. Build, orchestrate, ship.

GitHub repository with 69 stars and 6 forks.

Trending score: 0.93; stars gained: +8; forks gained: +0.

Language: Python

Topics: agent-framework, agentic-ai, ai-agents, ai-orchestration, anthropic, enterprise-ai
20. mai-yyy/multi-llm-mcp

一个让 Claude Code 调用 Codex 干活，并可以同时调用多个模型（GPT、Kimi、DeepSeek 等）的 MCP 工具。

GitHub repository with 42 stars and 0 forks.

Trending score: 0.92; stars gained: +8; forks gained: +0.

Language: Python

Topics: claude-code, codex, codex-cli, deepseek, fastmcp, kimi
21. xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

GitHub repository with 9,336 stars and 832 forks.

Trending score: 0.92; stars gained: +6; forks gained: +1.

Language: Python

Topics: ggml, pytorch, chatglm, deployment, flan-t5, llm
22. guoqingbao/xinfer

Blazing-fast LLM inference in pure Rust. No PyTorch and Python runtime.

GitHub repository with 250 stars and 32 forks.

Trending score: 0.81; stars gained: +5; forks gained: +0.

Language: Rust

Topics: agent, llm, qwen, rust, vllm
23. pedrofariasx/qwenproxy

Proxy API OpenAI-compatible que usa automação com Playwright para rotear requisições para modelos do Qwen com suporte a múltiplas contas, tools e sessões persistentes.

GitHub repository with 106 stars and 50 forks.

Trending score: 0.76; stars gained: +5; forks gained: +0.

Language: TypeScript

Topics: docker, hono, llm, openai, playwright, proxy
24. zcweah1981/awesome-hermes-agent-zh

Hermes Agent中文站- 中文实战入口：上手路径、国内落地、OpenClaw 共存迁移、排障参考与可下载方案包。

GitHub repository with 22 stars and 3 forks.

Trending score: 0.75; stars gained: +4; forks gained: +0.

Language: Python

Topics: agent, ai, ai-agent, ai-agents, ai-workflow, chinese-docs