thc1006/qwen3.6-speculative-decoding-rtx3090

First public benchmark of llama.cpp speculative decoding on Qwen3.6-35B-A3B with a single RTX 3090 (post PR #19493 merge, 2026-04-19). 19 configurations covering ngram-cache, ngram-mod, and classic draft with vocab-matched Qwen3.5-0.8B. Finding: no variant achieves net speedup on Ampere + A3B MoE. Raw JSON, plots, full reproducibility.

GitHub repository with 28 stars and 1 forks.

Language: Python

Topics: ampere, benchmark, cuda, ggml, inference-benchmark, llama-cpp, local-llm, mixture-of-experts, moe, qwen

Open provider repository

Latest metric snapshot

2026-06-05: 28 stars and 1 forks.

Trending in Python

1. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 27,902 stars and 1,891 forks.

Trending score: 6.49; stars gained: +2,776; forks gained: +250.

Language: Python

Topics: agent, ai, anthropic, claude-code, compression, context-engineering
2. harry0703/MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

GitHub repository with 88,027 stars and 12,625 forks.

Trending score: 6.02; stars gained: +1,097; forks gained: +218.

Language: Python

Topics: ai, automation, chatgpt, moviepy, python, shortvideo
3. pewdiepie-archdaemon/odysseus

Self-hosted AI workspace.

GitHub repository with 71,300 stars and 9,086 forks.

Trending score: 5.98; stars gained: +834; forks gained: +140.

Language: Python
4. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 193,883 stars and 33,934 forks.

Trending score: 5.92; stars gained: +753; forks gained: +209.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
5. NVIDIA/SkillSpector

Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.

GitHub repository with 5,654 stars and 427 forks.

Trending score: 5.61; stars gained: +874; forks gained: +76.

Language: Python
6. rohitg00/ai-engineering-from-scratch

Learn it. Build it. Ship it for others.

GitHub repository with 32,527 stars and 5,342 forks.

Trending score: 5.59; stars gained: +762; forks gained: +135.

Language: Python

Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

thc1006/qwen3.6-speculative-decoding-rtx3090

Latest metric snapshot

Trending in Python

1. chopratejas/headroom

2. harry0703/MoneyPrinterTurbo

3. pewdiepie-archdaemon/odysseus

4. NousResearch/hermes-agent

5. NVIDIA/SkillSpector

6. rohitg00/ai-engineering-from-scratch