foldl/chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)
GitHub repository with 893 stars and 70 forks.
Language: C++
Topics: llm, llm-inference
Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)
GitHub repository with 893 stars and 70 forks.
Language: C++
Topics: llm, llm-inference
Trending score 0.04, activity score 0.04, stars gained not enough history, forks gained not enough history.
2026-06-05: 893 stars and 70 forks.
Community maintained hardware plugin for vLLM on Ascend
GitHub repository with 2,196 stars and 1,345 forks.
Trending score: 3.25; stars gained: +16; forks gained: +22.
Language: C++
Topics: ascend, inference, llm, llm-serving, llmops, mlops
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.
GitHub repository with 1,494 stars and 480 forks.
Trending score: 1.82; stars gained: +7; forks gained: +5.
Language: C++
Topics: accelerator, ai, cuda, deepseek, gpu, img-gen
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
GitHub repository with 5,512 stars and 828 forks.
Trending score: 1.44; stars gained: +14; forks gained: +13.
Language: C++
Topics: disaggregation, inference, kvcache, llm, rdma, reinforcement-learning
Low-latency AI engine for mobile devices & wearables
GitHub repository with 5,295 stars and 420 forks.
Trending score: 1.35; stars gained: +25; forks gained: +2.
Language: C++
Topics: android, framework, ios, llamacpp, llm, llm-inference
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
GitHub repository with 4,212 stars and 331 forks.
Trending score: 1.34; stars gained: +13; forks gained: +1.
Language: C++
Topics: amd, llama, llm, llm-inference, local-server, mistral
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
GitHub repository with 1,429 stars and 98 forks.
Trending score: 1.04; stars gained: +1; forks gained: +0.
Language: C++
Topics: amd, deepseek, llama, llm, npu
LLM inference in C/C++
GitHub repository with 114,657 stars and 19,185 forks.
Trending score: 4.40; stars gained: +304; forks gained: +99.
Language: C++
Topics: ggml
DuckDB is an analytical in-process SQL database management system
GitHub repository with 38,615 stars and 3,298 forks.
Trending score: 3.50; stars gained: +40; forks gained: +6.
Language: C++
Topics: sql, database, olap, analytics, embedded-database
Community maintained hardware plugin for vLLM on Ascend
GitHub repository with 2,196 stars and 1,345 forks.
Trending score: 3.25; stars gained: +16; forks gained: +22.
Language: C++
Topics: ascend, inference, llm, llm-serving, llmops, mlops
:electron: Build cross-platform desktop apps with JavaScript, HTML, and CSS
GitHub repository with 121,541 stars and 17,235 forks.
Trending score: 3.02; stars gained: +16; forks gained: +2.
Language: C++
Topics: c-plus-plus, chrome, css, electron, html, javascript
ClickHouse® is a real-time analytics database management system
GitHub repository with 47,822 stars and 8,466 forks.
Trending score: 2.96; stars gained: +53; forks gained: +10.
Language: C++
Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp
Truly independent web browser
GitHub repository with 63,751 stars and 3,075 forks.
Trending score: 2.89; stars gained: +52; forks gained: +5.
Language: C++
Topics: browser, browser-engine
The agent that grows with you
GitHub repository with 181,322 stars and 31,112 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
GitHub repository with 207,323 stars and 31,829 forks.
Trending score: 5.86; stars gained: +3,345; forks gained: +536.
Language: JavaScript
Topics: ai-agents, anthropic, claude, claude-code, developer-tools, llm
DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.
GitHub repository with 18,145 stars and 1,078 forks.
Trending score: 5.71; stars gained: +1,388; forks gained: +87.
Language: Go
Topics: agent, agent-framework, ai-agent, ai-coding, cli, coding-agent
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 12,942 stars and 833 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, claude-code, compression, context-engineering
Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/Gemini via 40+ providers. Auto-fallback, RTK -40% tokens, never hit limits.
GitHub repository with 16,336 stars and 2,455 forks.
Trending score: 5.17; stars gained: +581; forks gained: +85.
Language: JavaScript
Topics: claude-code, cursor, ai-agents, ai-gateway, anthropic, chatgpt
⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more
GitHub repository with 10,593 stars and 885 forks.
Trending score: 4.82; stars gained: +560; forks gained: +62.
Language: TypeScript
Topics: ai-agent, ai-coding-agent, anthropic, bun, claude, cli