iBz-04/quaynor
AI inference library for mobile devices
GitHub repository with 30 stars and 0 forks.
Language: Rust
Topics: inference-engine, llm-inference, local-ai, ollama, llamacpp, rust, python-ai, flutter, gguf, react-native
2026-06-06: 30 stars and 0 forks.
NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.
GitHub repository with 998 stars and 69 forks.
Trending score: 2.56; stars gained: +26; forks gained: +1.
Language: Rust
Topics: ai, flutter, godot, godot-engine, godot-plugin, godot4
258 KB WASM runtime for Needle, a 26M-parameter tool-calling transformer. Runs in the browser, Cloudflare Workers, and Node.js. No backend required.
GitHub repository with 40 stars and 5 forks.
Trending score: 0.64; stars gained: +1; forks gained: +0.
Language: Rust
Topics: agent, ai, browser-ai, cloudflare-workers, edge-ai, embedded-ai
Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
GitHub repository with 10 stars and 0 forks.
Trending score: 0.08; stars gained: +0; forks gained: +0.
Language: Rust
Topics: ai-interface, generative-ai, inference-engine, llm, local-llm, macos
A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io
GitHub repository with 101,399 stars and 6,706 forks.
Trending score: 5.98; stars gained: +1,282; forks gained: +106.
Language: Rust
Topics: ai-tools, claude-code, codex, desktop-app, hermes, hermes-agent
An enhancement tool for CodexApp that strives to make Codex easier and more comfortable to use.
GitHub repository with 18,792 stars and 1,190 forks.
Trending score: 5.44; stars gained: +649; forks gained: +58.
Language: Rust
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies
GitHub repository with 62,443 stars and 3,866 forks.
Trending score: 5.37; stars gained: +689; forks gained: +65.
Language: Rust
Topics: agentic-coding, ai-coding, anthropic, claude-code, cli, command-line-tool
An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM
GitHub repository with 49,459 stars and 5,223 forks.
Trending score: 4.98; stars gained: +255; forks gained: +31.
Language: Rust
Topics: acp, ai, ai-agents, mcp
π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.
GitHub repository with 73,948 stars and 9,874 forks.
Trending score: 4.83; stars gained: +185; forks gained: +32.
Language: Rust
Topics: awesome, claude, densepose, esp32, firmware, home-assistant
Your Personal AI super intelligence. Private, Simple and extremely powerful.
GitHub repository with 32,197 stars and 3,120 forks.
Trending score: 4.79; stars gained: +324; forks gained: +36.
Language: Rust
NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.
GitHub repository with 998 stars and 69 forks.
Trending score: 2.56; stars gained: +26; forks gained: +1.
Language: Rust
Topics: ai, flutter, godot, godot-engine, godot-plugin, godot4
From teacher to tiles — a from-scratch LLM distillation & serving engine: custom Triton/CUDA kernels, FSDP distillation, paged-KV continuous batching, speculative decoding, a Rust gateway, a JAX oracle, and interpretability tooling.
GitHub repository with 181 stars and 1 fork.
Trending score: 2.19; stars gained: +11; forks gained: +0.
Language: Python
Topics: cuda, flash-attention, fsdp, inference-engine, jax, knowledge-distillation
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
GitHub repository with 1,122 stars and 197 forks.
Trending score: 1.52; stars gained: +1; forks gained: +1.
Language: Python
Topics: deeplearning, demos, inference, inference-api, inference-engine, machine-learning
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
GitHub repository with 1,331 stars and 230 forks.
Trending score: 1.08; stars gained: +1; forks gained: +1.
Language: C++
Topics: deepseek, glm, inference, inference-engine, large-language-models, llm-inference
258 KB WASM runtime for Needle, a 26M-parameter tool-calling transformer. Runs in the browser, Cloudflare Workers, and Node.js. No backend required.
GitHub repository with 40 stars and 5 forks.
Trending score: 0.64; stars gained: +1; forks gained: +0.
Language: Rust
Topics: agent, ai, browser-ai, cloudflare-workers, edge-ai, embedded-ai
Universal AI Runtime — Execute any model on any hardware
GitHub repository with 57 stars and 1 fork.
Trending score: 0.41; stars gained: +1; forks gained: +0.
Language: Python
Topics: ai, aten, inference-engine, pytorch, triton-kernels