iBz-04/quaynor

AI inference library for mobile devices

GitHub repository with 30 stars and 0 forks.

Language: Rust

Topics: inference-engine, llm-inference, local-ai, ollama, llamacpp, rust, python-ai, flutter, gguf, react-native

Open provider repository

Latest metric snapshot

2026-06-06: 30 stars and 0 forks.

Similar repositories

  1. 1. nobodywho-ooo/nobodywho

    NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.

    GitHub repository with 998 stars and 69 forks.

    Trending score: 2.56; stars gained: +26; forks gained: +1.

    Language: Rust

    Topics: ai, flutter, godot, godot-engine, godot-plugin, godot4

  2. 2. Geekgineer/needle-rs

    258 KB WASM runtime for Needle a 26M-parameter tool-calling transformer. Runs in browser, Cloudflare Workers, and Node.js. No backend required.

    GitHub repository with 40 stars and 5 forks.

    Trending score: 0.64; stars gained: +1; forks gained: +0.

    Language: Rust

    Topics: agent, ai, browser-ai, cloudflare-workers, edge-ai, embedded-ai

  3. 3. defai-digital/ax-engine

    Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

    GitHub repository with 10 stars and 0 forks.

    Trending score: 0.08; stars gained: +0; forks gained: +0.

    Language: Rust

    Topics: ai-interface, generative-ai, inference-engine, llm, local-llm, macos

Trending in Rust

  1. 1. farion1231/cc-switch

    A cross-platform desktop All-in-One assistant for Claude Code, Codex, OpenCode, OpenClaw, Gemini CLI & Hermes Agent. Only official website: ccswitch.io

    GitHub repository with 101,399 stars and 6,706 forks.

    Trending score: 5.98; stars gained: +1,282; forks gained: +106.

    Language: Rust

    Topics: ai-tools, claude-code, codex, desktop-app, hermes, hermes-agent

  2. 2. BigPizzaV3/CodexPlusPlus

    An enhanced tool for CodexApp, striving to make Codex better to use and more comfortable 一个CodexApp的增强工具,努力让Codex变得更好用更舒服

    GitHub repository with 18,792 stars and 1,190 forks.

    Trending score: 5.44; stars gained: +649; forks gained: +58.

    Language: Rust

  3. 3. rtk-ai/rtk

    CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies

    GitHub repository with 62,443 stars and 3,866 forks.

    Trending score: 5.37; stars gained: +689; forks gained: +65.

    Language: Rust

    Topics: agentic-coding, ai-coding, anthropic, claude-code, cli, command-line-tool

  4. 4. aaif-goose/goose

    an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

    GitHub repository with 49,459 stars and 5,223 forks.

    Trending score: 4.98; stars gained: +255; forks gained: +31.

    Language: Rust

    Topics: acp, ai, ai-agents, mcp

  5. 5. ruvnet/RuView

    π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.

    GitHub repository with 73,948 stars and 9,874 forks.

    Trending score: 4.83; stars gained: +185; forks gained: +32.

    Language: Rust

    Topics: awesome, claude, densepose, esp32, firmware, home-assistant

  6. 6. tinyhumansai/openhuman

    Your Personal AI super intelligence. Private, Simple and extremely powerful.

    GitHub repository with 32,197 stars and 3,120 forks.

    Trending score: 4.79; stars gained: +324; forks gained: +36.

    Language: Rust

Trending topic: inference-engine

  1. 1. nobodywho-ooo/nobodywho

    NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.

    GitHub repository with 998 stars and 69 forks.

    Trending score: 2.56; stars gained: +26; forks gained: +1.

    Language: Rust

    Topics: ai, flutter, godot, godot-engine, godot-plugin, godot4

  2. 2. zengxiao-he/tessera

    From teacher to tiles — a from-scratch LLM distillation & serving engine: custom Triton/CUDA kernels, FSDP distillation, paged-KV continuous batching, speculative decoding, a Rust gateway, a JAX oracle, and interpretability tooling.

    GitHub repository with 181 stars and 1 forks.

    Trending score: 2.19; stars gained: +11; forks gained: +0.

    Language: Python

    Topics: cuda, flash-attention, fsdp, inference-engine, jax, knowledge-distillation

  3. 3. qualcomm/ai-hub-models

    Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

    GitHub repository with 1,122 stars and 197 forks.

    Trending score: 1.52; stars gained: +1; forks gained: +1.

    Language: Python

    Topics: deeplearning, demos, inference, inference-api, inference-engine, machine-learning

  4. 4. jd-opensource/xllm

    A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

    GitHub repository with 1,331 stars and 230 forks.

    Trending score: 1.08; stars gained: +1; forks gained: +1.

    Language: C++

    Topics: deepseek, glm, inference, inference-engine, large-language-models, llm-inference

  5. 5. Geekgineer/needle-rs

    258 KB WASM runtime for Needle a 26M-parameter tool-calling transformer. Runs in browser, Cloudflare Workers, and Node.js. No backend required.

    GitHub repository with 40 stars and 5 forks.

    Trending score: 0.64; stars gained: +1; forks gained: +0.

    Language: Rust

    Topics: agent, ai, browser-ai, cloudflare-workers, edge-ai, embedded-ai

  6. 6. NeuroBrix/neurobrix

    Universal AI Runtime — Execute any model on any hardware

    GitHub repository with 57 stars and 1 forks.

    Trending score: 0.41; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: ai, aten, inference-engine, pytorch, triton-kernels