LiangSu8899/FlashRT

FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GR00T N1.6, and Pi0-FAST. It also supports LLMs, e.g., Qwen3.6-27B.

GitHub repository with 281 stars and 32 forks.

Language: C++

Topics: cuda, cuda-kernels, realtime-inference, realtime-vla, gr00t, gr00t-n1-6-3b, pi, pi05, vla, qwen


24h trend summary

Trending score 1.65, activity score 0.05, stars gained +7, forks gained +0.

Latest metric snapshot

2026-06-05: 281 stars and 32 forks.

Similar repositories

  1. Luce-Org/lucebox-hub

    Fast LLM speculative inference server for consumer hardware.

    GitHub repository with 2,331 stars and 217 forks.

    Trending score: 2.31; stars gained: +17; forks gained: +3.

    Language: C++

    Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090

  2. tenstorrent/tt-metal

    TT-NN operator library and TT-Metalium low-level kernel programming model.

    GitHub repository with 1,494 stars and 480 forks.

    Trending score: 1.82; stars gained: +7; forks gained: +5.

    Language: C++

    Topics: accelerator, ai, cuda, deepseek, gpu, img-gen

  3. LiangSu8899/FlashRT

    FlashRT is a high-performance realtime inference engine for small-batch, latency-sensitive AI workloads. The flagship integration is production VLA control for Pi0, Pi0.5, GR00T N1.6, and Pi0-FAST. It also supports LLMs, e.g., Qwen3.6-27B.

    GitHub repository with 281 stars and 32 forks.

    Trending score: 1.65; stars gained: +7; forks gained: +0.

    Language: C++

    Topics: cuda, cuda-kernels, realtime-inference, realtime-vla, gr00t, gr00t-n1-6-3b

  4. SunOner/sunone_aimbot_2

    AI-based aimbot for all FPS and TPS games.

    GitHub repository with 514 stars and 105 forks.

    Trending score: 1.56; stars gained: +5; forks gained: +1.

    Language: C++

    Topics: ai, cpp, gpu, yolov11, yolov8, ai-aimbot

  5. lupinemachines/lupine

    LUPINE is a GPU-over-IP bridge that allows GPUs on remote machines to be attached to CPU-only machines.

    GitHub repository with 2,213 stars and 112 forks.

    Trending score: 1.21; stars gained: +14; forks gained: +0.

    Language: C++

    Topics: cublas, cuda, cudnn, gpu, mlops, networking

  6. chrxh/alien

    ALIEN is a CUDA-powered artificial life simulation program.

    GitHub repository with 5,422 stars and 185 forks.

    Trending score: 0.98; stars gained: +1; forks gained: +0.

    Language: C++

    Topics: artificial-life, open-ended-evolution, agent-based-simulation, physics-engine, cuda

Trending in C++

  1. ggml-org/llama.cpp

    LLM inference in C/C++

    GitHub repository with 114,728 stars and 19,195 forks.

    Trending score: 4.40; stars gained: +304; forks gained: +99.

    Language: C++

    Topics: ggml

  2. duckdb/duckdb

    DuckDB is an analytical in-process SQL database management system

    GitHub repository with 38,621 stars and 3,299 forks.

    Trending score: 3.50; stars gained: +40; forks gained: +6.

    Language: C++

    Topics: analytics, database, embedded-database, olap, sql

  3. vllm-project/vllm-ascend

    Community-maintained hardware plugin for vLLM on Ascend

    GitHub repository with 2,199 stars and 1,350 forks.

    Trending score: 3.25; stars gained: +16; forks gained: +22.

    Language: C++

    Topics: ascend, inference, llm, llm-serving, llmops, mlops

  4. electron/electron

    Build cross-platform desktop apps with JavaScript, HTML, and CSS

    GitHub repository with 121,543 stars and 17,235 forks.

    Trending score: 3.02; stars gained: +16; forks gained: +2.

    Language: C++

    Topics: electron, javascript, c-plus-plus, html, css, chrome

  5. ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    GitHub repository with 47,832 stars and 8,468 forks.

    Trending score: 2.96; stars gained: +53; forks gained: +10.

    Language: C++

    Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp

  6. LadybirdBrowser/ladybird

    Truly independent web browser

    GitHub repository with 63,763 stars and 3,075 forks.

    Trending score: 2.89; stars gained: +52; forks gained: +5.

    Language: C++

    Topics: browser, browser-engine

Trending topic: cuda

  1. vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    GitHub repository with 81,990 stars and 17,671 forks.

    Trending score: 3.75; stars gained: +79; forks gained: +46.

    Language: Python

    Topics: amd, blackwell, cuda, deepseek, deepseek-v3, gpt

  2. gpustack/gpustack

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    GitHub repository with 5,106 stars and 542 forks.

    Trending score: 2.51; stars gained: +11; forks gained: +1.

    Language: Python

    Topics: ascend, cuda, deepseek, distributed-inference, genai, high-performance-inference

  3. Luce-Org/lucebox-hub

    Fast LLM speculative inference server for consumer hardware.

    GitHub repository with 2,331 stars and 217 forks.

    Trending score: 2.31; stars gained: +17; forks gained: +3.

    Language: C++

    Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090

  4. LMCache/LMCache

    LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

    GitHub repository with 8,422 stars and 1,246 forks.

    Trending score: 2.17; stars gained: +11; forks gained: +6.

    Language: Python

    Topics: amd, cuda, fast, inference, kv-cache, llm

  5. tenstorrent/tt-metal

    TT-NN operator library and TT-Metalium low-level kernel programming model.

    GitHub repository with 1,494 stars and 480 forks.

    Trending score: 1.82; stars gained: +7; forks gained: +5.

    Language: C++

    Topics: accelerator, ai, cuda, deepseek, gpu, img-gen

  6. AmmarkoV/SAM3DBody-cpp

    Real-time 3D full-body reconstruction from a single camera, with multi-person BVH output, a pure C++ runtime, ONNX + ggml, and a 70-joint skeleton with hands.

    GitHub repository with 473 stars and 62 forks.

    Trending score: 1.78; stars gained: +2; forks gained: +1.

    Language: C

    Topics: 3d-human-pose, bvh, computer-vision, cpp, cuda, ggml