brontoguana/krasis

Krasis is a hybrid LLM runtime focused on running larger models efficiently on consumer-grade, VRAM-limited hardware.

GitHub repository with 469 stars and 26 forks.

Language: C++

Topics: cpu-inference, gguf-model-support, gpu-inference, high-performance-inference, hybrid-inference, inference-engine, inference-optimization, large-language-models, llama-cpp-alternative, llm-inference


Latest metric snapshot

2026-06-05: 469 stars and 26 forks.

Trending in C++

  1. ggml-org/llama.cpp

    LLM inference in C/C++

    GitHub repository with 114,740 stars and 19,198 forks.

    Trending score: 4.40; stars gained: +304; forks gained: +99.

    Language: C++

    Topics: ggml

  2. duckdb/duckdb

    DuckDB is an analytical in-process SQL database management system

    GitHub repository with 38,622 stars and 3,299 forks.

    Trending score: 3.50; stars gained: +40; forks gained: +6.

    Language: C++

    Topics: analytics, database, embedded-database, olap, sql

  3. vllm-project/vllm-ascend

    Community maintained hardware plugin for vLLM on Ascend

    GitHub repository with 2,201 stars and 1,350 forks.

    Trending score: 3.25; stars gained: +16; forks gained: +22.

    Language: C++

    Topics: ascend, inference, llm, llm-serving, llmops, mlops

  4. electron/electron

    Build cross-platform desktop apps with JavaScript, HTML, and CSS

    GitHub repository with 121,546 stars and 17,236 forks.

    Trending score: 3.02; stars gained: +16; forks gained: +2.

    Language: C++

    Topics: electron, javascript, c-plus-plus, html, css, chrome

  5. ClickHouse/ClickHouse

    ClickHouse® is a real-time analytics database management system

    GitHub repository with 47,835 stars and 8,469 forks.

    Trending score: 2.96; stars gained: +53; forks gained: +10.

    Language: C++

    Topics: ai, analytics, big-data, clickhouse, cloud-native, cpp

  6. LadybirdBrowser/ladybird

    Truly independent web browser

    GitHub repository with 63,772 stars and 3,077 forks.

    Trending score: 2.89; stars gained: +52; forks gained: +5.

    Language: C++

    Topics: browser, browser-engine

Trending topic: cpu-inference

  1. DaggerSlicer34/koboldcpp

    KoboldCpp — local llm cpu.

    GitHub repository with 21 stars and 0 forks.

    Trending score: 1.20; stars gained: +13; forks gained: +0.

    Topics: cpu-inference, gguf, koboldcpp, local-llm, open-source, text-generation

  2. kouhxp/fftext

    Summarize, explain, fact-check, or translate any text, URL, or file. No GPU. No cloud. One command

    GitHub repository with 14 stars and 0 forks.

    Trending score: 0.59; stars gained: +3; forks gained: +0.

    Language: Python

    Topics: cli, cpu-inference, eli5, fact-checking, llama-cpp, llm

  3. artalis-io/bitnet.c

    Minimal, zero-dependency LLM inference in pure C11. CPU-first with NEON/AVX2 SIMD. Flash MoE (pread + LRU expert cache). TurboQuant 3-bit KV compression (8.9x less memory per session). 20+ GGUF quant formats. Compiles to WASM.

    GitHub repository with 20 stars and 7 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: C

    Topics: avx2, c, cpu-inference, gguf, inference, kv-cache