nyo16/llama_cpp_ex

Elixir bindings for llama.cpp — run LLMs locally with Metal, CUDA, Vulkan, or CPU. Streaming, chat templates, embeddings, structured output, and concurrent batched inference.

GitHub repository with 7 stars and 1 fork.

Language: Elixir

Topics: cuda, elixir, llamacpp, llm

24h trend summary

Trending score 0.04, activity score 0.04, stars gained +0, forks gained +0.

Latest metric snapshot

2026-06-05: 7 stars and 1 fork.

Trending in Elixir

  1. plausible/analytics

    Open source, privacy-first web analytics. Lightweight, cookie-free Google Analytics alternative. Self-hosted or cloud.

    GitHub repository with 26,827 stars and 1,585 forks.

    Trending score: 1.92; stars gained: +80; forks gained: +6.

    Language: Elixir

  2. openai/symphony

    Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.

    GitHub repository with 25,047 stars and 2,509 forks.

    Trending score: 1.86; stars gained: +82; forks gained: +17.

    Language: Elixir

  3. firezone/firezone

    Enterprise-ready zero-trust access platform built on WireGuard®.

    GitHub repository with 8,636 stars and 413 forks.

    Trending score: 1.35; stars gained: +3; forks gained: +0.

    Language: Elixir

    Topics: cloud, devsecops, elixir, elixir-lang, firewall, liveview

  4. elixir-lang/elixir

    Elixir is a dynamic, functional language for building scalable and maintainable applications

    GitHub repository with 26,454 stars and 3,491 forks.

    Trending score: 1.16; stars gained: +11; forks gained: +2.

    Language: Elixir

  5. operately/operately

    The open source company operating system.

    GitHub repository with 476 stars and 55 forks.

    Trending score: 0.98; stars gained: +2; forks gained: +0.

    Language: Elixir

    Topics: business, communication, open-source, operations, teams, project-management

  6. supabase/realtime

    Broadcast, Presence, and Postgres Changes via WebSockets

    GitHub repository with 7,574 stars and 437 forks.

    Trending score: 0.76; stars gained: +5; forks gained: +0.

    Language: Elixir

    Topics: elixir, postgres, postgresql, realtime, phoenix, phoenix-framework

Trending topic: cuda

  1. vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    GitHub repository with 81,978 stars and 17,668 forks.

    Trending score: 3.75; stars gained: +79; forks gained: +46.

    Language: Python

    Topics: amd, blackwell, cuda, deepseek, deepseek-v3, gpt

  2. gpustack/gpustack

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    GitHub repository with 5,103 stars and 541 forks.

    Trending score: 2.51; stars gained: +11; forks gained: +1.

    Language: Python

    Topics: ascend, cuda, deepseek, distributed-inference, genai, high-performance-inference

  3. Luce-Org/lucebox-hub

    Fast LLM speculative inference server for consumer hardware.

    GitHub repository with 2,330 stars and 217 forks.

    Trending score: 2.31; stars gained: +17; forks gained: +3.

    Language: C++

    Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090

  4. LMCache/LMCache

    LMCache: Supercharge Your LLM with the Fastest KV Cache Layer

    GitHub repository with 8,419 stars and 1,246 forks.

    Trending score: 2.17; stars gained: +11; forks gained: +6.

    Language: Python

    Topics: amd, cuda, fast, inference, kv-cache, llm

  5. tenstorrent/tt-metal

    TT-NN operator library, and TT-Metalium low-level kernel programming model.

    GitHub repository with 1,494 stars and 480 forks.

    Trending score: 1.82; stars gained: +7; forks gained: +5.

    Language: C++

    Topics: accelerator, ai, cuda, deepseek, gpu, img-gen

  6. AmmarkoV/SAM3DBody-cpp

    Real-time 3D full-body reconstruction from a single camera, Multiperson BVH output, Pure C++ runtime, ONNX + ggml, 70-joint skeleton with hands.

    GitHub repository with 473 stars and 62 forks.

    Trending score: 1.78; stars gained: +2; forks gained: +1.

    Language: C

    Topics: 3d-human-pose, bvh, computer-vision, cpp, cuda, ggml