nyo16/llama_cpp_ex
Elixir bindings for llama.cpp — run LLMs locally with Metal, CUDA, Vulkan, or CPU. Streaming, chat templates, embeddings, structured output, and concurrent batched inference.
GitHub repository with 7 stars and 1 forks.
Language: Elixir
Topics: cuda, elixir, llamacpp, llm