lordmathis/llamactl

Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.

GitHub repository with 127 stars and 17 forks.

Language: Go

Topics: llama-cpp, llama-server, llamacpp, llm, llm-inference, openai-api, localllama, localllm, self-hosted, mlx

Open provider repository

Latest metric snapshot

2026-06-05: 127 stars and 17 forks.

Trending in Go

  1. 1. esengine/DeepSeek-Reasonix

    DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

    GitHub repository with 18,256 stars and 1,087 forks.

    Trending score: 5.71; stars gained: +1,388; forks gained: +87.

    Language: Go

    Topics: agent, agent-framework, ai-agent, ai-coding, cli, coding-agent

  2. 2. alibaba/open-code-review

    Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

    GitHub repository with 1,986 stars and 108 forks.

    Trending score: 4.49; stars gained: +545; forks gained: +21.

    Language: Go

    Topics: agent, code-review, code-review-assistant, harness, repository-level-context

  3. 3. ollama/ollama

    Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

    GitHub repository with 173,212 stars and 16,447 forks.

    Trending score: 3.96; stars gained: +222; forks gained: +40.

    Language: Go

    Topics: deepseek, gemma, gemma3, glm, go, golang

  4. 4. kubernetes/kubernetes

    Production-Grade Container Scheduling and Management

    GitHub repository with 122,695 stars and 43,255 forks.

    Trending score: 3.87; stars gained: +65; forks gained: +21.

    Language: Go

    Topics: kubernetes, go, cncf, containers

  5. 5. MatinSenPai/SenPaiScanner

    A light-weight scanner for Cloudflare IPs, written in Golang

    GitHub repository with 1,179 stars and 71 forks.

    Trending score: 3.75; stars gained: +126; forks gained: +5.

    Language: Go

  6. 6. avelino/awesome-go

    A curated list of awesome Go frameworks, libraries and software

    GitHub repository with 174,609 stars and 13,287 forks.

    Trending score: 3.72; stars gained: +196; forks gained: +8.

    Language: Go

    Topics: awesome, awesome-list, go, golang, golang-library, hacktoberfest

Trending topic: llama-cpp

  1. 1. FuJacob/cotabby

    Cotabby is local AI autocomplete for your entire Mac. Open source. On device. Everywhere you type.

    GitHub repository with 707 stars and 36 forks.

    Trending score: 2.63; stars gained: +27; forks gained: +5.

    Language: Swift

    Topics: accessibility, ai, autocomplete, cotabby, cotypist, llama

  2. 2. antoinezambelli/forge

    A Python framework for self-hosted LLM tool-calling and multi-step agentic workflows

    GitHub repository with 1,997 stars and 141 forks.

    Trending score: 2.48; stars gained: +24; forks gained: +2.

    Language: Python

    Topics: agentic-ai, agentic-workflow, agents, function-calling, llama-cpp, llamafile

  3. 3. Luce-Org/lucebox-hub

    Fast LLM speculative inference server for consumer hardware.

    GitHub repository with 2,331 stars and 217 forks.

    Trending score: 2.31; stars gained: +17; forks gained: +3.

    Language: C++

    Topics: kernel, llama-cpp, local-ai, nvidia-cuda, qwen, rtx3090

  4. 4. hogeheer499-commits/strix-halo-guide

    AMD Strix Halo local LLM guide: direct 100.0 t/s 30B Qwen MoE on Ryzen AI MAX+ 395 / Radeon 8060S. Setup, benchmarks, raw evidence.

    GitHub repository with 91 stars and 4 forks.

    Trending score: 0.98; stars gained: +7; forks gained: +0.

    Language: Python

    Topics: amd, benchmark, gfx1151, inference, llama-cpp, llm

  5. 5. kouhxp/fftext

    Summarize, explain, fact-check, or translate any text, URL, or file. No GPU. No cloud. One command

    GitHub repository with 14 stars and 0 forks.

    Trending score: 0.59; stars gained: +3; forks gained: +0.

    Language: Python

    Topics: cli, cpu-inference, eli5, fact-checking, llama-cpp, llm

  6. 6. Scottcjn/ram-coffers

    NUMA-distributed weight banking for LLM inference on IBM POWER8. 147 t/s (8.8x stock). Part of the Proof of Physical AI stack.

    GitHub repository with 138 stars and 31 forks.

    Trending score: 0.56; stars gained: +2; forks gained: +2.

    Language: C

    Topics: llama-cpp, llm, numa, power8, ai-inference, depin