raketenkater/llm-server

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

GitHub repository with 226 stars and 11 forks.

Language: Go

Topics: cuda, gguf, llama-cpp, llm, metal, moe, multi-gpu, golang, inference-server, llamacpp

Open provider repository

24h trend summary

Trending score 1.23, freshness score 0.79, stars gained +3, forks gained +0.

Latest metric snapshot

2026-06-15: 226 stars and 11 forks.

Similar repositories

1. raketenkater/llm-server

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

GitHub repository with 226 stars and 11 forks.

Trending score: 1.23; stars gained: +3; forks gained: +0.

Language: Go

Topics: cuda, gguf, llama-cpp, llm, metal, moe
2. parca-dev/parca-agent

eBPF based always-on CPU/GPU profiler auto-discovering targets in Kubernetes and systemd, zero code changes or restarts needed!

GitHub repository with 728 stars and 90 forks.

Trending score: 0.40; stars gained: +0; forks gained: +0.

Language: Go

Topics: ebpf, profiling, pprof, performance, kubernetes, observability

Trending in Go

1. kenn-io/agentsview

Local-first session search, analytics, insights, and token use statistics for coding agents, supporting Claude Code, Codex, and more than 20 other agents.

GitHub repository with 2,621 stars and 232 forks.

Trending score: 4.99; stars gained: +524; forks gained: +37.

Language: Go
2. alibaba/open-code-review

Open-source & free — Battle-tested at Alibaba's scale. Hybrid architecture code review tool: deterministic pipelines + LLM Agent, precise line-level comments, built-in fine-tuned ruleset (NPE, thread-safety, XSS, SQL injection), OpenAI & Anthropic compatible.

GitHub repository with 7,190 stars and 421 forks.

Trending score: 4.97; stars gained: +315; forks gained: +24.

Language: Go

Topics: agent, code-review, code-review-assistant, harness, repository-level-context
3. esengine/DeepSeek-Reasonix

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

GitHub repository with 22,195 stars and 1,334 forks.

Trending score: 4.91; stars gained: +265; forks gained: +16.

Language: Go

Topics: agent, agent-framework, ai-agent, ai-coding, cli, coding-agent
4. QuantumNous/new-api

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management. 🍥

GitHub repository with 38,846 stars and 8,824 forks.

Trending score: 4.77; stars gained: +261; forks gained: +62.

Language: Go

Topics: ai-gateway, claude, deepseek, gemini, newapi, openai
5. Paca-AI/paca

AI-native, free, open-source alternative to Jira, Trello, ClickUp & Monday. Built for Scrum teams where humans and AI agents collaborate as equals — on the same board, the same sprints, the same goals. Self-hosted. Fully customizable via config and plugins.

GitHub repository with 886 stars and 48 forks.

Trending score: 4.60; stars gained: +309; forks gained: +24.

Language: Go

Topics: ai-agent, bdd, clickup-alternative, jira-alternative, mcp, open-source
6. router-for-me/CLIProxyAPI

Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code, Grok Build as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini 3.1 Pro, GPT 5.5, Grok 4.3, Claude model through API

GitHub repository with 37,546 stars and 6,192 forks.

Trending score: 4.51; stars gained: +157; forks gained: +21.

Language: Go

Topics: antigravity, claude-code, cluade, codex, gemini, openai

raketenkater/llm-server

24h trend summary

Latest metric snapshot

Similar repositories

1. raketenkater/llm-server

2. parca-dev/parca-agent

Trending in Go

1. kenn-io/agentsview

2. alibaba/open-code-review

3. esengine/DeepSeek-Reasonix

4. QuantumNous/new-api

5. Paca-AI/paca

6. router-for-me/CLIProxyAPI

Trending topic: cuda

1. LMCache/LMCache

2. vllm-project/vllm

3. sgl-project/sglang

4. Luce-Org/lucebox-hub

5. NVlabs/cuda-oxide

6. NVIDIA/TensorRT-LLM