konjoai/squish

🤖🗜️⚡️ Local LLM server for Apple Silicon. 5.4× faster end-to-end on long contexts vs Ollama, 33% less RAM, INT3 support for Qwen3. OpenAI + Ollama drop-in. Built for repeated long-context workloads on memory-constrained Macs.

GitHub repository with 7 stars and 0 forks.

Language: Python

Topics: apple-silicon, inference-engine, int4, kv-cache, llama-cpp-alternative, llm, llm-infernece, local-ai, local-llm, macos

Open provider repository

Latest metric snapshot

2026-06-04: 7 stars and 0 forks.

Similar repositories

  1. 1. Blaizzy/mlx-vlm

    MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

    GitHub repository with 4,941 stars and 566 forks.

    Trending score: 1.67; stars gained: +48; forks gained: +6.

    Language: Python

    Topics: llava, llm, mlx, vision-transformer, apple-silicon, idefics

  2. 2. waybarrios/vllm-mlx

    OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

    GitHub repository with 1,301 stars and 181 forks.

    Trending score: 1.13; stars gained: +12; forks gained: +3.

    Language: Python

    Topics: apple-silicon, audio-processing, computer-vision, image-understanding, inference, llm

  3. 3. shenmintao/marginalia

    A library-science-inspired personal knowledge management system with LLM agents

    GitHub repository with 40 stars and 7 forks.

    Trending score: 0.49; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: apple-silicon, arm64, desktop-app, docker, document-management, document-search

  4. 4. manjunathshiva/turboquant-mlx

    Extreme weight + KV cache compression for LLMs on Apple Silicon (MLX implementation of Google's TurboQuant)

    GitHub repository with 41 stars and 10 forks.

    Trending score: 0.48; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: apple-silicon, kv-cache, llm, mlx, quantization, turboquant

  5. 5. youngbryan97/aura

    A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running locally on Apple Silicon.

    GitHub repository with 60 stars and 11 forks.

    Trending score: 0.33; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: active-inference, affective-computing, apple-silicon, artificial-consciousness, autonomous-agent, cognitive-architecture

  6. 6. cryptopoly/ChaosEngineAI

    Local AI workstation — discover, run, chat, benchmark, and generate images from open-weight models. DFlash/DDTree speculative decoding, TurboQuant & TriAttention cache compression strategies, MLX + llama.cpp + vLLM + MTPLX backends.

    GitHub repository with 20 stars and 3 forks.

    Trending score: 0.10; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai, huggingface, llama-cpp, llm, local-ai, machine-learning

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 180,758 stars and 31,001 forks.

    Trending score: 5.79; stars gained: +1,360; forks gained: +322.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. microsoft/SkillOpt

    SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

    GitHub repository with 4,882 stars and 485 forks.

    Trending score: 4.55; stars gained: +340; forks gained: +27.

    Language: Python

    Topics: agent-skills, self-evolving-agents

  3. 3. mukul975/Anthropic-Cybersecurity-Skills

    754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

    GitHub repository with 13,233 stars and 1,551 forks.

    Trending score: 4.53; stars gained: +301; forks gained: +38.

    Language: Python

    Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing

  4. 4. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,166 stars and 523 forks.

    Trending score: 4.43; stars gained: +415; forks gained: +37.

    Language: Python

  5. 5. anthropics/claude-code

    Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

    GitHub repository with 130,131 stars and 21,146 forks.

    Trending score: 4.42; stars gained: +277; forks gained: +38.

    Language: Python

  6. 6. CloakHQ/CloakBrowser

    Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

    GitHub repository with 23,119 stars and 1,836 forks.

    Trending score: 4.24; stars gained: +250; forks gained: +17.

    Language: Python

    Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint

Trending topic: apple-silicon

  1. 1. darrylmorley/whatcable

    macOS menu bar app that tells you, in plain English, what each USB-C cable plugged into your Mac can actually do

    GitHub repository with 5,372 stars and 164 forks.

    Trending score: 3.38; stars gained: +66; forks gained: +1.

    Language: Swift

    Topics: apple-silicon, hardware-info, iokit, mac-app, macos, menubar

  2. 2. Blaizzy/mlx-vlm

    MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

    GitHub repository with 4,941 stars and 566 forks.

    Trending score: 1.67; stars gained: +48; forks gained: +6.

    Language: Python

    Topics: llava, llm, mlx, vision-transformer, apple-silicon, idefics

  3. 3. moona3k/macparakeet

    Fast, local voice app for Mac — system-wide dictation, file & YouTube transcription, and meeting recording. Powered by Parakeet TDT on Apple Silicon. Free and open-source.

    GitHub repository with 303 stars and 29 forks.

    Trending score: 1.56; stars gained: +5; forks gained: +1.

    Language: Swift

    Topics: apple-silicon, dictation, local-first, macos, neural-engine, privacy

  4. 4. dodo-reach/hermes-desktop

    The safest, simplest way to manage Hermes from your Mac. Pure SSH. No gateways, no exposed ports, no browser layer.

    GitHub repository with 1,821 stars and 129 forks.

    Trending score: 1.44; stars gained: +32; forks gained: +1.

    Language: Swift

    Topics: agent-tools, apple-silicon, developer-tools, hermes, hermes-agent, macos

  5. 5. 359392475-blue-sky/always-yes

    拍一下 Mac,自动按回车。专为 Claude Code、Cursor、Windsurf 等 AI 编程助手打造。Slap your Mac to press Enter — built for AI coding assistants.

    GitHub repository with 26 stars and 4 forks.

    Trending score: 1.32; stars gained: +18; forks gained: +3.

    Language: Swift

    Topics: accelerometer, ai-coding, apple-silicon, claude-code, cursor, macos

  6. 6. Arthur-Ficial/apfel

    The free AI already on your Mac. CLI tool, OpenAI-compatible server, and interactive chat — all on-device via Apple Intelligence. No API keys, no cloud, no downloads.

    GitHub repository with 5,538 stars and 212 forks.

    Trending score: 1.13; stars gained: +9; forks gained: +0.

    Language: Swift

    Topics: apple-intelligence, apple-silicon, cli, foundationmodels, llm, macos