konjoai/squish

🤖🗜️⚡️ Local LLM server for Apple Silicon. 5.4× faster end-to-end on long contexts vs Ollama, 33% less RAM, INT3 support for Qwen3. OpenAI + Ollama drop-in. Built for repeated long-context workloads on memory-constrained Macs.

GitHub repository with 7 stars and 0 forks.

Language: Python

Topics: apple-silicon, inference-engine, int4, kv-cache, llama-cpp-alternative, llm, llm-infernece, local-ai, local-llm, macos

Open provider repository

Latest metric snapshot

2026-06-04: 7 stars and 0 forks.

Similar repositories

1. Blaizzy/mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

GitHub repository with 4,941 stars and 566 forks.

Trending score: 1.67; stars gained: +48; forks gained: +6.

Language: Python

Topics: llava, llm, mlx, vision-transformer, apple-silicon, idefics
2. waybarrios/vllm-mlx

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

GitHub repository with 1,301 stars and 181 forks.

Trending score: 1.13; stars gained: +12; forks gained: +3.

Language: Python

Topics: apple-silicon, audio-processing, computer-vision, image-understanding, inference, llm
3. shenmintao/marginalia

A library-science-inspired personal knowledge management system with LLM agents

GitHub repository with 40 stars and 7 forks.

Trending score: 0.49; stars gained: +2; forks gained: +0.

Language: Python

Topics: apple-silicon, arm64, desktop-app, docker, document-management, document-search
4. manjunathshiva/turboquant-mlx

Extreme weight + KV cache compression for LLMs on Apple Silicon (MLX implementation of Google's TurboQuant)

GitHub repository with 41 stars and 10 forks.

Trending score: 0.48; stars gained: +2; forks gained: +0.

Language: Python

Topics: apple-silicon, kv-cache, llm, mlx, quantization, turboquant
5. youngbryan97/aura

A sovereign cognitive architecture with IIT 4.0 integrated information, residual-stream affective steering (CAA), Global Workspace Theory, active inference, and 72 consciousness modules — running locally on Apple Silicon.

GitHub repository with 60 stars and 11 forks.

Trending score: 0.33; stars gained: +1; forks gained: +0.

Language: Python

Topics: active-inference, affective-computing, apple-silicon, artificial-consciousness, autonomous-agent, cognitive-architecture
6. cryptopoly/ChaosEngineAI

Local AI workstation — discover, run, chat, benchmark, and generate images from open-weight models. DFlash/DDTree speculative decoding, TurboQuant & TriAttention cache compression strategies, MLX + llama.cpp + vLLM + MTPLX backends.

GitHub repository with 20 stars and 3 forks.

Trending score: 0.10; stars gained: +0; forks gained: +0.

Language: Python

Topics: ai, huggingface, llama-cpp, llm, local-ai, machine-learning

Trending in Python

1. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 180,758 stars and 31,001 forks.

Trending score: 5.79; stars gained: +1,360; forks gained: +322.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
2. microsoft/SkillOpt

SkillOpt is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_skill.md artifacts.

GitHub repository with 4,882 stars and 485 forks.

Trending score: 4.55; stars gained: +340; forks gained: +27.

Language: Python

Topics: agent-skills, self-evolving-agents
3. mukul975/Anthropic-Cybersecurity-Skills

754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0

GitHub repository with 13,233 stars and 1,551 forks.

Trending score: 4.53; stars gained: +301; forks gained: +38.

Language: Python

Topics: ai-agents, claude-code, cybersecurity, incident-response, mitre-attack, penetration-testing
4. virgiliojr94/book-to-skill

Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

GitHub repository with 4,166 stars and 523 forks.

Trending score: 4.43; stars gained: +415; forks gained: +37.

Language: Python
5. anthropics/claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflows - all through natural language commands.

GitHub repository with 130,131 stars and 21,146 forks.

Trending score: 4.42; stars gained: +277; forks gained: +38.

Language: Python
6. CloakHQ/CloakBrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

GitHub repository with 23,119 stars and 1,836 forks.

Trending score: 4.24; stars gained: +250; forks gained: +17.

Language: Python

Topics: anti-detect, bot-detection, browser-automation, chromium, cloudflare, fingerprint

konjoai/squish

Latest metric snapshot

Similar repositories

1. Blaizzy/mlx-vlm

2. waybarrios/vllm-mlx

3. shenmintao/marginalia

4. manjunathshiva/turboquant-mlx

5. youngbryan97/aura

6. cryptopoly/ChaosEngineAI

Trending in Python

1. NousResearch/hermes-agent

2. microsoft/SkillOpt

3. mukul975/Anthropic-Cybersecurity-Skills

4. virgiliojr94/book-to-skill

5. anthropics/claude-code

6. CloakHQ/CloakBrowser

Trending topic: apple-silicon

1. darrylmorley/whatcable

2. Blaizzy/mlx-vlm

3. moona3k/macparakeet

4. dodo-reach/hermes-desktop

5. 359392475-blue-sky/always-yes

6. Arthur-Ficial/apfel