cryptopoly/ChaosEngineAI
Local AI workstation — discover, run, chat, benchmark, and generate images from open-weight models. DFlash/DDTree speculative decoding, TurboQuant & TriAttention cache compression strategies, MLX + llama.cpp + vLLM + MTPLX backends.
GitHub repository with 20 stars and 3 forks.
Language: Python