michelangeloromerochisco/ternative
Inference engine for ternary-weight LLMs with runtime LoRA - the llama.cpp of BitNet models
GitHub repository with 7 stars and 0 forks.
Language: C++
Topics: bitnet, cpp, cuda, gguf, inference, llm, lora, openai-compatible, ternary