MerkyorLynn/lynn-engine
Lynn 原生 LLM 推理引擎 · W4A8/NVFP4 量化 · 自写 CUDA/Triton kernel · MoE · 投机解码 | Lynn-native LLM inference engine for NVIDIA Blackwell
GitHub repository with 17 stars and 0 forks.
Language: Python
Topics: ai-inference, blackwell, cuda, deep-learning, fp8, gpu, gpu-acceleration, high-performance-computing, inference-engine, llm