Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
GitHub repository with 12,719 stars and 1,450 forks.
Trending score: 1.41; stars gained: +29; forks gained: +1.
Language: C++
Topics: asr, onnx, windows, linux, macos, cpp