Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
GitHub repository with 12,989 stars and 1,486 forks.
Trending score: 3.26; stars gained: +47; forks gained: +6.
Language: C++
Topics: aarch64, android, arm32, asr, cpp, csharp