FluidInference/FluidAudio
Frontier CoreML audio models for your apps: text-to-speech, speech-to-text, voice activity detection, and speaker diarization. Written in Swift, powered by state-of-the-art open source.
GitHub repository with 2,128 stars and 298 forks.
Language: Swift
Topics: coreml, ios, macos, speaker-diarization, speaker-embedding, speaker-identification, speaker-recognition, swift, audio, avfoundation