wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

GitHub repository with 1,871 stars and 242 forks.

Topics: speaker-diarization, awesome, awesome-list, machine-learning, speech-recognition, speech-processing, deep-learning


24h trend summary

Trending score 0.04, activity score 0.04, stars gained +0, forks gained +0.

Latest metric snapshot

2026-06-02: 1,871 stars and 242 forks.

Similar repositories

  1. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  2. soniqo/speech-swift

    AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

    GitHub repository with 784 stars and 101 forks.

    Trending score: 1.25; stars gained: +2; forks gained: +0.

    Language: Swift

    Topics: apple-silicon, asr, coreml, ios, macos, mlx

  3. FluidInference/FluidAudio

    Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

    GitHub repository with 2,128 stars and 298 forks.

    Trending score: 0.60; stars gained: +3; forks gained: +2.

    Language: Swift

    Topics: coreml, ios, macos, speaker-diarization, speaker-embedding, speaker-identification

  4. Picovoice/pico-cookbook

    On-device AI blueprints for real‑time voice, language, and vision understanding

    GitHub repository with 111 stars and 12 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +1.

    Language: C

    Topics: llms, noise-reduction, ocr, on-device, private, real-time

  5. moziarnj07-sys/doubaoime-asr

    🎤 Enable voice recognition for the Doubao input method using Python; ideal for learning and research with a focus on audio processing.

    GitHub repository with 5 stars and 4 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: asrt, audio-visual-speech-recognition, chinese-speech-recognition, cnn, ctc, dfsmn

  6. wq2012/awesome-diarization

    A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

    GitHub repository with 1,871 stars and 242 forks.

    Trending score: 0.04; stars gained: +0; forks gained: +0.

    Topics: speaker-diarization, awesome, awesome-list, machine-learning, speech-recognition, speech-processing
