alumae/kaldi-gstreamer-server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

GitHub repository with 1,093 stars and 338 forks.

Language: Python

Topics: speech-recognition

Latest metric snapshot

2026-06-05: 1,093 stars and 338 forks.

Similar repositories

  1. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,319 stars and 33,421 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest

  2. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  3. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  4. Blaizzy/mlx-audio

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    GitHub repository with 7,193 stars and 619 forks.

    Trending score: 1.07; stars gained: +11; forks gained: +3.

    Language: Python

    Topics: apple-silicon, audio-processing, mlx, multimodal, speech-recognition, speech-synthesis

  5. karamouche/noisekit

    Generate degraded speech datasets for noise-robust ASR benchmarking

    GitHub repository with 15 stars and 0 forks.

    Trending score: 0.50; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: asr, audio, audiomentations, benchmark, cli, dataset-generation

  6. ardha27/AI-Waifu-Vtuber

    AI Vtuber for Streaming on Youtube/Twitch

    GitHub repository with 1,085 stars and 174 forks.

    Trending score: 0.48; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: ai-vtuber, ai-waifu, deepl, openai, speech-recognition, speech-synthesis

Trending in Python

  1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,513 stars and 31,295 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 14,053 stars and 885 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,616 stars and 2,272 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  5. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,435 stars and 28,046 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools

  6. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,540 stars and 4,942 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

Trending topic: speech-recognition

  1. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    GitHub repository with 161,319 stars and 33,421 forks.

    Trending score: 3.69; stars gained: +78; forks gained: +27.

    Language: Python

    Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest

  2. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  3. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  4. Blaizzy/mlx-audio

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    GitHub repository with 7,193 stars and 619 forks.

    Trending score: 1.07; stars gained: +11; forks gained: +3.

    Language: Python

    Topics: apple-silicon, audio-processing, mlx, multimodal, speech-recognition, speech-synthesis

  5. deusjin/subforge

    Rust CLI for AI subtitle workflows: transcribe, segment, translate, evaluate, and burn or mux subtitles.

    GitHub repository with 70 stars and 7 forks.

    Trending score: 0.82; stars gained: +6; forks gained: +0.

    Language: Rust

    Topics: cli, faster-whisper, ffmpeg, llm, openai, rust

  6. mkiol/dsnote

    Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

    GitHub repository with 1,490 stars and 64 forks.

    Trending score: 0.76; stars gained: +5; forks gained: +1.

    Language: C++

    Topics: asr, sailfishos, stt, tts, flatpak-applications, linux-desktop