alumae/kaldi-gstreamer-server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
GitHub repository with 1,093 stars and 338 forks.
Language: Python
Topics: speech-recognition
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
GitHub repository with 1,093 stars and 338 forks.
Language: Python
Topics: speech-recognition
2026-06-05: 1,093 stars and 338 forks.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
GitHub repository with 161,319 stars and 33,421 forks.
Trending score: 3.69; stars gained: +78; forks gained: +27.
Language: Python
Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
GitHub repository with 16,750 stars and 1,720 forks.
Trending score: 1.93; stars gained: +56; forks gained: +2.
Language: Python
Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection
An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.
GitHub repository with 182 stars and 7 forks.
Trending score: 1.58; stars gained: +40; forks gained: +5.
Language: Python
Topics: asr, llm, sd, sdr, speech-recognition
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
GitHub repository with 7,193 stars and 619 forks.
Trending score: 1.07; stars gained: +11; forks gained: +3.
Language: Python
Topics: apple-silicon, audio-processing, mlx, multimodal, speech-recognition, speech-synthesis
Generate degraded speech datasets for noise-robust ASR benchmarking
GitHub repository with 15 stars and 0 forks.
Trending score: 0.50; stars gained: +2; forks gained: +0.
Language: Python
Topics: asr, audio, audiomentations, benchmark, cli, dataset-generation
AI Vtuber for Streaming on Youtube/Twitch
GitHub repository with 1,085 stars and 174 forks.
Trending score: 0.48; stars gained: +2; forks gained: +0.
Language: Python
Topics: ai-vtuber, ai-waifu, deepl, openai, speech-recognition, speech-synthesis
The agent that grows with you
GitHub repository with 182,513 stars and 31,295 forks.
Trending score: 5.95; stars gained: +1,867; forks gained: +361.
Language: Python
Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
GitHub repository with 14,053 stars and 885 forks.
Trending score: 5.69; stars gained: +2,829; forks gained: +175.
Language: Python
Topics: agent, ai, anthropic, compression, context-engineering, context-window
Academic Research Skills for Claude Code: research → write → review → revise → finalize
GitHub repository with 27,616 stars and 2,272 forks.
Trending score: 5.52; stars gained: +1,079; forks gained: +89.
Language: Python
Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review
Learn it. Build it. Ship it for others.
GitHub repository with 28,711 stars and 4,695 forks.
Trending score: 5.32; stars gained: +1,261; forks gained: +238.
Language: Python
Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course
An opinionated list of Python frameworks, libraries, tools, and resources
GitHub repository with 301,435 stars and 28,046 forks.
Trending score: 4.60; stars gained: +518; forks gained: +24.
Language: Python
Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools
Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)
GitHub repository with 32,540 stars and 4,942 forks.
Trending score: 4.56; stars gained: +467; forks gained: +82.
Language: Python
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
GitHub repository with 161,319 stars and 33,421 forks.
Trending score: 3.69; stars gained: +78; forks gained: +27.
Language: Python
Topics: audio, deep-learning, deepseek, gemma, glm, hacktoberfest
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
GitHub repository with 16,750 stars and 1,720 forks.
Trending score: 1.93; stars gained: +56; forks gained: +2.
Language: Python
Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection
An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.
GitHub repository with 182 stars and 7 forks.
Trending score: 1.58; stars gained: +40; forks gained: +5.
Language: Python
Topics: asr, llm, sd, sdr, speech-recognition
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
GitHub repository with 7,193 stars and 619 forks.
Trending score: 1.07; stars gained: +11; forks gained: +3.
Language: Python
Topics: apple-silicon, audio-processing, mlx, multimodal, speech-recognition, speech-synthesis
Rust CLI for AI subtitle workflows: transcribe, segment, translate, evaluate, and burn or mux subtitles.
GitHub repository with 70 stars and 7 forks.
Trending score: 0.82; stars gained: +6; forks gained: +0.
Language: Rust
Topics: cli, faster-whisper, ffmpeg, llm, openai, rust
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
GitHub repository with 1,490 stars and 64 forks.
Trending score: 0.76; stars gained: +5; forks gained: +1.
Language: C++
Topics: asr, sailfishos, stt, tts, flatpak-applications, linux-desktop