linagora-labs/ssak

SSAK contains helpers and tools to process data and train/infer ASR models.

GitHub repository with 5 stars and 0 forks.

Language: Python

Topics: asr, data-processing, kaldi, machine-learning, nemo, speech-recognition, speech-to-text, toolkit, whisper

Open provider repository

Latest metric snapshot

2026-06-05: 5 stars and 0 forks.

Similar repositories

  1. 1. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 957 stars and 61 forks.

    Trending score: 2.72; stars gained: +33; forks gained: -1.

    Language: Python

    Topics: asr, robust

  2. 2. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  3. 3. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  4. 4. karamouche/noisekit

    Generate degraded speech datasets for noise-robust ASR benchmarking

    GitHub repository with 15 stars and 0 forks.

    Trending score: 0.50; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: asr, audio, audiomentations, benchmark, cli, dataset-generation

  5. 5. MAC-AutoML/Trainingfree-LLM-Orchestration

    A training-free orchestration framework for building interactive omni-modal assistants by composing off-the-shelf modality experts, explicit LLM routing, text-centric cross-modal memory, and interruption-aware streaming interaction.

    GitHub repository with 11 stars and 2 forks.

    Trending score: 0.29; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: asr, llm, omni, trainingfree, video-audio

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 182,511 stars and 31,295 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 14,053 stars and 885 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. 3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,616 stars and 2,272 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. 4. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 28,711 stars and 4,695 forks.

    Trending score: 5.32; stars gained: +1,261; forks gained: +238.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  5. 5. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,435 stars and 28,046 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, collections, python, python-frameworks, python-libraries, python-tools

  6. 6. Alishahryar1/free-claude-code

    Use claude-code for free in the terminal, VSCode extension or discord like OpenClaw (voice supported)

    GitHub repository with 32,540 stars and 4,942 forks.

    Trending score: 4.56; stars gained: +467; forks gained: +82.

    Language: Python

Trending topic: asr

  1. 1. Open-Less/openless

    Hold a key, speak, release — AI-polished text appears at your cursor in any app. Open-source voice input for macOS & Windows. (按住快捷键说话,松开即得润色后的文字)

    GitHub repository with 2,150 stars and 169 forks.

    Trending score: 2.75; stars gained: +25; forks gained: +1.

    Language: HTML

    Topics: ai-prompt, asr, dictation, linux, llm, macos

  2. 2. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 957 stars and 61 forks.

    Trending score: 2.72; stars gained: +33; forks gained: -1.

    Language: Python

    Topics: asr, robust

  3. 3. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  4. 4. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  5. 5. k2-fsa/sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

    GitHub repository with 12,737 stars and 1,453 forks.

    Trending score: 1.41; stars gained: +29; forks gained: +1.

    Language: C++

    Topics: asr, onnx, windows, linux, macos, cpp

  6. 6. BillLucky/echocut

    Turn raw footage into brand-ready, platform-optimized video with one command. Local-first: FFmpeg + WhisperX/MLX + Ollama.

    GitHub repository with 51 stars and 12 forks.

    Trending score: 0.89; stars gained: +7; forks gained: +1.

    Language: JavaScript

    Topics: asr, captions, cli, ffmpeg, llm, local-first