Soul-AILab/SoulX-Transcriber

An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

GitHub repository with 248 stars and 11 forks.

Language: Python

Topics: asr, llm, sd, sdr, speech-recognition

Open provider repository

24h trend summary

Trending score 1.51, freshness score 0.00, stars gained +2, forks gained +1.

Latest metric snapshot

2026-06-15: 248 stars and 11 forks.

Similar repositories

  1. 1. worldwonderer/video-recap-skills

    Turn any video into a narration recap with claude code skill|用claude code skill把任何视频剪辑成中文解说视频,支持剪映导出

    GitHub repository with 254 stars and 48 forks.

    Trending score: 3.75; stars gained: +100; forks gained: +20.

    Language: Python

    Topics: ai-agent, asr, claude-code, claude-code-plugin, claude-skills, ffmpeg

  2. 2. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 248 stars and 11 forks.

    Trending score: 1.51; stars gained: +2; forks gained: +1.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  3. 3. langswap-app/langswap

    Self-hosted AI video dubbing with ASR, translation, voice cloning, subtitles, and local GPU inference.

    GitHub repository with 27 stars and 2 forks.

    Trending score: 1.18; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: ai-dubbing, asr, f5-tts, gradio, local-ai, qwen

  4. 4. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 991 stars and 64 forks.

    Trending score: 1.14; stars gained: +2; forks gained: +1.

    Language: Python

    Topics: asr, robust

  5. 5. rcspam/dictee

    Push-to-talk voice dictation for Linux — 100% local, multilingual (25+ languages), with speaker diarization. Qt frontend, Rust backend on NVIDIA Parakeet via ONNX Runtime. KDE Plasmoid integred.

    GitHub repository with 31 stars and 2 forks.

    Trending score: 1.02; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: asr, linux, nvidia, parakeet, push-to-talk, rust

  6. 6. karamouche/noisekit

    Generate degraded speech datasets for noise-robust ASR benchmarking

    GitHub repository with 39 stars and 0 forks.

    Trending score: 0.70; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: asr, audio, audiomentations, benchmark, cli, dataset-generation

Trending in Python

  1. 1. harry0703/MoneyPrinterTurbo

    利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

    GitHub repository with 88,172 stars and 12,648 forks.

    Trending score: 6.02; stars gained: +1,097; forks gained: +218.

    Language: Python

    Topics: shortvideo, automation, chatgpt, moviepy, python, tiktok

  2. 2. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 71,541 stars and 9,128 forks.

    Trending score: 5.98; stars gained: +834; forks gained: +140.

    Language: Python

  3. 3. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 194,238 stars and 34,023 forks.

    Trending score: 5.92; stars gained: +753; forks gained: +209.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  4. 4. NVIDIA/SkillSpector

    Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.

    GitHub repository with 5,962 stars and 441 forks.

    Trending score: 5.61; stars gained: +874; forks gained: +76.

    Language: Python

  5. 5. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 32,676 stars and 5,366 forks.

    Trending score: 5.59; stars gained: +762; forks gained: +135.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  6. 6. Agents365-ai/drawio-skill

    Generate draw.io diagrams from natural language — 6 presets, vision self-check + up to 5-round refinement, codebase-to-diagram, 10,000+ official shapes & 321 AI/LLM brand logos. Exports PNG/SVG/PDF/JPG.

    GitHub repository with 3,445 stars and 240 forks.

    Trending score: 5.51; stars gained: +1,369; forks gained: +113.

    Language: Python

    Topics: agent-skill, agent-skills, architecture-diagram, claude-code, claude-code-skill, claude-skills

Trending topic: asr

  1. 1. worldwonderer/video-recap-skills

    Turn any video into a narration recap with claude code skill|用claude code skill把任何视频剪辑成中文解说视频,支持剪映导出

    GitHub repository with 254 stars and 48 forks.

    Trending score: 3.75; stars gained: +100; forks gained: +20.

    Language: Python

    Topics: ai-agent, asr, claude-code, claude-code-plugin, claude-skills, ffmpeg

  2. 2. Open-Less/openless

    Hold a key, speak, release — AI-polished text appears at your cursor in any app. Open-source voice input for macOS & Windows. (按住快捷键说话,松开即得润色后的文字)

    GitHub repository with 2,341 stars and 191 forks.

    Trending score: 3.14; stars gained: +39; forks gained: +1.

    Language: HTML

    Topics: ai-prompt, asr, dictation, linux, llm, macos

  3. 3. Kieirra/murmure

    Fully local, private and cross platform Speech-to-Text with LLM Post-processing

    GitHub repository with 866 stars and 85 forks.

    Trending score: 1.55; stars gained: +4; forks gained: -1.

    Language: TypeScript

    Topics: privacy, speech-to-text, asr, asr-model, debian-packages, linux

  4. 4. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 248 stars and 11 forks.

    Trending score: 1.51; stars gained: +2; forks gained: +1.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  5. 5. langswap-app/langswap

    Self-hosted AI video dubbing with ASR, translation, voice cloning, subtitles, and local GPU inference.

    GitHub repository with 27 stars and 2 forks.

    Trending score: 1.18; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: ai-dubbing, asr, f5-tts, gradio, local-ai, qwen

  6. 6. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 991 stars and 64 forks.

    Trending score: 1.14; stars gained: +2; forks gained: +1.

    Language: Python

    Topics: asr, robust