MAC-AutoML/Trainingfree-LLM-Orchestration

A training-free orchestration framework for building interactive omni-modal assistants by composing off-the-shelf modality experts, explicit LLM routing, text-centric cross-modal memory, and interruption-aware streaming interaction.

GitHub repository with 11 stars and 2 forks.

Language: Python

Topics: asr, llm, omni, trainingfree, video-audio

Open provider repository

24h trend summary

Trending score 0.29, activity score 0.00, stars gained +1, forks gained +0.

Latest metric snapshot

2026-06-05: 11 stars and 2 forks.

Similar repositories

  1. 1. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 955 stars and 61 forks.

    Trending score: 2.72; stars gained: +33; forks gained: -1.

    Language: Python

    Topics: asr, robust

  2. 2. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  3. 3. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  4. 4. karamouche/noisekit

    Generate degraded speech datasets for noise-robust ASR benchmarking

    GitHub repository with 15 stars and 0 forks.

    Trending score: 0.50; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: asr, audio, audiomentations, benchmark, cli, dataset-generation

  5. 5. myuan19/voiceInput

    Windows AI 语音输入🎙 — 按快捷键说话即输入,支持润色。摆脱打字限制,实现无拘束、高效率的表达。

    GitHub repository with 36 stars and 7 forks.

    Trending score: 0.40; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: asr, dashscope, productivity, pyqt6, python, qwen-asr

  6. 6. MAC-AutoML/Trainingfree-LLM-Orchestration

    A training-free orchestration framework for building interactive omni-modal assistants by composing off-the-shelf modality experts, explicit LLM routing, text-centric cross-modal memory, and interruption-aware streaming interaction.

    GitHub repository with 11 stars and 2 forks.

    Trending score: 0.29; stars gained: +1; forks gained: +0.

    Language: Python

    Topics: asr, llm, omni, trainingfree, video-audio

Trending in Python

  1. 1. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 181,771 stars and 31,186 forks.

    Trending score: 5.95; stars gained: +1,867; forks gained: +361.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  2. 2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 13,361 stars and 853 forks.

    Trending score: 5.69; stars gained: +2,829; forks gained: +175.

    Language: Python

    Topics: agent, ai, anthropic, compression, context-engineering, context-window

  3. 3. Imbad0202/academic-research-skills

    Academic Research Skills for Claude Code: research → write → review → revise → finalize

    GitHub repository with 27,484 stars and 2,256 forks.

    Trending score: 5.52; stars gained: +1,079; forks gained: +89.

    Language: Python

    Topics: academic-pipeline, academic-writing, ai-research, claude, claude-code, literature-review

  4. 4. anthropics/financial-services

    GitHub repository with 30,029 stars and 4,231 forks.

    Trending score: 4.88; stars gained: +688; forks gained: +114.

    Language: Python

  5. 5. virgiliojr94/book-to-skill

    Turn any technical book PDF into a Claude Code skill — ready to study, reference, and use while you work.

    GitHub repository with 4,250 stars and 534 forks.

    Trending score: 4.88; stars gained: +476; forks gained: +68.

    Language: Python

  6. 6. vinta/awesome-python

    An opinionated list of Python frameworks, libraries, tools, and resources

    GitHub repository with 301,371 stars and 28,044 forks.

    Trending score: 4.60; stars gained: +518; forks gained: +24.

    Language: Python

    Topics: awesome, python, collections, python-frameworks, python-libraries, python-tools

Trending topic: asr

  1. 1. Open-Less/openless

    Hold a key, speak, release — AI-polished text appears at your cursor in any app. Open-source voice input for macOS & Windows. (按住快捷键说话,松开即得润色后的文字)

    GitHub repository with 2,136 stars and 164 forks.

    Trending score: 2.75; stars gained: +25; forks gained: +1.

    Language: HTML

    Topics: ai-prompt, asr, dictation, linux, llm, macos

  2. 2. xzf-thu/Mega-ASR

    First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come back to MEGA-ASR, after the rest fail in the wild. ⭐**

    GitHub repository with 955 stars and 61 forks.

    Trending score: 2.72; stars gained: +33; forks gained: -1.

    Language: Python

    Topics: asr, robust

  3. 3. modelscope/FunASR

    Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

    GitHub repository with 16,750 stars and 1,720 forks.

    Trending score: 1.93; stars gained: +56; forks gained: +2.

    Language: Python

    Topics: pytorch, speech-recognition, paraformer, punctuation, speaker-diarization, voice-activity-detection

  4. 4. Soul-AILab/SoulX-Transcriber

    An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.

    GitHub repository with 182 stars and 7 forks.

    Trending score: 1.58; stars gained: +40; forks gained: +5.

    Language: Python

    Topics: asr, llm, sd, sdr, speech-recognition

  5. 5. k2-fsa/sherpa-onnx

    Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

    GitHub repository with 12,726 stars and 1,451 forks.

    Trending score: 1.41; stars gained: +29; forks gained: +1.

    Language: C++

    Topics: asr, onnx, windows, linux, macos, cpp

  6. 6. soniqo/speech-swift

    AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

    GitHub repository with 784 stars and 101 forks.

    Trending score: 1.25; stars gained: +2; forks gained: +0.

    Language: Swift

    Topics: apple-silicon, asr, coreml, ios, macos, mlx