gyj1201/zipEnhancer

将 ZipEnhancer 模型从 ModelScope pipeline 中剥离,用纯 PyTorch 实现推理流程,封装为 pip 可安装的降噪库和 FastAPI 批量语音降噪,为 CosyVoice 提供干净的人声输入。

GitHub repository with 80 stars and 3 forks.

Language: Python

Topics: audio-processing, cosyvoice, fastapi, modelscope, noise-suppression, pytorch, speech-enhancement, voice-enhancement, zipenhancer

Open provider repository

24h trend summary

Trending score 0.91, freshness score 0.49, stars gained -1, forks gained +0.

Latest metric snapshot

2026-06-15: 80 stars and 3 forks.

Similar repositories

  1. 1. gyj1201/zipEnhancer

    将 ZipEnhancer 模型从 ModelScope pipeline 中剥离,用纯 PyTorch 实现推理流程,封装为 pip 可安装的降噪库和 FastAPI 批量语音降噪,为 CosyVoice 提供干净的人声输入。

    GitHub repository with 80 stars and 3 forks.

    Trending score: 0.91; stars gained: -1; forks gained: +0.

    Language: Python

    Topics: audio-processing, cosyvoice, fastapi, modelscope, noise-suppression, pytorch

  2. 2. attevon-llc/OpenTranscribe

    Self-hosted AI-powered transcription platform with speaker diarization, search, and collaboration features. Built with Svelte, FastAPI, and Docker for easy deployment.

    GitHub repository with 66 stars and 17 forks.

    Trending score: 0.82; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: ai, audio-processing, docker, fastapi, machine-learning, nlp

  3. 3. ailia-ai/ailia-models

    The collection of pre-trained, state-of-the-art AI models for ailia SDK

    GitHub repository with 2,346 stars and 361 forks.

    Trending score: 0.10; stars gained: -1; forks gained: +0.

    Language: Python

    Topics: action-recognition, anomaly-detection, audio-processing, background-removal, crowd-counting, deep-learning

  4. 4. sw-willie-wu/MediaTranX

    AI-powered local multimedia toolkit — speech-to-text, translation, super-resolution, OCR, source separation, and media transcoding. All AI inference runs locally on your machine.

    GitHub repository with 5 stars and 0 forks.

    Trending score: 0.09; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai, audio-processing, demucs, desktop-app, electron, fastapi

  5. 5. GoodQ02/goodq4all

    Local-first multimodal epistemic memory for scene-level video, audio, and text intelligence.

    GitHub repository with 6 stars and 0 forks.

    Trending score: 0.09; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: audio-processing, computer-vision, epistemic-ai, knowledge-graph, local-first, memory-system

Trending in Python

  1. 1. harry0703/MoneyPrinterTurbo

    利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

    GitHub repository with 88,031 stars and 12,625 forks.

    Trending score: 6.02; stars gained: +1,097; forks gained: +218.

    Language: Python

    Topics: ai, automation, chatgpt, moviepy, python, shortvideo

  2. 2. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 71,433 stars and 9,110 forks.

    Trending score: 5.98; stars gained: +834; forks gained: +140.

    Language: Python

  3. 3. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 194,098 stars and 33,987 forks.

    Trending score: 5.92; stars gained: +753; forks gained: +209.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  4. 4. NVIDIA/SkillSpector

    Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.

    GitHub repository with 5,962 stars and 441 forks.

    Trending score: 5.61; stars gained: +874; forks gained: +76.

    Language: Python

  5. 5. rohitg00/ai-engineering-from-scratch

    Learn it. Build it. Ship it for others.

    GitHub repository with 32,676 stars and 5,366 forks.

    Trending score: 5.59; stars gained: +762; forks gained: +135.

    Language: Python

    Topics: agents, ai, ai-agents, ai-engineering, computer-vision, course

  6. 6. Agents365-ai/drawio-skill

    Generate draw.io diagrams from natural language — 6 presets, vision self-check + up to 5-round refinement, codebase-to-diagram, 10,000+ official shapes & 321 AI/LLM brand logos. Exports PNG/SVG/PDF/JPG.

    GitHub repository with 3,445 stars and 240 forks.

    Trending score: 5.51; stars gained: +1,369; forks gained: +113.

    Language: Python

    Topics: agent-skill, agent-skills, architecture-diagram, claude-code, claude-code-skill, claude-skills

Trending topic: audio-processing

  1. 1. google-ai-edge/mediapipe

    Cross-platform, customizable ML solutions for live and streaming media.

    GitHub repository with 35,631 stars and 6,014 forks.

    Trending score: 2.99; stars gained: +37; forks gained: +1.

    Language: C++

    Topics: android, audio-processing, c-plus-plus, calculator, computer-vision, deep-learning

  2. 2. SimZhou/vscode-audiolens

    Audio inspection & analysis extension for Visual Studio Code

    GitHub repository with 48 stars and 6 forks.

    Trending score: 1.22; stars gained: +1; forks gained: +0.

    Language: TypeScript

    Topics: audio, audio-processing, vscode, vscode-extension

  3. 3. NVIDIA/DALI

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

    GitHub repository with 5,710 stars and 667 forks.

    Trending score: 1.00; stars gained: +2; forks gained: -1.

    Language: C++

    Topics: fast-data-pipeline, image-augmentation, data-augmentation, image-processing, data-processing, deep-learning

  4. 4. BillyDM/awesome-audio-dsp

    My curated list of audio DSP and plugin development resources (Github fork)

    GitHub repository with 1,362 stars and 78 forks.

    Trending score: 0.93; stars gained: +2; forks gained: +0.

    Topics: dsp, audio, math, algorithms, vst, rust

  5. 5. gyj1201/zipEnhancer

    将 ZipEnhancer 模型从 ModelScope pipeline 中剥离,用纯 PyTorch 实现推理流程,封装为 pip 可安装的降噪库和 FastAPI 批量语音降噪,为 CosyVoice 提供干净的人声输入。

    GitHub repository with 80 stars and 3 forks.

    Trending score: 0.91; stars gained: -1; forks gained: +0.

    Language: Python

    Topics: audio-processing, cosyvoice, fastapi, modelscope, noise-suppression, pytorch

  6. 6. VoiceBlender/voiceblender

    A programmable voice platform: SIP and WebRTC call control, multi-party mixing, recording, TTS/STT, and pluggable AI agents (ElevenLabs, VAPI, Pipecat, Deepgram) — all driven through a REST API, webhooks, and a WebSocket event stream

    GitHub repository with 81 stars and 8 forks.

    Trending score: 0.85; stars gained: +1; forks gained: -1.

    Language: Go

    Topics: ai-agents, api, asr, audio-processing, codec, livekit