vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

GitHub repository with 5,127 stars and 1,107 forks.

Language: Python

Topics: audio-generation, diffusion, image-generation, inference, model-serving, multimodal, pytorch, transformer, video-generation


24h trend summary

Trending score 2.65, activity score 0.05, stars gained +19, forks gained +9.

Latest metric snapshot

2026-06-13: 5,127 stars and 1,107 forks.

Similar repositories

  1. vllm-project/vllm-omni

    A framework for efficient model inference with omni-modality models

    GitHub repository with 5,127 stars and 1,107 forks.

    Trending score: 2.65; stars gained: +19; forks gained: +9.

    Language: Python

    Topics: audio-generation, diffusion, image-generation, inference, model-serving, multimodal

  2. fluxions-ai/vui

    Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.

    GitHub repository with 700 stars and 72 forks.

    Trending score: 1.18; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: audio-generation, conversational-ai, edge-ai, lightweight, llama, multi-speaker

  3. thxxx/VTS

    Voice-to-sound SFX generation from a vocal sketch and text prompt.

    GitHub repository with 32 stars and 3 forks.

    Trending score: 1.12; stars gained: +4; forks gained: +2.

    Language: Python

    Topics: audio-generation, diffusion-model, pytorch, sound-effects, text-conditioned, voice-conditioned

  4. xiaomi-research/controlfoley

    ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

    GitHub repository with 134 stars and 3 forks.

    Trending score: 0.48; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: audio-controlled-video-to-audio, audio-generation, foley-art, foley-sound-synthesis, text-controlled-video-to-audio, text-to-audio

  5. light-and-ray/Minimalistic-Comfy-Wrapper-WebUI

    MCWW: Additional non-node based UI for ComfyUI focused on inference. Stable UI states; presets; and advanced queue. Based on Gradio

    GitHub repository with 129 stars and 9 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai, comfyui, comfyui-nodes, gradio, artificial-intelligence, generative-ai

Trending in Python

  1. mvanhorn/last30days-skill

    AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

    GitHub repository with 40,614 stars and 3,271 forks.

    Trending score: 5.82; stars gained: +1,312; forks gained: +87.

    Language: Python

  2. chopratejas/headroom

    Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

    GitHub repository with 24,986 stars and 1,636 forks.

    Trending score: 5.73; stars gained: +2,844; forks gained: +202.

    Language: Python

    Topics: agent, ai, anthropic, claude-code, compression, context-engineering

  3. pewdiepie-archdaemon/odysseus

    Self-hosted AI workspace.

    GitHub repository with 69,665 stars and 8,819 forks.

    Trending score: 5.70; stars gained: +951; forks gained: +165.

    Language: Python

  4. NousResearch/hermes-agent

    The agent that grows with you

    GitHub repository with 192,327 stars and 33,531 forks.

    Trending score: 5.48; stars gained: +990; forks gained: +282.

    Language: Python

    Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude

  5. safishamsi/graphify

    AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

    GitHub repository with 66,467 stars and 6,719 forks.

    Trending score: 5.25; stars gained: +1,314; forks gained: +109.

    Language: Python

    Topics: antigravity, claude-code, codex, gemini, graphrag, knowledge-graph

  6. hugohe3/ppt-master

    AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images · by Hugo He

    GitHub repository with 27,112 stars and 2,418 forks.

    Trending score: 5.10; stars gained: +903; forks gained: +61.

    Language: Python

    Topics: ai-agent, aippt, office, powerpoint, powerpoint-generation, ppt

Trending topic: audio-generation

  1. vllm-project/vllm-omni

    A framework for efficient model inference with omni-modality models

    GitHub repository with 5,127 stars and 1,107 forks.

    Trending score: 2.65; stars gained: +19; forks gained: +9.

    Language: Python

    Topics: audio-generation, diffusion, image-generation, inference, model-serving, multimodal

  2. fluxions-ai/vui

    Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.

    GitHub repository with 700 stars and 72 forks.

    Trending score: 1.18; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: audio-generation, conversational-ai, edge-ai, lightweight, llama, multi-speaker

  3. thxxx/VTS

    Voice-to-sound SFX generation from a vocal sketch and text prompt.

    GitHub repository with 32 stars and 3 forks.

    Trending score: 1.12; stars gained: +4; forks gained: +2.

    Language: Python

    Topics: audio-generation, diffusion-model, pytorch, sound-effects, text-conditioned, voice-conditioned

  4. xiaomi-research/controlfoley

    ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

    GitHub repository with 134 stars and 3 forks.

    Trending score: 0.48; stars gained: +2; forks gained: +0.

    Language: Python

    Topics: audio-controlled-video-to-audio, audio-generation, foley-art, foley-sound-synthesis, text-controlled-video-to-audio, text-to-audio

  5. light-and-ray/Minimalistic-Comfy-Wrapper-WebUI

    MCWW: Additional non-node based UI for ComfyUI focused on inference. Stable UI states; presets; and advanced queue. Based on Gradio

    GitHub repository with 129 stars and 9 forks.

    Trending score: 0.05; stars gained: +0; forks gained: +0.

    Language: Python

    Topics: ai, comfyui, comfyui-nodes, gradio, artificial-intelligence, generative-ai