vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

GitHub repository with 5,127 stars and 1,107 forks.

Language: Python

Topics: audio-generation, diffusion, image-generation, inference, model-serving, multimodal, pytorch, transformer, video-generation

Open provider repository

24h trend summary

Trending score 2.65, activity score 0.05, stars gained +19, forks gained +9.

Latest metric snapshot

2026-06-13: 5,127 stars and 1,107 forks.

Similar repositories

1. vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

GitHub repository with 5,127 stars and 1,107 forks.

Trending score: 2.65; stars gained: +19; forks gained: +9.

Language: Python

Topics: audio-generation, diffusion, image-generation, inference, model-serving, multimodal
2. fluxions-ai/vui

Real-time voice assistant — WebRTC streaming, faster-whisper ASR, local LLM, Vui Nano (300M) TTS. OpenAI Realtime API compatible. Voice cloning, barge-in, ~9× realtime on a 4090. Apache 2.0.

GitHub repository with 700 stars and 72 forks.

Trending score: 1.18; stars gained: +2; forks gained: +0.

Language: Python

Topics: audio-generation, conversational-ai, edge-ai, lightweight, llama, multi-speaker
3. thxxx/VTS

Voice-to-sound SFX generation from a vocal sketch and text prompt.

GitHub repository with 32 stars and 3 forks.

Trending score: 1.12; stars gained: +4; forks gained: +2.

Language: Python

Topics: audio-generation, diffusion-model, pytorch, sound-effects, text-conditioned, voice-conditioned
4. xiaomi-research/controlfoley

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

GitHub repository with 134 stars and 3 forks.

Trending score: 0.48; stars gained: +2; forks gained: +0.

Language: Python

Topics: audio-controlled-video-to-audio, audio-generation, foley-art, foley-sound-synthesis, text-controlled-video-to-audio, text-to-audio
5. light-and-ray/Minimalistic-Comfy-Wrapper-WebUI

MCWW: Additional non-node based UI for ComfyUI focused on inference. Stable UI states; presets; and advanced queue. Based on Gradio

GitHub repository with 129 stars and 9 forks.

Trending score: 0.05; stars gained: +0; forks gained: +0.

Language: Python

Topics: ai, comfyui, comfyui-nodes, gradio, artificial-intelligence, generative-ai

Trending in Python

1. mvanhorn/last30days-skill

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

GitHub repository with 40,614 stars and 3,271 forks.

Trending score: 5.82; stars gained: +1,312; forks gained: +87.

Language: Python
2. chopratejas/headroom

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

GitHub repository with 24,986 stars and 1,636 forks.

Trending score: 5.73; stars gained: +2,844; forks gained: +202.

Language: Python

Topics: agent, ai, anthropic, claude-code, compression, context-engineering
3. pewdiepie-archdaemon/odysseus

Self-hosted AI workspace.

GitHub repository with 69,665 stars and 8,819 forks.

Trending score: 5.70; stars gained: +951; forks gained: +165.

Language: Python
4. NousResearch/hermes-agent

The agent that grows with you

GitHub repository with 192,327 stars and 33,531 forks.

Trending score: 5.48; stars gained: +990; forks gained: +282.

Language: Python

Topics: ai, ai-agent, ai-agents, anthropic, chatgpt, claude
5. safishamsi/graphify

AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.

GitHub repository with 66,467 stars and 6,719 forks.

Trending score: 5.25; stars gained: +1,314; forks gained: +109.

Language: Python

Topics: antigravity, claude-code, codex, gemini, graphrag, knowledge-graph
6. hugohe3/ppt-master

AI generates a real, editable PowerPoint from any document — native shapes & animations, speaker notes voiced as audio narration, and the option to follow your own .pptx template, not slide images · by Hugo He

GitHub repository with 27,112 stars and 2,418 forks.

Trending score: 5.10; stars gained: +903; forks gained: +61.

Language: Python

Topics: ai-agent, aippt, office, powerpoint, powerpoint-generation, ppt

vllm-project/vllm-omni

24h trend summary

Latest metric snapshot

Similar repositories

Trending in Python

Trending topic: audio-generation