arashsajjadi/ai-powered-video-analyzer
An offline AI-powered video analysis tool with object detection (YOLO), image captioning (BLIP), speech transcription (Whisper), audio event detection (PANNs), and AI-generated summaries (LLMs via Ollama). It ensures privacy and offline use with a user-friendly GUI.
GitHub repository with 86 stars and 22 forks.
Language: Python
Topics: ai-video-analysis, blip2, gui, image-captioning, image-captioning-ai, llm, object-detection, offline-processing, ollama, ollama-api